AlexsLemonade / reproducible-research

Repository for Reproducible Research Practices Training Workshop

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Consider small fastq files

sjspielman opened this issue · comments

Related to #101
Rather than only paying attention to timing of the FASTQ activity, we can also use different FASTQ files that are smaller in the first place. One approach for this could be to simply subset the FASTQs we've identified and keep those in an S3 bucket on our end. During training, we'd want to explain to participants that:

  • They can still curl from SRA! We'll want to show them the link and command to do so.
  • The only reason we using a subsetted version in our AWS data storage is for timing needs during training. Internet speed is unreliable!