Daniel S. Standage


Streaming data from the SRA with fastq-dump

Permalink: 2017-01-24 by Daniel S. Standage in blog tags: sra streaming ngs

NCBI's Sequence Read Archive (SRA) is the go-to repository for published genome-scale sequence data sets. Although there are a variety of ways to download sequence data from the SRA, the fastq-dump command from the SRA Toolkit is, in my opinion, the most convenient. In fact, with a few tweaked settings, fastq-dump can stream data directly from the SRA into an analysis pipeline …
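
To give a rough idea of what the receiving end of such a stream might look like, here is a minimal sketch of a FASTQ reader that consumes records lazily from any iterable of lines. The names `FastqRecord` and `parse_fastq` are my own illustrative choices, not part of the SRA Toolkit, and the sketch assumes simple four-line FASTQ records with no wrapped sequence lines.

```python
import io
from typing import Iterable, Iterator, NamedTuple


class FastqRecord(NamedTuple):
    """One read: name (without the leading '@'), bases, and quality string."""
    name: str
    sequence: str
    quality: str


def parse_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield one record per four-line FASTQ entry, reading lazily.

    Assumption: each record is exactly four lines (header, sequence,
    '+' separator, quality); multi-line sequences are not handled.
    """
    it = iter(lines)
    for header in it:
        header = header.rstrip("\n")
        if not header:
            continue  # tolerate blank lines between records
        if not header.startswith("@"):
            raise ValueError(f"expected '@' header, got: {header!r}")
        try:
            sequence = next(it).rstrip("\n")
            next(it)  # '+' separator line, ignored
            quality = next(it).rstrip("\n")
        except StopIteration:
            raise ValueError("truncated FASTQ record") from None
        yield FastqRecord(header[1:], sequence, quality)


# Small in-memory stand-in for a stream of reads.
demo = io.StringIO("@read1\nACGT\n+\nIIII\n@read2\nGGCC\n+\nHHHH\n")
records = list(parse_fastq(demo))
```

In a real pipeline the same function could be pointed at `sys.stdin` instead of the in-memory demo, with fastq-dump writing to standard output via its `-Z` (`--stdout`) option and the shell pipe connecting the two. Because the parser is a generator, only one record needs to be held in memory at a time.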


Composing generator functions in Python

Permalink: 2016-11-23 by Daniel S. Standage in notebook tags: python generators streaming

In which I briefly motivate the utility of generator functions and demonstrate that they can be nested to create a data processing stream.
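
As a small illustration of the idea (a sketch of my own, not the code from the post itself), three generator functions can be nested so that each value flows through the whole chain one at a time. The function names `read_values`, `keep_even`, and `squared` are hypothetical and chosen only for this example.

```python
from typing import Iterable, Iterator


def read_values(lines: Iterable[str]) -> Iterator[int]:
    """Parse one integer per non-blank line."""
    for line in lines:
        line = line.strip()
        if line:
            yield int(line)


def keep_even(values: Iterable[int]) -> Iterator[int]:
    """Pass through only the even values."""
    for value in values:
        if value % 2 == 0:
            yield value


def squared(values: Iterable[int]) -> Iterator[int]:
    """Square each value as it arrives."""
    for value in values:
        yield value * value


# A list of lines stands in for a file or other input stream.
raw = ["1\n", "2\n", "\n", "3\n", "4\n"]

# Nest the generators: nothing is computed until the result is consumed.
pipeline = squared(keep_even(read_values(raw)))
result = list(pipeline)  # [4, 16]
```

Each stage pulls a single item from the stage it wraps, so the full input never has to be loaded at once, and stages can be reordered or swapped out without touching the others.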
