Psst! I've posted some details about my notebook setup here.

On genomic interval notation

Intervals are one of the most common data abstractions used in genome informatics, along with strings and graphs. DNA has an intricate dynamic three-dimensional structure, but for many bioinformatics applications we can get away with ignoring this level of detail and representing the molecule instead as a static linear sequence of symbols. Genomic features—such as genes or transposable elements …


Random gene distribution: follow up

I recently sketched out some notes on random placement of genomic intervals. Here I show a more concrete implementation.


Some notes on random gene distribution

I've been looking at statistics of gene spacing, but have not yet compared them to benchmarks of "randomly" distributed genes. Here I sketch out a few rough notes on the topic.
