We blog about genomics. We also make a platform for open-source analysis of next generation data in the cloud. Hello.

Gibbon genome reveals new insights into mechanisms of primate chromosomal evolution

The gibbon now joins the ranks of nearly 100 mammals with genome assemblies deposited into NCBI’s genome database. – Gracing the cover of today’s issue of Nature, the sequencing and initial analysis of the gibbon genome represents a continuation of efforts, directed by the National Human Genome Research Institute (NHGRI), to sequence …

Written by Devin

Developing an open standard for reproducible genomics pipelines

July was an exciting month for Seven Bridges Software Engineer Boysha Tijanic, who traveled from our Belgrade, Serbia office to give a presentation at ISMB 2014. In his talk, Boysha discussed the importance of transparency and thoroughness when it comes to sharing data and highlighted the complexity of bioinformatic workflows …

Written by carol

De novo transcriptome assembly with RNA-seq data: Using Trinity to examine Caribbean Millepora

Despite the abundance of the hydrocoral genus Millepora, which consists of 19 species distributed in warm waters around the globe, the milleporids have received limited attention in coral reef studies. However, Millepora act as an important component of the reef framework and thus are geologically important. We recently hosted a …

Written by carol

Ion Proton RNA-Seq: in search of the best alignment method

Each sequencing technology comes with a unique and unavoidable error profile due to the chemistry, biology, and hardware involved [1,2,3]. If we want avoid analysis artifacts and arrive safely at the biological reality underlying the data, we must to account for these errors during informatic analysis. Recently, researchers have begun tackling …

Written by Kate Blair

Better sequences (and fewer homopolymer errors) for Ion Torrent

The short read files that Ion Torrent’s sequencing machines give us still contain many homopolymer errors: errors in the number of bases called when a single nucleotide occurs more than once in sequence. This makes alignment harder and drowns real indels in a sea of noise. These homopolymer errors arise …

Written by Nate

A mystery muscle lincRNA

We know that our whole genome is distributed to (almost) every cell of our bodies. This fact can be used both to surprise introductory biology students and to usefully refine a fundamental scientific question. Instead of merely asking how it comes to be that different parts of our bodies have …

Written by Nate

RNA-Seq Interactive Literature Review

RNA-Seq is fast becoming the top method for examining the transcriptional activity of genomes. In the five years since the first publications described the technology, RNA-Seq has enabled the discovery of new transcripts in well-studied genomes, challenged our views of imprinting, offered insights into the biology of cancer, and transformed new …

Written by Kate Blair

Short read alignment: seeding

In my last post I explained some of the basics of short read alignment algorithms. Go read it if you like; if not, recall that: Many modern alignment algorithms rely on what is called seeding and extending. “Seeding” is finding exact matches of part of the read with part of the …

Written by Nate

Short read alignment: an introduction

Biologists today often find themselves with lots of–say, 10^6–short sequences of DNA from a sample, and their ability to do scientifically useful things with those sequences depends on their ability to align those sequences to a reference sequence. Many of the hard and important projects in genomics either are alignment …

Written by Nate

We are always engaged in research and development, working to build the future of genomics, science, and health. Let's work together. We'd love to hear about your projects and challenges, so drop us a line.

get in touch