How Much Data From a Sequencing Run?

June 25th, 2025
bio, nao
Cross-posted from my NAO Notebook.

Manufacturers often give optimistic estimates for how much data their systems produce, but performance in practice can be pretty different. What have we seen at the NAO?

We've worked with two main sample types, wastewater and nasal swabs, but Simon already wrote something up on swabs so I'm just going to look at wastewater here.

We've sequenced with both Illumina and Oxford Nanopore (ONT), and the two main flow cells we've used are the:

  • Illumina NovaSeq X 25B, at 2x150. List price is $16,480, though we've seen groups negotiate discounts in the range of 15%. Marketed as producing a maximum of "52 billion single reads" (26B read pairs, 8,000 Gbp).

  • ONT PromethION. List price is $900 in bulk, compared to $600 for the smaller MinION flow cell. Since the PromethION flow cells have 5x the pores (2,675 vs 512) and correspondingly higher output, I think most folks who are using MinION should be considering switching over to the PromethION. Marketed as producing ">200 Gb on metagenomic samples".

Ignoring library prep consumables, labor, lab space, and machine cost, and looking just at the flow cells, if we took the manufacturers' output claims at face value this would be:

  • Illumina 25B: $2.06/Gbp ($16,480 / 8,000 Gbp)
  • ONT PromethION: $4.50/Gbp ($900 / 200 Gbp)
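The face-value arithmetic above is just list price divided by advertised output. Here is a minimal Python sketch of it; the function name `cost_per_gbp` and the dictionary layout are illustrative choices, and the numbers are the list prices and marketed outputs quoted above:

```python
def cost_per_gbp(flow_cell_price_usd: float, output_gbp: float) -> float:
    """Flow-cell cost per gigabase pair, ignoring all other costs."""
    return flow_cell_price_usd / output_gbp


# (list price in USD, advertised output in Gbp), as quoted in the post
advertised = {
    "Illumina 25B": (16_480, 8_000),
    "ONT PromethION": (900, 200),
}

for name, (price, gbp) in advertised.items():
    print(f"{name}: ${cost_per_gbp(price, gbp):.2f}/Gbp")
```

Swapping in a negotiated price (for example, the ~15% discount mentioned above) only changes the first argument.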

With 25B flow cells we've generally seen output meeting or exceeding the advertised 26B read pairs (8,000 Gbp). In our sequencing at BCL we've averaged 29.4B read pairs per flow cell (8,800 Gbp; n=37), while recently MU has been averaging 27.2B read pairs (8,200 Gbp; n=4, older runs were ~20% lower). It's great to be getting more than we expected!
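The Gbp figures in parentheses follow from the read-pair counts: at 2x150 each pair contributes 300 raw bases. A small sketch of that conversion, with `read_pairs_to_gbp` as a hypothetical helper name:

```python
def read_pairs_to_gbp(read_pairs: float, read_length_bp: int = 150) -> float:
    """Raw gigabase pairs from a paired-end run: two reads per pair."""
    return read_pairs * 2 * read_length_bp / 1e9


# Read-pair counts quoted in the post
runs = [
    ("advertised", 26e9),
    ("BCL average", 29.4e9),
    ("MU recent average", 27.2e9),
]

for label, pairs in runs:
    print(f"{label}: {read_pairs_to_gbp(pairs):,.0f} Gbp")
```

This gives 7,800, 8,820, and 8,160 Gbp, which the post rounds to 8,000, 8,800, and 8,200.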

On the other hand, with PromethION flow cells we've generally seen just 3.3 Gbp (n=25) on wastewater. This is slightly higher than the 2.5 Gbp we've seen with nasal swabs, but still far below 200 Gbp. We don't know why our output is so much lower than advertised, but this is what we've seen so far.

This would give us:

  • Illumina 25B: $1.87/Gbp ($16,480 / 8,800 Gbp)
  • ONT PromethION: $272/Gbp ($900 / 3.3 Gbp)

We're still not done, though: while this is correct in terms of raw bases coming off the sequencer, with paired-end sequencing on short fragments like we have in wastewater a portion of many of your reads will be adapters. We see a median of 170bp after adapter trimming, out of an initial 300bp, which means we only retain ~57% of the raw bases (170/300). Accounting for this, we have:

  • Illumina 25B: $3.30/Gbp ($16,480 / 8,800 Gbp / (170/300))
  • ONT PromethION: $272/Gbp ($900 / 3.3 Gbp)
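To make the adapter adjustment explicit, here is a sketch that divides the flow-cell price by usable rather than raw output. It uses the 170bp-of-300bp figure directly instead of a rounded percentage, and, following the post, applies no adjustment to ONT. The name `cost_per_usable_gbp` and the `usable_fraction` parameter are illustrative:

```python
def cost_per_usable_gbp(
    price_usd: float, raw_output_gbp: float, usable_fraction: float = 1.0
) -> float:
    """Flow-cell cost per Gbp that survives adapter trimming."""
    return price_usd / (raw_output_gbp * usable_fraction)


# Median 170bp retained out of 300bp of raw paired-end sequence
illumina_usable_fraction = 170 / 300

illumina = cost_per_usable_gbp(16_480, 8_800, illumina_usable_fraction)
# No adapter adjustment is applied to ONT, matching the post
ont = cost_per_usable_gbp(900, 3.3)

print(f"Illumina 25B: ${illumina:.2f}/Gbp")
print(f"ONT PromethION: ${ont:.2f}/Gbp")
```

Treating the median retained length as if it were the mean is a simplification: the true retained fraction depends on the full fragment-length distribution.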

Overall, Illumina is much more cost-effective for us with our current protocols. If we were able to get better results from ONT that would close the gap partially, but with a gap of nearly two orders of magnitude we'd need very large improvements.
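One way to put a number on "very large improvements" is to ask how much output a PromethION flow cell would need to match Illumina's adapter-adjusted cost. This is a rough sketch using only the figures above; `breakeven_output_gbp` is a hypothetical helper, and it inherits the post's simplifications (flow-cell cost only, no adapter loss on ONT):

```python
def breakeven_output_gbp(flow_cell_price_usd: float, target_cost_per_gbp: float) -> float:
    """Output a flow cell must produce to hit a target cost per Gbp."""
    return flow_cell_price_usd / target_cost_per_gbp


# Illumina 25B cost per usable Gbp: price / (raw Gbp * retained fraction)
illumina_cost = 16_480 / (8_800 * 170 / 300)

needed_gbp = breakeven_output_gbp(900, illumina_cost)
observed_gbp = 3.3  # observed PromethION output on wastewater
improvement = needed_gbp / observed_gbp

print(f"Break-even PromethION output: {needed_gbp:.0f} Gbp")
print(f"Required improvement over observed: {improvement:.0f}x")
```

Under these assumptions the break-even output comes out a bit above 270 Gbp per flow cell, roughly 80x what we've observed.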
