PolITiGenomics

Politics, Information Technology, and Genomics

Next-Generation Sequencing Informatics

Below is a table with informatics and IT statistics for the major next-generation/massively parallel sequencing platforms. The information in the table is approximate and should only be used for general, informational purposes.

Next-Generation Sequencing Statistics

Vendor: Roche Illumina ABI
Technology: 454 Solexa SOLiD
Platform: GS 20 FLX Ti GA GA II 1 2
Reads: (M) 0.5 0.5 1 28 100 40 115
Fragment
Read length: 100 200 350 35 50 75 25 35
Run time: (d) 0.25 0.3 0.4 3 3 4.5 6 5
Yield: (Gb) 0.05 0.1 0.4 1 5 7.5 1 4
Rate: (Gb/d) 0.2 0.33 1 0.33 1.67 1.67 0.34 1.6
Images: (TB) 0.01 0.01 0.03 0.5 1.1 1.7 1.8 2.5
PA Disk: (GB) 3 3 15 175 300 350 300 750
PA CPU: (hr) 10 140 220 100 70 100 NA NA
SRA: (GB) 0.5 1 4 30 50 75 100 140
Paired-end
Read length: 200 2×35 2×50 2×75 2×25 2×35
Insert: (kb) 3.5 0.2 0.2 0.2 3 3
Run time: (d) 0.3 6 10 15 12 10
Yield: (Gb) 0.1 2 9 12 2 8
Rate: (Gb/d) 0.33 0.33 1.67 1.67 0.34 1.6
Images: (TB) 0.01 1 2.2 3.4 3.6 5
PA Disk: (GB) 3 350 500 600 600 1500
PA CPU: (hr) 140 160 120 170 NA NA
SRA: (GB) 1 60 100 150 200 280

Notes:

  • Units: B - bytes, b - bases
  • PA is primary analysis (includes image feature extraction and base calling)
  • PA CPU is calculated as the wall clock multiplied by the number of CPU cores
  • ABI SOLiD data, except rate, are representative of a single slide
  • ABI SOLiD primary analysis is done on the instrument cluster
  • SRA is the size of the files (SFF or SRF) that are submitted to the NCBI Short Read Archive