Complete Genomics
From Just Solve the File Format Problem
Complete Genomics is a company that is involved in DNA sequencing of human genomes. It has its own file format standards for storing such genome sequence information, which use tab delimited data (stored with a .tsv extension), often distributed compressed with bzip2 compression (giving the resulting distribution files a .tsv.bz2 double extension).
File format documentation
Sample files
- Public genome data is downloadable from their FTP site.