SAM
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) (Created page with "{{FormatInfo |formattype=electronic |subcat=Scientific Data formats }} '''SAM''' (Sequence Alignment/Map) is a data format used for mapping sequences of DNA. It is in text (ta...") |
Revision as of 02:08, 4 August 2020
SAM (Sequence Alignment/Map) is a data format used for mapping sequences of DNA. It is in text (tab-separated) form, while A companion format BAM is binary. It is classified as an alignment format, as is CRAM. This is as opposed to sequence-only, unaligned, formats such as FASTA and FASTQ.
Identification
Files will often start with a header line (though this is technically optional), such as @HD VN:1.6 SO:coordinate
, where the VN parameter gives the version number of the format and this may be followed with a SO or G0 parameter giving sorting order or grouping of alignments.