meta data for this page
Differences
This shows you the differences between two versions of the page.
Previous revision | |||
— | general:molmet:sequencequalities [Unknown date] (current) – external edit (Unknown date) 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ====== Base quality information ====== | ||
+ | |||
+ | All DNA sequencing approaches are similar in a way that the sequencer has to interpret a signal that indicates the presence of a given nucleotide at a given position. The resulting raw data serves then as input into a software, the // | ||
+ | |||
+ | The base calling is obviously not error free, as some signals suffer from a lot of noise (Trace A in figure {{ref> | ||
+ | |||
+ | <figure BQ> | ||
+ | {{general: | ||
+ | < | ||
+ | |||
+ | </ | ||
+ | |||
+ | |||
+ | ===== Base quality file formats ===== | ||
+ | |||
+ | Base qualities are traditionally represented as the negative decadic logarithm of the error probability of a base call. | ||
+ | |||
+ | * Q=10 represents an error probability of one in 10 | ||
+ | * Q=20 represents an error probability of one in 100 | ||
+ | * Q=30 represents and error probability of one in 1000 | ||
+ | |||
+ | Base qualities for individual sequencing reads typically range between 0 and maximally 40, although some technologies slightly differ in the range (Figure {{ref> | ||
+ | |||
+ | Traditionally, | ||
+ | |||
+ | <figure ASCII> | ||
+ | {{: | ||
+ | < | ||
+ | </ | ||
+ | |||
+ | ===== Relevance ===== | ||
+ | |||
+ | ==== Genome sequencing ==== | ||
+ | |||
+ | Quality values used to be relevant for all applications of DNA sequencing. Nowadays, however, DNA sequencing has become cheap to an extent that read coverage, i.e. the number of reads covering a nucleotide in your template sequence, has taken away quite a bit of their importance, when it comes to the de-novo sequencing of genomes. | ||
+ | |||
+ | ==== Transcriptomics / Genotyping ==== | ||
+ | |||
+ | When it comes to applications where a uniformly high coverage is not guaranteed, e.g. when sequencing transcriptomes, | ||
+ | |||
+ | ===== Storing base quality information ===== | ||
+ | |||
+ | Base quality values are stored differently, | ||
+ | |||
+ | The [[wp> | ||
+ | |||
+ | The [[wp> | ||
+ | |||
+ | <figure FQ> | ||
+ | |||
+ | [{{: | ||
+ | |||
+ | < | ||
+ | |||