Differences

This shows you the differences between two versions of the page.

--- general:bioseqanalysis:intro [2019/01/21 20:04] – created ingo
+++ general:bioseqanalysis:intro [2019/01/21 20:22] (current) – [In silico characterization] ingo
@@ Line 10: / Line 10: @@
 ===== In silico characterization =====
-The //in silico// way to learn about what a gene might be doing is typically considered the easier one. We use the computer to tentatively assign a certain activity to the gene product, based on the results of various algorithms for biological sequence analysis. One of the most widely used evidence is the sharing of a significant sequence similarity to either an already functionally characterized gene product, or to statistical models that represent conserved regions in aligned sequences. In both cases, this similarity is interpreted as a signal that the sequences share a common ancestry, i.e. are homologous.
+The //in silico// way to learn about what a gene might be doing is typically considered the easier one. We use the computer to tentatively assign a certain activity to the gene product, based on the results of various algorithms for biological sequence analysis. One of the most widely used evidence is the sharing of a significant sequence similarity to either an already functionally characterized gene product, or to statistical models that represent conserved regions in aligned sequences. In both cases, this similarity is interpreted as a signal that the sequences share a common ancestry, i.e. are homologous.\\
+Basically, in silico analyses of gene function boil down to three general approaches
+  * The identification of functionally annotated sequences displaying a significant sequence similarity. The most common tool to do this is [[https://blast.ncbi.nlm.nih.gov|BLAST]].
+  * The identification of evolutionary conserved subsequences -sometimes referred to as //domains// - in the sequence of interest. One of the most widely used tools/databases is [[general:bioseqanalysis|Pfam]]
+  * The identification of short motifs in a sequence. Note, contrary to the other two approaches, a motif search typically does not rely on the inference of an evolutionary relationship((Motifs are short sub-sequences of DNA or a protein of defined length, e.g. the start codon //**ATG**// or the canonical splice donor and splice acceptor sites GT-AG. Due to their short length, it is feasible to assume that they can emerge independently more than once in the course of evolution.))
 ===== Integrative approaches =====
 Nowadays, experimental approaches, and those that are computer-based are far from being mutually exclusive. People rarely enter the lab without a prior idea about what a certain gene product could do. And likewise, //in silico// analyses on the computer heavily depend on the amount of existing information that can be exploited in the process annotating the function of an unknown gene product. In essence, annotating a novel gene product is equivalent to connecting it to the network of existing information about gene product function. Information is transferred, rather than generated //de novo//. If there is no network, then no connection can be established, no annotation transfer is possible, and the function of the gene product remains unknown.

Tools

menus and quick search

quick search

site status

Page Tools

meta data for this page

Differences