meta data for this page
  •  

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
general:bioseqanalysis:intro [2019/01/21 20:21] – [In silico characterization] ingogeneral:bioseqanalysis:intro [2019/01/21 20:22] (current) – [In silico characterization] ingo
Line 13: Line 13:
 Basically, in silico analyses of gene function boil down to three general approaches Basically, in silico analyses of gene function boil down to three general approaches
   * The identification of functionally annotated sequences displaying a significant sequence similarity. The most common tool to do this is [[https://blast.ncbi.nlm.nih.gov|BLAST]].   * The identification of functionally annotated sequences displaying a significant sequence similarity. The most common tool to do this is [[https://blast.ncbi.nlm.nih.gov|BLAST]].
-  * The identification of evolutionary conserved subsequences -sometimes referred to as //domains// - in the sequence of interest. One of the most widely used tools/databases is [[https://pfam.xfam.org|Pfam]]+  * The identification of evolutionary conserved subsequences -sometimes referred to as //domains// - in the sequence of interest. One of the most widely used tools/databases is [[general:bioseqanalysis|Pfam]]
   * The identification of short motifs in a sequence. Note, contrary to the other two approaches, a motif search typically does not rely on the inference of an evolutionary relationship((Motifs are short sub-sequences of DNA or a protein of defined length, e.g. the start codon //**ATG**// or the canonical splice donor and splice acceptor sites GT-AG. Due to their short length, it is feasible to assume that they can emerge independently more than once in the course of evolution.))     * The identification of short motifs in a sequence. Note, contrary to the other two approaches, a motif search typically does not rely on the inference of an evolutionary relationship((Motifs are short sub-sequences of DNA or a protein of defined length, e.g. the start codon //**ATG**// or the canonical splice donor and splice acceptor sites GT-AG. Due to their short length, it is feasible to assume that they can emerge independently more than once in the course of evolution.))  
  
 ===== Integrative approaches ===== ===== Integrative approaches =====
 Nowadays, experimental approaches, and those that are computer-based are far from being mutually exclusive. People rarely enter the lab without a prior idea about what a certain gene product could do. And likewise, //in silico// analyses on the computer heavily depend on the amount of existing information that can be exploited in the process annotating the function of an unknown gene product. In essence, annotating a novel gene product is equivalent to connecting it to the network of existing information about gene product function. Information is transferred, rather than generated //de novo//. If there is no network, then no connection can be established, no annotation transfer is possible, and the function of the gene product remains unknown. Nowadays, experimental approaches, and those that are computer-based are far from being mutually exclusive. People rarely enter the lab without a prior idea about what a certain gene product could do. And likewise, //in silico// analyses on the computer heavily depend on the amount of existing information that can be exploited in the process annotating the function of an unknown gene product. In essence, annotating a novel gene product is equivalent to connecting it to the network of existing information about gene product function. Information is transferred, rather than generated //de novo//. If there is no network, then no connection can be established, no annotation transfer is possible, and the function of the gene product remains unknown.