meta data for this page
  •  

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
general:bioseqanalysis:genesetanalysis [2022/03/04 18:33] – created ingogeneral:bioseqanalysis:genesetanalysis [2024/02/15 06:58] (current) ingo
Line 1: Line 1:
 +======Gene set characterization ======
 +In this part  of the course, we will be working on analysing the gene sets that we have annotated on the assembled genome sequence. The work packages will have three main objectives
  
 +<WRAP round box center>
 +  - Assess the quality of the annotated gene set with respect to
 +     - possibile contaminations. :!: We could have done this already during the assembly , e.g. using [[https://blobtoolkit.genomehubs.org/blobtools2/|BlobTools]], but we didn't… 
 +    - completeness in terms of gene loci
 +    - completeness of the gene structure
 +  - Trace evolutionary changes in the gene set. :!: We will be concentrating on the loss of genes in //C. parvum//((why is analysing gene gain harder??))
 +</WRAP>
 +Once you have completed this set of excersises, you should have an idea how to
 +  * **run the individual analyses.** :!: Bear in mind, however, that input data is diverse with respect to data formats and data set size. Depending on your own data, you may have to adjust the commands that we provide in the individual exercises.
 +  * **determine quality measures of your genome assembly on the level of gene sets**. :!: Watch out, //QUALITY// is not measured in absolute quantities. You will have to specify for yourself, and probably for each analysis separately, what data you consider excellent/very good/good/acceptable/poor
 +  * differentiate between methodological artefacts and evolutionary changes
 +<WRAP round box>
 +Follow the links below to 
 <WRAP tabs> <WRAP tabs>
-  * [[general:bioseqanalysis:genesetanalysis:taxaminer]] +  * [[:general:bioseqanalysis:genesetanalysis:taxaminer|1) Contamination check - taXaminer]] 
 +  * [[:general:bioseqanalysis:genesetanalysis:busco|2) Gene set completeness - BUSCO]] 
 +  * [[:general:bioseqanalysis:genesetanalysis:fcat|3) Gene set completeness - fCAT]] 
 +</WRAP> 
 +Use the following links to navigate through the course 
 +<WRAP tabs> 
 +  * [[:physaliacg|Main page of the Physalia course]] 
 +  * [[:physaliacg:genomeannotation|Previous work package: Genome annotation]] 
 +  * [[:physaliacg:genefunction|Move to the next package: Phylogenetic profiles - fDOG]]
 </WRAP> </WRAP>