Gene set characterization

In this part of the course, we will be working on analysing the gene sets that we have annotated on the assembled genome sequence. The work packages will have three main objectives

  1. Assess the quality of the annotated gene set with respect to
    1. possibile contaminations. :!: We could have done this already during the assembly , e.g. using BlobTools, but we didn't…
    2. completeness in terms of gene loci
    3. completeness of the gene structure
  2. Trace evolutionary changes in the gene set. :!: We will be concentrating on the loss of genes in C. parvum1)

Once you have completed this set of excersises, you should have an idea how to

1)
why is analysing gene gain harder??