===== Biodiverse genome assemblies =====
How do we explain the sporadic presence of orthologs in a clade?
----
**Assumption:** All detected genes are encoded in the genome of the respective species
==== Analysis ====
In this module, we will analyse the orthologs of the GH28 glycosyl hydrolase in the two invertebrates //Bradysia coprophila// and //Spodoptera litura// in more detail.
=== Task 1: Spodoptera litura ===
* Find the NCBI accession ID of the GH28_QRW23661.1 ortholog in //Spodoptera litura// using the [[https://ebersberger-46-155.biologie.uni-frankfurt.de/phyloppcd/|interactive webviewer]]
[[https://www.ncbi.nlm.nih.gov/protein/XP_022836218.1?report=genpept|XP_022836218.1]]
* Perform a [[https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins|BLASTp search]] against the "Non-redundant protein sequences (nr)" database
* Take a look at the list of best hits. Which taxonomic clade do they represent?
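
Scanning a long hit list by eye can be tedious. The following is a minimal sketch, not part of the course material, of how the organisms named in the hit titles could be tallied. It assumes that the hit descriptions were copied from the BLAST results page as plain text and that each title ends with the organism in square brackets. The function names and the example titles (with invented organism names) are hypothetical placeholders and not real search results.

```python
import re
from collections import Counter

# Matches a trailing "[Organism name]" at the end of a hit title.
ORGANISM = re.compile(r"\[([^\[\]]+)\]\s*$")


def organism_of(title):
    """Return the organism named at the end of a hit title, or None."""
    match = ORGANISM.search(title.strip())
    return match.group(1) if match else None


def tally_genera(titles):
    """Count hits per genus (first word of the organism name)."""
    counts = Counter()
    for title in titles:
        organism = organism_of(title)
        genus = organism.split()[0] if organism else "unknown"
        counts[genus] += 1
    return counts


# Placeholder titles with invented organism names; replace them with
# the descriptions from your own BLASTp result.
example_titles = [
    "polygalacturonase [Examplea prima]",
    "glycoside hydrolase family 28 protein [Examplea secunda]",
    "hypothetical protein [Demoria tertia]",
    "unnamed protein product",
]

genus_counts = tally_genera(example_titles)
for genus, n in genus_counts.most_common():
    print(genus, n)
```

The tally only groups hits by genus. To decide which taxonomic clade the best hits represent, the lineage of each genus still has to be looked up, for example in the NCBI Taxonomy browser.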
=== Task 2: Bradysia coprophila ===
* Repeat the analysis with the GH28_QRW23661.1 ortholog in //Bradysia coprophila//
[[https://www.ncbi.nlm.nih.gov/protein/XP_037031340.1|XP_037031340.1]]
=== Summary and Discussion ===
* Note down your observations and discuss them with the group
----
[[:compgenomics|Back to main]]