meta data for this page
  •  

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
compgenomics:modul2 [2025/05/16 09:08] – [Analysis] felixcompgenomics:modul2 [2025/05/17 12:54] (current) felix
Line 8: Line 8:
  
 <WRAP round tip 500px center> <WRAP round tip 500px center>
-Genes in two species that originated from the same ancestral gene in the last common ancestor of the species are good candidates for being 'corresponding' genes (orthologs)+**Assumption 1:** Genes in two species that originated from the same ancestral gene in the last common ancestor of the species are good candidates for being 'corresponding' genes (orthologs)
 </WRAP> </WRAP>
  
 <WRAP round tip 500px center> <WRAP round tip 500px center>
-Orthologs are genes in two species that are mutually most similar to each other+**Assumption 2:** Orthologs are genes in two species that are mutually most similar to each other
 </WRAP> </WRAP>
  
Line 23: Line 23:
 To find out how widespread the ability to degrade plant cell wall, really is we trace  the distribution of 235 potential PCDs across all eukaryotic datasets available in the RefSeq database. To find out how widespread the ability to degrade plant cell wall, really is we trace  the distribution of 235 potential PCDs across all eukaryotic datasets available in the RefSeq database.
  
-=== Task 1: Exploration ===+=== Task 1: Finding corresponding genes ===
  
-  - Open a visualizaiton of the results generated using [[https://ebersberger-46-155.biologie.uni-frankfurt.de/phyloppcd/|this interactive web-viewer]]  +The task at hand is to find out which genes "correspond" to each other in different species and should be displayed in the same row.  
-  - <wrap tip>Wait a moment </wrap> for the data to load. Once you were redirected to the "Main profile" page, select a taxonomic rank and click the red **PLOT** button +
-  - Explore the "Main plot" on different taxonomic levels +
-    - Which information is displayed in the rows, which in the columns? +
-    - What patterns can you observe in the plot and how do you interpret them? +
-  - Select the "Dimension reduction" plot from the top menu and explore the plot with "Phylum"-level labels +
- +
-=== Task 2: Finding corresponding genes === +
- +
-After having explored the results, let's find out how to generate them. The task at hand is to find out which genes "correspond" to each other in different speciesand should be displayed together in one row.  +
  
 Have another look at the assumptions at the top of the page. Our best bet to find "corresponding" genes between species is to identify orthologs. In practice, this means finding genes in two species that are mutually most similar to each other. Have another look at the assumptions at the top of the page. Our best bet to find "corresponding" genes between species is to identify orthologs. In practice, this means finding genes in two species that are mutually most similar to each other.
  
-Finding genes with significantly similar sequences is typically done with a BLAST search. To identify orthologs, we will perform a "reciprocal best hit serach". As an example, we will use the GH45 type cellulase of //Rhizoctonia solani// ([[https://www.ncbi.nlm.nih.gov/protein/XP_043186466.1?report=genbank&log\$=prottop&blast_rank=1&RID=2A0JJUND016|XP_043186466.1]]).+Finding genes with significantly similar sequences is typically done with a BLAST search. To identify orthologs, we will perform a "reciprocal best hit search". As an example, we will use the GH45 type cellulase of //Rhizoctonia solani// ([[https://www.ncbi.nlm.nih.gov/protein/XP_043186466.1?report=genbank&log\$=prottop&blast_rank=1&RID=2A0JJUND016|XP_043186466.1]]).
  
   - Open [[https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome|NCBI BLAST]]   - Open [[https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome|NCBI BLAST]]
-  - Start a search with the following parameters +  - Start a search with the following parameters: 
-    - Paste the sequence accession number into the search field (**XP_043186466.1**) +    - Paste the sequence accession number into the search field: <wrap tip>XP_043186466.1</wrap> 
-    - Select database -**Reference proteins (refseq_protein)** +    - Select database: <wrap tip>Reference proteins (refseq_protein)</wrap> 
-    - Select target species: **//Bradysia coprophila//**+    - Select target species: <wrap tip>//Bradysia coprophila//</wrap>
   - Extract the accession number of the best hit   - Extract the accession number of the best hit
   - Perform reverse BLAST serach in //Rhizoctonia solani//   - Perform reverse BLAST serach in //Rhizoctonia solani//
  
 +=== Task 2: Exploration ===
 +
 +  - Open a visualizaiton of the results generated using [[https://ebersberger-46-155.biologie.uni-frankfurt.de/phyloppcd/|this interactive web-viewer]]
 +  - <wrap important>Alternatively:</wrap> start PhyloProfile locally and input the data deposited at: ''https://applbio.biologie.uni-frankfurt.de/download/fDOG_pPCD/pPCD-PhylogeneticProfiles.tar.gz'' 
 +  - <wrap tip>Wait a moment </wrap> for the data to load. Once you were redirected to the "Main profile" page, select a taxonomic rank and click the red **PLOT** button
 +  - Explore the "Main plot" on different taxonomic levels
 +    - You can draw selection boxes on the overview plot and generate a more detailed plot of the selected section
 +    - Which information is displayed in the rows, which in the columns?
 +    - What patterns can you observe in the plot and how do you interpret them?
 +
 +=== Summary and Disussion ===
 +
 +  * Record your observations and questions
 +  * Discuss with the group
  
 ---- ----
  
 [[:compgenomics|Back to main]] [[:compgenomics|Back to main]]