meta data for this page
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
general:bioseqanalysis:genesetanalysis:fcat [2024/02/16 13:50] – ingo | general:bioseqanalysis:genesetanalysis:fcat [2025/04/08 16:34] (current) – [fCAT analysis - Output visualization and interpretation] ingo | ||
---|---|---|---|
Line 10: | Line 10: | ||
</ | </ | ||
===== fCAT Core sets ===== | ===== fCAT Core sets ===== | ||
- | For the course, we have prepared | + | For the course, we have prepared |
==== Core set eukaryota ==== | ==== Core set eukaryota ==== | ||
Line 46: | Line 46: | ||
To solve this issue temporarily for the current shell, type< | To solve this issue temporarily for the current shell, type< | ||
< | < | ||
- | export COILSDIR=/ | + | export COILSDIR=/ |
</ | </ | ||
</ | </ | ||
Line 75: | Line 75: | ||
- run the fCAT analysis on the AWS with the following core set | - run the fCAT analysis on the AWS with the following core set | ||
- eukaryota | - eukaryota | ||
- | - for the **eukaryota core set** and the MetaEuk gene prediction on the CryPa_BCM2021a assembly, invoke the analysis with the following command((This assumes that you are in $HOME/ | + | - for the **eukaryota core set** and the MetaEuk gene prediction on the CryPa_BCM2021a assembly, invoke the analysis with the following command((This assumes that you are in $HOME/ |
<hidden Spoiler> | <hidden Spoiler> | ||
< | < | ||
Line 81: | Line 81: | ||
Mode 1: | Mode 1: | ||
genomeID similar dissimilar duplicated missing ignored total | genomeID similar dissimilar duplicated missing ignored total | ||
- | CRYPA@5807@240206 149 89 0 86 8 333 | + | CRYPA@5807@240206 149 89 0 86 8 332 |
Mode 2: | Mode 2: | ||
genomeID similar dissimilar duplicated missing ignored total | genomeID similar dissimilar duplicated missing ignored total | ||
- | CRYPA@5807@240206 141 97 0 86 8 333 | + | CRYPA@5807@240206 141 97 0 86 8 332 |
Mode 3: | Mode 3: | ||
genomeID similar dissimilar duplicated missing ignored total | genomeID similar dissimilar duplicated missing ignored total | ||
- | CRYPA@5807@240206 215 23 0 86 8 333 | + | CRYPA@5807@240206 215 23 0 86 8 332 |
Mode 4: | Mode 4: | ||
genomeID complete fragmented duplicated missing ignored total | genomeID complete fragmented duplicated missing ignored total | ||
- | CRYPA@5807@240206 217 21 0 86 8 333 | + | CRYPA@5807@240206 217 21 0 86 8 332 |
</ | </ | ||
Line 101: | Line 101: | ||
==== fCAT analysis - Output visualization and interpretation ==== | ==== fCAT analysis - Output visualization and interpretation ==== | ||
- | fCAT in combination with [[https:// | + | fCAT in combination with [[https:// |
<hidden PrecomputedFiles> | <hidden PrecomputedFiles> | ||
You will find all pre-computed fCAT results at ''/ | You will find all pre-computed fCAT results at ''/ | ||
</ | </ | ||
=== Downloading the data === | === Downloading the data === | ||
- | Download the following three files from the fcat output folder, e.g. '' | + | Download the following three files from the fcat output folder, e.g. '' |
- *.phyloprofile :!: These files contains the information about the presence/ | - *.phyloprofile :!: These files contains the information about the presence/ | ||
- *.mod.fa :!: This file contains the sequences of the orthologs in FASTA format | - *.mod.fa :!: This file contains the sequences of the orthologs in FASTA format | ||
Line 121: | Line 121: | ||
- upload the *domains file into the field at the lower left | - upload the *domains file into the field at the lower left | ||
- specify the origin of group IDs you are using | - specify the origin of group IDs you are using | ||
- | - Dataset // | ||
- Dataset // | - Dataset // | ||
- plot the results by clicking on ‘’Plot’’ | - plot the results by clicking on ‘’Plot’’ | ||
Line 136: | Line 135: | ||
- redo the selection, this time selecting all genes from the // | - redo the selection, this time selecting all genes from the // | ||
- if you do not find a single clade comprising all the genes that are missing in //C. parvum// do the following: | - if you do not find a single clade comprising all the genes that are missing in //C. parvum// do the following: | ||
- | - Look for the file '' | + | - Look for the file '' |
- go to the tab '' | - go to the tab '' | ||
- find the button to upload a gene list for selecting a gene set of interest< | - find the button to upload a gene list for selecting a gene set of interest< |