meta data for this page
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
general:bioseqanalysis:genesetanalysis:taxaminer [2024/02/14 13:42] – [taXaminer analysis] ingo | general:bioseqanalysis:genesetanalysis:taxaminer [2025/04/08 17:11] (current) – [taXaminer analysis] ingo | ||
---|---|---|---|
Line 20: | Line 20: | ||
- the annotation file in gff3. We will be using < | - the annotation file in gff3. We will be using < | ||
- **optionally: | - **optionally: | ||
- | - **optionally: | + | - **optionally: |
+ | - **optionally** a local installation of the [[https:// | ||
**What you get** | **What you get** | ||
- | - a per-base coverage file. :!: Only when mapping information was provided | ||
- a taxonomic assignment for each gene based on a modified version of the DIAMOND Last Common Ancestor algorithm((We modified it a bit because we have a strong prior from which organism the gene should come from)) | - a taxonomic assignment for each gene based on a modified version of the DIAMOND Last Common Ancestor algorithm((We modified it a bit because we have a strong prior from which organism the gene should come from)) | ||
- a file with feature vectors for each gene in CSV | - a file with feature vectors for each gene in CSV | ||
- a html-file with the PCA as a 3D plotly plot | - a html-file with the PCA as a 3D plotly plot | ||
+ | - a file with the proteins encoded by the annotated genes | ||
+ | - a text file with the diamond hits | ||
==== Running taXaminer ==== | ==== Running taXaminer ==== | ||
- Check for the presence of taXaminer on your system. To do so: | - Check for the presence of taXaminer on your system. To do so: | ||
- activate the conda environment: | - activate the conda environment: | ||
- | < | + | < |
</ | </ | ||
- issue the following command to test if you can run taxaminer: < | - issue the following command to test if you can run taxaminer: < | ||
Line 49: | Line 51: | ||
* The genome sequence in fasta file | * The genome sequence in fasta file | ||
* The genome annotation in gff3 format | * The genome annotation in gff3 format | ||
- | * any read mapping information in BAM format | + | * any read mapping information in BAM format |
+ | - we will be using the unref50 database for the Diamond search. '' | ||
- edit the config-script according to your needs< | - edit the config-script according to your needs< | ||
<hidden ConfigScriptExample> | <hidden ConfigScriptExample> | ||
Line 59: | Line 62: | ||
output_path: | output_path: | ||
taxon_id: " | taxon_id: " | ||
+ | database_path: | ||
############################################################# | ############################################################# | ||
Line 83: | Line 87: | ||
</ | </ | ||
</ | </ | ||
- | - activate the taxaminer conda environment unless you have already done so((:!: It is a good idea to **deactivate** the environment that you are currently in)): < | ||
- run taXaminer by issuing the following command((make sure that the correct conda environment is active))< | - run taXaminer by issuing the following command((make sure that the correct conda environment is active))< | ||
</ | </ |