meta data for this page
  •  

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
general:bioseqanalysis:genesetanalysis:taxaminer [2024/02/09 14:56] ingogeneral:bioseqanalysis:genesetanalysis:taxaminer [2025/04/08 17:11] (current) – [taXaminer analysis] ingo
Line 18: Line 18:
 **What you need** **What you need**
   - The genome sequence in fasta format. We will be using ''/home/ubuntu/Share/Assemblies/crypto_BCM2021_v2.fasta''   - The genome sequence in fasta format. We will be using ''/home/ubuntu/Share/Assemblies/crypto_BCM2021_v2.fasta''
-  - the annotation file in gff3. We will be using <code>/home/ubuntu/Share/Analysis/taxaminer/results/metaeuk</code>+  - the annotation file in gff3. We will be using <code>/home/ubuntu/Share/Analysis/taxaminer/results/metaeuk/Crypto_Metaeuk.sorted.gff3</code>
   - **optionally:** the protein fasta file. :!: taXaminer will extract the protein sequences from the gff file if not provided.   - **optionally:** the protein fasta file. :!: taXaminer will extract the protein sequences from the gff file if not provided.
-  - **optionally:** read mapping information: One BAM file per library+  - **optionally:** read mapping information: One BAM file per library ''/home/ubuntu/fritz/sv-detection/for_ingo/illumina_pairs.mapped.sort.bam'' 
 +  - **optionally** a local installation of the [[https://github.com/bionf/taxaminer-dashboard|taxaminer-dashboard]]. 
  
 **What you get** **What you get**
-  - a per-base coverage file. :!: Only when mapping information was provided +  - a taxonomic assignment for each gene based on a modified version of the DIAMOND Last Common Ancestor algorithm((We modified it a bit because we have a strong prior from which organism the gene should come from))
-  - a taxonomic assignment for each gene based on the DIAMOND Least Common Ancestor algorithm((in fact, we modified it a bit because we have a strong prior from which organism the gene should come from))+
   - a file with feature vectors for each gene in CSV   - a file with feature vectors for each gene in CSV
   - a html-file with the PCA as a 3D plotly plot   - a html-file with the PCA as a 3D plotly plot
 +  - a file with the proteins encoded by the annotated genes
 +  - a text file with the diamond hits 
  
 ==== Running taXaminer ==== ==== Running taXaminer ====
   - Check for the presence of taXaminer on your system. To do so:   - Check for the presence of taXaminer on your system. To do so:
     - activate the conda environment: <WRAP>     - activate the conda environment: <WRAP>
-<code>conda activate /home/ubuntu/anaconda3/envs/taxaminer</code>+<code>conda activate /home/ubuntu/miniconda3/envs/taxaminer</code>
 </WRAP> </WRAP>
     - issue the following command to test if you can run taxaminer: <WRAP>     - issue the following command to test if you can run taxaminer: <WRAP>
Line 49: Line 51:
     * The genome sequence in fasta file     * The genome sequence in fasta file
     * The genome annotation in gff3 format     * The genome annotation in gff3 format
-    * any read mapping information in BAM format+    * any read mapping information in BAM format  
 +  - we will be using the unref50 database for the Diamond search. ''$HOME/Share/DBs/uniref50/db.dmnd''
   - edit the config-script according to your needs<WRAP>   - edit the config-script according to your needs<WRAP>
 <hidden ConfigScriptExample> <hidden ConfigScriptExample>
Line 59: Line 62:
 output_path: "AddYourInputDataHere" # directory to save results to output_path: "AddYourInputDataHere" # directory to save results to
 taxon_id: "AddYourInputDataHere" # NCBI Taxon ID of query species. Cryptosporidium parvum has the taxon id 5807 taxon_id: "AddYourInputDataHere" # NCBI Taxon ID of query species. Cryptosporidium parvum has the taxon id 5807
 +database_path: "$HOME/Share/DBs/uniref50/db.dmnd"
  
 ############################################################# #############################################################
Line 83: Line 87:
 </hidden> </hidden>
 </WRAP> </WRAP>
-  - activate the taxaminer conda environment unless you have already done so((:!: It is a good idea to **deactivate** the environment that you are currently in)): <code>conda activate /home/ubuntu/anaconda3/envs/taxaminer</code> 
   - run taXaminer by issuing the following command((make sure that the correct conda environment is active))<WRAP><code>taxaminer.run config.yml</code>:!: Make sure that you are either in the directory where the config.yml is located, or provide the path.   - run taXaminer by issuing the following command((make sure that the correct conda environment is active))<WRAP><code>taxaminer.run config.yml</code>:!: Make sure that you are either in the directory where the config.yml is located, or provide the path.
 </WRAP> </WRAP>