====== Running rnaQUAST + BUSCO step by step ====== ===== Installation of rnaQUAST ===== * Create an anaconda environment for rnaQUAST and install it in the new environment. conda create --name "quastbusco" conda activate quastbusco #install rnaQUAST conda install -c bioconda rnaquast * If you forgot the commands for this, remember that you can always google the conda cheat sheet.*/ * Next, test your installation by running: #this will create an output-folder in your current directory: rnaQUAST.py --test ===== Running BUSCO ===== rnaQUAST is a tool for evaluating RNA-Seq assemblies using reference genome and gene database. In our case, we don't have a reference genome so we use a [[https://busco.ezlab.org/|BUSCO]] set. The datasets of BUSCO contain only those genes which are shared between all members of a certain taxonomic group. * Thinking of that, try to answer the following questions: - Which BUSCO dataset contains more genes, the one of the eukaryotes or the one of the plants? - Which BUSCO dataset should you choose for your rnaQUAST analysis? Try to have in mind what rnaQUAST is doing exactly and why we are using BUSCO in the first place. * Document your choice and your reasoning. Discuss with your colleagues if you're not sure. * Identify the appropriate BUSCO data set from the list that you obtain with busco --list * Download the chosen data set with the following command: busco --download ===== Running rnaQUAST ===== * Try to extract all the parameters you need from the [[https://github.com/ablab/rnaquast|rnaQUAST github page]] and run rnaQUAST * Run this command in your rnaQUAST conda environment: rnaQUAST.py --transcripts Trinity.fasta --busco -o ===== Interpretation of the rnaQUAST output ===== * Have a look at the test output and try to interpret it with the [[https://github.com/ablab/rnaquast|rnaQUAST github page]]. Document what you learned about the usage of rnaQUAST. * After the rnaQUAST + BUSCO run has finished, interpret and document the results. As always, try to - State your expectations - Summarize the results - Discuss