meta data for this page
  •  

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
general:bioseqanalysis:readquality:fastqc [2021/10/20 12:23] – created ingogeneral:bioseqanalysis:readquality:fastqc [2021/10/20 12:53] (current) ingo
Line 1: Line 1:
 ====== FASTQC Analysis ====== ====== FASTQC Analysis ======
-So, you will be working with Illumina sequence data, and you are interested in data quality. This is one way to look at the data. +So, you will be working with Illumina sequence data, and you are interested in data quality, and particularly in your sequencing error expressed as [[general:bioseqanalysis:sequencequalities|base quality values]]. This is one way to look at the data. 
-  - Identify the read set that you want to analyse. :!: Make sure that you have a fastq format<WRAP>+<figure FastQC> 
 +{{ :general:bioseqanalysis:images:fastqc.png?600 |}} 
 +<caption><fs 0.8em>Example output of a FASTQC analysis taken from the [[https://www.bioinformatics.babraham.ac.uk/projects/fastqc/|software homepage]]</fs></caption> 
 +</figure> 
 + 
 +  - Identify the read set that you want to analyse. :!: Make sure that you have a [[general:bioseqanalysis:sequencequalities#storing_base_quality_information|fastq]] format<WRAP>
 <hidden PairedEndLayout>In the case you are having a paired end library layout, is convention that you analyse forward and reverse reads separately, since their quality typically varies considerably. You can use samtools to split your readset into forward and reverse reads in the case you are having both in one file <hidden PairedEndLayout>In the case you are having a paired end library layout, is convention that you analyse forward and reverse reads separately, since their quality typically varies considerably. You can use samtools to split your readset into forward and reverse reads in the case you are having both in one file
 </hidden> </hidden>
Line 8: Line 13:
   - Use the fastq file as input for a [[https://www.bioinformatics.babraham.ac.uk/projects/fastqc/|FASTQC]] analysis<WRAP>   - Use the fastq file as input for a [[https://www.bioinformatics.babraham.ac.uk/projects/fastqc/|FASTQC]] analysis<WRAP>
 <hidden ReadMore> <hidden ReadMore>
-After installing FASTQC in its own conda environment, activate the new environment with the corresponding [[https://docs.conda.io/projects/conda/en/4.6.0/_downloads/52a95608c49671267e40c689e0bc00ca/conda-cheatsheet.pdf|conda command]]. Now you can call fastqc by simply typing <WRAP>  +After installing FASTQC in its own conda environment, activate the new environment with the corresponding [[https://docs.conda.io/projects/conda/en/4.6.0/_downloads/52a95608c49671267e40c689e0bc00ca/conda-cheatsheet.pdf|conda command]]. Now you can call FASTQC by simply typing 
-<code>fastqc filename.fq -o /path/to/output/directory </code> +<code>fastqc filename.fq -o /path/to/output/directory </code
 +</hidden
 This will generate a html file which you can copy to your local computer using scp. We made a short [[https://youtu.be/vhPFzSJEP2U|video]] that explains you how to do this.  This will generate a html file which you can copy to your local computer using scp. We made a short [[https://youtu.be/vhPFzSJEP2U|video]] that explains you how to do this. 
-</hidden> 
 </WRAP> </WRAP>
    - Open the html output files with any browser. With your results try to answer the following questions:    - Open the html output files with any browser. With your results try to answer the following questions:
Line 19: Line 24:
   - Perform an end trimming of the sequencing reads using [[http://www.usadellab.org/cms/?page=trimmomatic|Trimmomatic]]. What kind of information do you need to perform this analysis step? The [[:general:computerenvironment:software:trimmomatic|Trimmomatic page]] may provide some initial help.<WRAP>   - Perform an end trimming of the sequencing reads using [[http://www.usadellab.org/cms/?page=trimmomatic|Trimmomatic]]. What kind of information do you need to perform this analysis step? The [[:general:computerenvironment:software:trimmomatic|Trimmomatic page]] may provide some initial help.<WRAP>
 <hidden Spoiler> <hidden Spoiler>
- - First you need to extract the sequence of the adapter from FASTQC and save as a fastafile using a texteditor like nano:+ - First you need to extract the sequence of the adapter from FASTQC and save as a fasta file using a texteditor like nano:
 <code> <code>
 touch adaptator.fasta touch adaptator.fasta
Line 32: Line 37:
 </code> </code>
 - Make sure that you are in the conda environment that contains Trimmomatic and document the command you use to run it. Discuss with the group or the tutors if you are unsure about the parameters. - Make sure that you are in the conda environment that contains Trimmomatic and document the command you use to run it. Discuss with the group or the tutors if you are unsure about the parameters.
 +</hidden>
 +</WRAP>