meta data for this page
This is an old revision of the document!
FASTQC Analysis
So, you will be working with Illumina sequence data, and you are interested in data quality. This is one way to look at the data.
- Identify the read set that you want to analyse.
Make sure that you have a fastq format
- Install FASTQC using Anaconda
- Open the html output files with any browser. With your results try to answer the following questions:
- What kind of information do you get after running FASTQC?
- Try to make a statement about the quality of your sequencing run.
- Take a look at the overrepresented sequences, and overrepresented Kmers report. Interpret the results and reconcile with your expectation. In case you have no expectation, make sure to discuss with the tutors.
- Perform an end trimming of the sequencing reads using Trimmomatic. What kind of information do you need to perform this analysis step? The Trimmomatic page may provide some initial help.