Whole genome sequencing (WGS)

The most comprehensive sequencing method is whole genome sequencing that allows for identification of genetic variations and somatic mutations across the entire human genome. This type of sequencing encompasses both chromosomal and mitochondrial DNA. The advantage of sequencing the entire genome is that not only the protein-coding regions are sequenced, but information is also provided for regulatory and non-protein-coding regions.

Eleven ready-to-use workflows are available for analysis of whole genome sequencing data (figure 15.1). The concept of the pre-installed ready-to-use workflows is that read data are used as input in one end of the workflow and in the other end of the workflow you get a track based genome browser view and a table with all the identified variants, which may or may not have been subjected to different kinds of filtering and/or annotation.

Image wgs_workflows
Figure 15.1: The eleven workflows available for analyzing whole genome sequencing data.

In this chapter we will discuss what the individual ready-to-use workflows can be used for and go through step by step how to run the workflows.

Note! Often you will have to prepare data with one of the two Preparing Raw Data workflows described in Preparing Raw Data before you proceed to Automatic analysis of sequencing data (WGS).



Subsections