Map Reads to Reference
Read mapping is a very fundamental step in most applications of high-throughput sequencing data. The CLC Genomics Workbench includes read mapping in several other tools (e.g. in the RNA-Seq Analysis), but this chapter will focus on the core read mapping algorithm. At the end of the chapter you can find descriptions of the read mapping reports and a tool to merge read mappings.
There are two different versions of the core mapper: one for color space data, and one for base space data. At http://www.clcbio.com/white-paper you can find white papers with detailed benchmarks and descriptions of both algorithms.
The following description focuses on the parameters that can be directly influenced by the user.
Subsections
- Selecting reads and reference
- Including or excluding regions (masking)
- Mapping parameters
- Gap placement
- Computational requirements
- Reference Caching