Retrieving reference data tracks

For most applications (except de novo sequencing), you will need reference data in the form of a reference genome sequence, annotations, known variants etc. There are three basic ways of obtaining reference data tracks:

  1. Use the integrated tool for downloading reference genomes as tracks (see Download Genomes).

  2. Import tracks from files (see Import tracks from file).

  3. Convert sequences with annotations to tracks (learn more in Converting data to tracks and back). Sequences can come from a variety of sources:
    Standard Import (Image Next_Folder_16_n_p)
    The standard import accepts common data formats like fasta, genbank etc. (learn more in Bioinformatic data formats)
    Downloading from NCBI
    The integrated tool for searching and downloading data from NCBI (learn more in GenBank search).
    Contigs created from de novo assembly.
    Contig sequences from de novo assembly can be considered a reference genome for e.g. subsequent resequencing analysis applications.
Please note that tracks are not yet supported with all transcriptomics tools of CLC Genomics Workbench. In this case, you have to provide standard sequences (downloading from NCBI or importing files).