Sequence data formats

File type              Suffix       Import  Export  Description
Annotation CSV export  .csv                 X       Annotations in csv format
CLC                    .clc         X       X       Rich format including all information
FASTA                  .fsa/.fasta  X       X       Simple format, name & description
GCG sequence           .gcg         X               Rich information incl. annotations
Raw sequence           any          X               Only sequence (no name)
Sequence CSV           .csv         X       X       Simple format. One seq per line: name, description (optional), sequence
Tab delimited text     .txt                 X       Annotations in tab delimited text format
PIR (NBRF)             .pir         X       X       Simple format, name and description
Swiss-Prot             .swp         X               Rich information incl. annotations (only peptides)
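
As an illustration of the two simple formats in the table, the sketch below converts Sequence CSV rows (name, optional description, sequence) into FASTA records. It is not part of the Workbench: the function name sequence_csv_to_fasta is hypothetical, and the code assumes comma-separated fields with exactly two or three fields per line.

```python
import csv
import io


def sequence_csv_to_fasta(csv_text):
    """Convert 'name, description (optional), sequence' rows to FASTA text.

    Assumption: fields are comma-separated and each non-blank line
    holds either two fields (name, sequence) or three fields
    (name, description, sequence).
    """
    records = []
    for row in csv.reader(io.StringIO(csv_text)):
        fields = [field.strip() for field in row]
        if not any(fields):
            continue  # skip blank lines
        if len(fields) == 2:
            name, sequence = fields
            description = ""
        elif len(fields) == 3:
            name, description, sequence = fields
        else:
            raise ValueError(f"expected 2 or 3 fields, got {len(fields)}: {row!r}")
        # FASTA header: '>' followed by the name and, if present, the description
        header = f">{name} {description}".rstrip()
        records.append(f"{header}\n{sequence}\n")
    return "".join(records)


if __name__ == "__main__":
    print(sequence_csv_to_fasta("seq1,first example,ACGT\nseq2,GGCC\n"), end="")
```

A row without a description simply produces a header that contains only the name.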

Note that high-throughput sequencing data from Illumina, SOLiD, Ion Torrent and 454, as well as high-throughput FASTA and trace files, are imported using a special import, as described in Import high-throughput sequencing data. These data can also be exported in FASTQ format (using NCBI/Sanger Phred quality scores).
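
For readers unfamiliar with the NCBI/Sanger quality encoding mentioned above: in that convention each quality character stores a Phred score Q as the ASCII character with code Q + 33, and Q relates to the base-call error probability p by Q = -10 * log10(p). The following minimal sketch decodes a quality line accordingly. The function names are hypothetical and the code is independent of the Workbench.

```python
def decode_sanger_qualities(quality_string, offset=33):
    """Return the Phred scores encoded in a Sanger-style FASTQ quality line.

    Each character encodes one score as chr(score + offset); the
    Sanger/NCBI convention uses offset 33.
    """
    return [ord(character) - offset for character in quality_string]


def error_probability(phred_score):
    """Convert a Phred score Q to an error probability p = 10 ** (-Q / 10)."""
    return 10 ** (-phred_score / 10)


if __name__ == "__main__":
    scores = decode_sanger_qualities("!+5I")
    for score in scores:
        print(score, error_probability(score))
```

A score of 20 therefore corresponds to an estimated 1 in 100 chance that the base call is wrong, and a score of 40 to 1 in 10,000.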

When exporting in FASTA format, it is possible to remove sequence ends covered by annotations of type "Trim" (read more in Trimming).
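
The idea of removing annotated ends can be sketched as follows. This is only an illustration under stated assumptions, not the Workbench's implementation: annotations are represented here as hypothetical (type, start, end) tuples with 0-based, end-exclusive coordinates, and only "Trim" annotations that reach an end of the sequence are removed, while internal ones are left alone.

```python
def trim_ends(sequence, annotations):
    """Remove sequence ends covered by annotations of type "Trim".

    Assumption: each annotation is a (type, start, end) tuple with
    0-based, end-exclusive coordinates. This layout is hypothetical.
    """
    trims = [(start, end) for kind, start, end in annotations if kind == "Trim"]
    left, right = 0, len(sequence)

    # Extend the left cut through every Trim region that covers it.
    for start, end in sorted(trims):
        if start <= left < end:
            left = end

    # Extend the right cut backwards in the same way.
    for start, end in sorted(trims, key=lambda region: region[1], reverse=True):
        if start < right <= end:
            right = start

    # If the trimmed regions meet or overlap, nothing remains.
    return sequence[left:right] if left < right else ""


if __name__ == "__main__":
    example = [("Trim", 0, 2), ("Trim", 10, 12), ("Gene", 2, 10)]
    print(trim_ends("NNACGTACGTNN", example))
```

Annotations of other types, such as the "Gene" entry in the example, do not affect the result.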