SOLiD sequencing data

This section outlines the general structure of the data from SOLiD based sequencers.

Structure of SOLiD run names

For multiplex fragment sequencing the run names will have the form:

<instrument-name>_<date-stamp>_FRAG_BC[_2]

(For example: solid0123_20110315_FRAG_BC).

The components are:

  • <instrument_name>: name of the SOLiD instrument e.g. solid0123

  • <date-stamp>: a date stamp in year-month-day format e.g. 20110315 is 15th March 2011

  • FRAG: indicates a fragment library was used

  • BC: indicates bar-coding was used (note that not all samples in the run might be bar-coded, even if this appears in the name)

  • 2: if this is present then it indicates the data came from flow cell 2; otherwise it’s from flow cell 1.

For multiplex paired-end sequencing the run names have the form:

<instrument-name>_<date-stamp>_PE_BC

Here the PE part of the name indicates a paired-end run.

Note

If the run name contains WFA then it’s a work-flow analysis and not final sequence data.

See also SOLiD 4 System Instrument Operation Quick Reference (PDF) for more information.