Difference between revisions of "FastQValidator"

From Genome Analysis Wiki
Jump to navigationJump to search
Line 1: Line 1:
 
== Status ==
 
== Status ==
  
The FastQ Validator is on our [[Todo List]].
+
The [http://en.wikipedia.org/wiki/FASTQ_format FastQ] Validator is on our [[Todo List]].
  
 
== Valid FastQ File Requirements ==
 
== Valid FastQ File Requirements ==

Revision as of 10:44, 13 January 2010

Status

The FastQ Validator is on our Todo List.

Valid FastQ File Requirements

A valid fastQ file should meet the following requirements:

  • A base sequence should have non-zero length.
  • A quality string should be present for every base sequence.
  • Paired quality and base sequences should be of the same length.
  • Valid quality values should all have ASCII codes > 32.
  • Valid bases should be ACTG or N, unless ambiguous bases are explicitly allowed by the application consuming the file.
  • Every entry in the file should have a unique identifier.
  • Reads should be of a minimum length; many mappers will get into trouble with very short reads.
  • Base composition should be reported and tracked by position.