Difference between revisions of "FastQValidator"
From Genome Analysis Wiki
Jump to navigationJump to searchLine 1: | Line 1: | ||
== Status == | == Status == | ||
− | The FastQ Validator is on our [[Todo List]]. | + | The [http://en.wikipedia.org/wiki/FASTQ_format FastQ] Validator is on our [[Todo List]]. |
== Valid FastQ File Requirements == | == Valid FastQ File Requirements == |
Revision as of 10:44, 13 January 2010
Status
The FastQ Validator is on our Todo List.
Valid FastQ File Requirements
A valid fastQ file should meet the following requirements:
- A base sequence should have non-zero length.
- A quality string should be present for every base sequence.
- Paired quality and base sequences should be of the same length.
- Valid quality values should all have ASCII codes > 32.
- Valid bases should be ACTG or N, unless ambiguous bases are explicitly allowed by the application consuming the file.
- Every entry in the file should have a unique identifier.
- Reads should be of a minimum length; many mappers will get into trouble with very short reads.
- Base composition should be reported and tracked by position.