Changes

From Genome Analysis Wiki
Jump to navigationJump to search
33 bytes removed ,  06:55, 15 November 2009
no edit summary
Line 7: Line 7:  
*  Binary reference genome in nucleotide space (see [[#Input file requirement|Input file requirement]]})  
 
*  Binary reference genome in nucleotide space (see [[#Input file requirement|Input file requirement]]})  
 
*  Binary reference genome and word index in color space (see \ref{sec:2})  
 
*  Binary reference genome and word index in color space (see \ref{sec:2})  
*  Color space reads in valid color space FASTQ format (see \ref{sec:4.1} for file specification)  
+
*  Color space reads in valid color space FASTQ format (see [[#Input file requirement|Input file requirement]] for file specification)  
 
*  Color space reads are longer than minimum length requirement. (see \ref{sec:4.2})  
 
*  Color space reads are longer than minimum length requirement. (see \ref{sec:4.2})  
 
*&nbsp; Specify color space parameter when starting KARMA (see \ref{sec:3})<br>  
 
*&nbsp; Specify color space parameter when starting KARMA (see \ref{sec:3})<br>  
Line 65: Line 65:     
==Input file requirement==
 
==Input file requirement==
{{sec4.1}}
      
&nbsp; KARMA require input files in valid color space FASTQ format.<br> &nbsp; We require the length of reads(including leading primer) should equal to the length of its quality string.<br> &nbsp;<br> &nbsp; A valid example of color space FASTQ file:<br>
 
&nbsp; KARMA require input files in valid color space FASTQ format.<br> &nbsp; We require the length of reads(including leading primer) should equal to the length of its quality string.<br> &nbsp;<br> &nbsp; A valid example of color space FASTQ file:<br>
Line 77: Line 76:  
!!1111111111111111111111111111111111
 
!!1111111111111111111111111111111111
   −
== {{anchor|sec4.2}}Minimum read length requirement ==
+
==Minimum read length requirement==
    
&nbsp; Keep in mind that the requirement of minimum color space read length for KARMA is<br> &nbsp; twice the size of word plus two (including leading primer) \footnote{For nucleotide space,<br> &nbsp; the minimum length requirement is twice the word size.}.<br> &nbsp; For example, KARMA use word size of 15 by default, so it will try to map color space<br> &nbsp; reads that are longer than 32 base pairs.<br>
 
&nbsp; Keep in mind that the requirement of minimum color space read length for KARMA is<br> &nbsp; twice the size of word plus two (including leading primer) \footnote{For nucleotide space,<br> &nbsp; the minimum length requirement is twice the word size.}.<br> &nbsp; For example, KARMA use word size of 15 by default, so it will try to map color space<br> &nbsp; reads that are longer than 32 base pairs.<br>
   −
== {{anchor|sec4.3}} Auxiliary tools ==
+
==Auxiliary tools==
    
&nbsp; ABI SOLiD platform generated FASTA file (e.g. XXX.csfasta) and quality file (e.g. XXX\_QV.qual) separately. We wrote a script, ''solid2csfastq.py'', to convert it to color space FASTQ file(e.g. XXX.csfastq). We believe a single color space FASTQ file will simplify post processing.<br>
 
&nbsp; ABI SOLiD platform generated FASTA file (e.g. XXX.csfasta) and quality file (e.g. XXX\_QV.qual) separately. We wrote a script, ''solid2csfastq.py'', to convert it to color space FASTQ file(e.g. XXX.csfastq). We believe a single color space FASTQ file will simplify post processing.<br>
   −
== {{anchor|sec4.4}} Choose an appropriate size for word index ==
+
==Choose an appropriate size for word index==
    
Size for word index is sensitive to mapping performance. A small size of word index will increase the number of calculation cycles for a single read and duplications of a single word. On the other side, a big size will require much larger memory. Please also keep in mind that appropriate size is related to your hardware architecture. For practically purpose, we found size of 15 is optimal.
 
Size for word index is sensitive to mapping performance. A small size of word index will increase the number of calculation cycles for a single read and duplications of a single word. On the other side, a big size will require much larger memory. Please also keep in mind that appropriate size is related to your hardware architecture. For practically purpose, we found size of 15 is optimal.
255

edits

Navigation menu