Changes

From Genome Analysis Wiki
532 bytes added ,  10:01, 22 May 2018
no edit summary
Line 5: Line 5:  
Accompanying the SAV format is the [[Savvy C++ Library | Savvy C++ programming library ]] for interfacing with it and other file formats. This library was designed for efficient association analysis and provides a mechanism to plug in linear algebra and numerical libraries, which reduces the overhead of copying data and lowers the memory footprint.
+ ==S1R Index==
+ SAV files are indexed using an S1R (Sort-tile-recursive One-dimensional R-tree) index file. Genomic regions are organized into an R-tree, which enables fast random access to a SAV file without parsing the entire index file. Each leaf entry in the tree points to a zstd-compressed block in the corresponding SAV file. The entry also encodes the number of variants in the block, which varies with the parameters used to compress the SAV file.
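The lookup described above can be illustrated with a minimal in-memory sketch. It is not the S1R on-disk layout and does not use the Savvy API: the names <code>LeafEntry</code>, <code>Node</code>, <code>PackedRTree1D</code>, <code>build_tree</code>, and <code>query_region</code>, the field layout, and the fanout parameter are all assumptions made for illustration. In one dimension, sort-tile-recursive packing reduces to sorting the leaf entries by start position and grouping consecutive runs under parent nodes, level by level, until a single root remains.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical leaf entry: one genomic interval and the compressed block that holds it.
struct LeafEntry {
    std::uint64_t begin;         // first position covered by the block (inclusive)
    std::uint64_t end;           // last position covered by the block (inclusive)
    std::uint64_t block_offset;  // assumed: byte offset of the block in the data file
    std::uint32_t record_count;  // number of variants stored in the block
};

// Internal node: bounding interval over a contiguous run of items one level down.
struct Node {
    std::uint64_t begin;
    std::uint64_t end;
    std::size_t first_child;
    std::size_t child_count;
};

struct PackedRTree1D {
    std::vector<LeafEntry> leaves;          // sorted by begin
    std::vector<std::vector<Node>> levels;  // levels[0] sits above the leaves; levels.back() is the root level
};

// Group every `fanout` consecutive items under one parent with a bounding interval.
template <typename Item>
std::vector<Node> pack_level(const std::vector<Item>& items, std::size_t fanout) {
    std::vector<Node> out;
    for (std::size_t i = 0; i < items.size(); i += fanout) {
        const std::size_t n = std::min(fanout, items.size() - i);
        Node node{items[i].begin, items[i].end, i, n};
        for (std::size_t j = i + 1; j < i + n; ++j) {
            node.begin = std::min(node.begin, items[j].begin);
            node.end = std::max(node.end, items[j].end);
        }
        out.push_back(node);
    }
    return out;
}

// Sort the entries, then pack levels bottom-up until one root node remains.
PackedRTree1D build_tree(std::vector<LeafEntry> entries, std::size_t fanout) {
    assert(fanout >= 2);  // a fanout below 2 would never shrink a level
    std::sort(entries.begin(), entries.end(),
              [](const LeafEntry& a, const LeafEntry& b) { return a.begin < b.begin; });
    PackedRTree1D tree;
    tree.leaves = std::move(entries);
    tree.levels.push_back(pack_level(tree.leaves, fanout));
    while (tree.levels.back().size() > 1) {
        std::vector<Node> above = pack_level(tree.levels.back(), fanout);
        tree.levels.push_back(std::move(above));
    }
    return tree;
}

// Descend only into nodes whose bounding interval overlaps [query_begin, query_end].
void collect(const PackedRTree1D& tree, std::size_t level, std::size_t index,
             std::uint64_t query_begin, std::uint64_t query_end,
             std::vector<LeafEntry>& out) {
    const Node& node = tree.levels[level][index];
    if (node.end < query_begin || node.begin > query_end) return;  // prune this subtree
    for (std::size_t c = node.first_child; c < node.first_child + node.child_count; ++c) {
        if (level == 0) {
            const LeafEntry& e = tree.leaves[c];
            if (e.end >= query_begin && e.begin <= query_end) out.push_back(e);
        } else {
            collect(tree, level - 1, c, query_begin, query_end, out);
        }
    }
}

// Return the leaf entries (blocks) that overlap the query region, ordered by begin.
std::vector<LeafEntry> query_region(const PackedRTree1D& tree,
                                    std::uint64_t query_begin, std::uint64_t query_end) {
    std::vector<LeafEntry> out;
    if (!tree.leaves.empty()) {
        collect(tree, tree.levels.size() - 1, 0, query_begin, query_end, out);
    }
    return out;
}
```

A caller would pass the region of interest to <code>query_region</code> and then decompress only the blocks named by the returned entries. In this sketch the whole tree is held in memory; an index file can get the same pruning by storing each level contiguously and reading only the nodes on the path to the overlapping leaves.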
+
+ ===Diagram of R-tree===
+ [[File:s1r_diagram.png|900 px]]
    
== APIs ==