Changes

From Genome Analysis Wiki
Jump to navigationJump to search
348 bytes added ,  18:33, 15 July 2015
Line 57: Line 57:     
With the above in mind, we are able to combine these two tables to obtain the relationship between older rs number and new rs number.
 
With the above in mind, we are able to combine these two tables to obtain the relationship between older rs number and new rs number.
We have developed a script (for internal use), named [http://genome.sph.umich.edu/wiki/LiftRsNumber.py liftRsNumber.py] AAAA for lift rs numbers between builds.
+
We have developed a script (for internal use), named [http://genome.sph.umich.edu/wiki/LiftRsNumber.py liftRsNumber.py] for lift rs numbers between builds.
 
This scripts require RsMergeArch.bcp.gz  and SNPHistory.bcp.gz, those can be found in [[#Resources | Resources]].
 
This scripts require RsMergeArch.bcp.gz  and SNPHistory.bcp.gz, those can be found in [[#Resources | Resources]].
    
Example input:
 
Example input:
 
<pre>
 
<pre>
3115860
+
3000
12124819
+
3001
2229002
+
3002
1130683
   
</pre>
 
</pre>
   Line 73: Line 72:  
</pre>
 
</pre>
   −
Example otuput:
+
Example output:
 
<pre>
 
<pre>
unchanged 3115860
+
unchanged       3000
unchanged 12124819
+
lifted 2032
lifted 2229002
+
unchanged      3002
lifted 1130683
   
</pre>
 
</pre>
   Line 125: Line 123:     
Similar to the human reference build, dbSNP also have different versions. You may consider change rs number from the old dbSNP version to new dbSNP version  
 
Similar to the human reference build, dbSNP also have different versions. You may consider change rs number from the old dbSNP version to new dbSNP version  
depending on your needs. Such steps are described in [[#Lift dbSNP rs numbers | Lift dbSNP rs numbers]]. AAAA
+
depending on your needs. Such steps are described in [[#Lift dbSNP rs numbers | Lift dbSNP rs numbers]].
    
==== Method 2 ====
 
==== Method 2 ====
Line 136: Line 134:  
(2) Lookup SNP positions from rs number
 
(2) Lookup SNP positions from rs number
   −
dbSNP provides a file [[#Resources | joinb132_SNPChrPosOnRef_37_1.bcp.gz]] which contains rsNumber, chromosome and its position.
+
dbSNP provides a file [[#Resources | b132_SNPChrPosOnRef_37_1.bcp.gz]] which contains rsNumber, chromosome and its position.
 
Use this file along with the new rsNumber obtained in the first step.
 
Use this file along with the new rsNumber obtained in the first step.
 
In practice, some rs numbers do not exist in build 132, or not suitable to be considered ( e.g. they do not reside on human reference, or they are mapped to multiple locations, these scenarios are noted by the chromosome column with values like "AltOnly", "Multi", "NotOn", "PAR", "Un"), we can drop them in the liftover procedure.
 
In practice, some rs numbers do not exist in build 132, or not suitable to be considered ( e.g. they do not reside on human reference, or they are mapped to multiple locations, these scenarios are noted by the chromosome column with values like "AltOnly", "Multi", "NotOn", "PAR", "Un"), we can drop them in the liftover procedure.
Line 147: Line 145:     
==== Method 3 ====
 
==== Method 3 ====
NCBI dbSNP team has provided a [[ #Resources | provisional map ]] AAAA for converting the genome position of a larget set dbSNP from NCBI build 36 to NCBI build 37.  
+
NCBI dbSNP team has provided a [[ #Resources | provisional map ]] for converting the genome position of a larget set dbSNP from NCBI build 36 to NCBI build 37.  
 
In the second step, we have obtained unlifted genome positions, so we can try to use the table to convert those unlfted dbSNPs.
 
In the second step, we have obtained unlifted genome positions, so we can try to use the table to convert those unlfted dbSNPs.
 
After this step, there are still some SNPs that cannot be lifted, as they are mostly located on non-reference chromosome.
 
After this step, there are still some SNPs that cannot be lifted, as they are mostly located on non-reference chromosome.
Line 236: Line 234:     
== Resources ==
 
== Resources ==
* liftRsNumber.py [[liftRsNumber.py]] and its interal location: /net/dumbo/net/dumbo/home/zhanxw/amd/analyze/verifyBamID/liftRsNumber.py
+
* liftRsNumber.py [[liftRsNumber.py]] and its interal location: /net/fantasia/home/zhanxw/amd/analyze/verifyBamID/liftRsNumber.py
 
* liftMap.py [[liftMap.py]]
 
* liftMap.py [[liftMap.py]]
 
* NCBI provisional map [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/misc/exchange/Remap_36_3_37_1.txt.gz file] and [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/misc/exchange/Remap_36_3_37_1.info info]
 
* NCBI provisional map [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/misc/exchange/Remap_36_3_37_1.txt.gz file] and [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/misc/exchange/Remap_36_3_37_1.info info]
 
* NCBI RgMergeArch [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/database/organism_data/RsMergeArch.bcp.gz file] and [http://www.ncbi.nlm.nih.gov/SNP/snp_db_table_description.cgi?t=RsMergeArch schema]
 
* NCBI RgMergeArch [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/database/organism_data/RsMergeArch.bcp.gz file] and [http://www.ncbi.nlm.nih.gov/SNP/snp_db_table_description.cgi?t=RsMergeArch schema]
 
* NCBI SNPHistory [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/database/organism_data/SNPHistory.bcp.gz file] and [http://www.ncbi.nlm.nih.gov/SNP/snp_db_table_description.cgi?t=SNPHistory schema]
 
* NCBI SNPHistory [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/database/organism_data/SNPHistory.bcp.gz file] and [http://www.ncbi.nlm.nih.gov/SNP/snp_db_table_description.cgi?t=SNPHistory schema]
* NCBI SNPChrPosOnRef build 132 [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/database/b132_archive/organism_data/b132_SNPChrPosOnRef_37_1.bcp.gz file] and [http://www.ncbi.nlm.nih.gov/SNP/snp_db_table_description.cgi?t=SNPChrPosOnRef schema]
+
* NCBI SNPChrPosOnRef build 132 [http://qbrc.swmed.edu/zhanxw/software/liftOver/b132_SNPChrPosOnRef_37_1.bcp.gz file] and [http://www.ncbi.nlm.nih.gov/SNP/snp_db_table_description.cgi?t=SNPChrPosOnRef schema]. If this link becomes unavailable, please consider using this updated file ([ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606/database/organism_data/b144_SNPChrPosOnRef_107.bcp.gz link]).
 +
 
 
* How UCSC dbSNP differs from NCBI dbSNP [http://genomewiki.ucsc.edu/index.php/DbSNP_Track_Notes UCSC dbSNP track note]
 
* How UCSC dbSNP differs from NCBI dbSNP [http://genomewiki.ucsc.edu/index.php/DbSNP_Track_Notes UCSC dbSNP track note]
 
* The dbSNP mapping process [http://www.ncbi.nlm.nih.gov/books/NBK44455/ link]
 
* The dbSNP mapping process [http://www.ncbi.nlm.nih.gov/books/NBK44455/ link]
 
* NCBI dbSNP release 132 [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/VCF/v4.0/ByChromosomeNoGeno/00-All.vcf.gz 00-All.vcf.gz]
 
* NCBI dbSNP release 132 [ftp://ftp.ncbi.nih.gov:/snp/organisms/human_9606/VCF/v4.0/ByChromosomeNoGeno/00-All.vcf.gz 00-All.vcf.gz]
 
* UCSC dbSNP release 132 [http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/snp132.txt.gz snp132.txt.gz]
 
* UCSC dbSNP release 132 [http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/snp132.txt.gz snp132.txt.gz]
 +
 +
== Third party contribution ==
 +
 +
* liftOver for BEDPE format by [mailto://doug.phanstiel@gmail.com Doug] - [http://www.sharedproteomics.com/forum/showthread.php?2930-liftOverBedpe.py-Convert-genome-coordinates-for-bedpe-files&p=4854#post4854 liftOverBedpe.py].
    
== Acknowledge ==
 
== Acknowledge ==
Line 257: Line 260:     
Please contact [mailto:zhanxw@umich.edu Xiaowei Zhan].
 
Please contact [mailto:zhanxw@umich.edu Xiaowei Zhan].
  −
  −
AAAA
 
255

edits

Navigation menu