Suppr超能文献

SNPdetector:一款用于灵敏且准确地检测单核苷酸多态性的软件工具。

SNPdetector: a software tool for sensitive and accurate SNP detection.

作者信息

Zhang Jinghui, Wheeler David A, Yakub Imtiaz, Wei Sharon, Sood Raman, Rowe William, Liu Paul P, Gibbs Richard A, Buetow Kenneth H

机构信息

Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America.

出版信息

PLoS Comput Biol. 2005 Oct;1(5):e53. doi: 10.1371/journal.pcbi.0010053. Epub 2005 Oct 28.

Abstract

Identification of single nucleotide polymorphisms (SNPs) and mutations is important for the discovery of genetic predisposition to complex diseases. PCR resequencing is the method of choice for de novo SNP discovery. However, manual curation of putative SNPs has been a major bottleneck in the application of this method to high-throughput screening. Therefore it is critical to develop a more sensitive and accurate computational method for automated SNP detection. We developed a software tool, SNPdetector, for automated identification of SNPs and mutations in fluorescence-based resequencing reads. SNPdetector was designed to model the process of human visual inspection and has a very low false positive and false negative rate. We demonstrate the superior performance of SNPdetector in SNP and mutation analysis by comparing its results with those derived by human inspection, PolyPhred (a popular SNP detection tool), and independent genotype assays in three large-scale investigations. The first study identified and validated inter- and intra-subspecies variations in 4,650 traces of 25 inbred mouse strains that belong to either the Mus musculus species or the M. spretus species. Unexpected heterozygosity in CAST/Ei strain was observed in two out of 1,167 mouse SNPs. The second study identified 11,241 candidate SNPs in five ENCODE regions of the human genome covering 2.5 Mb of genomic sequence. Approximately 50% of the candidate SNPs were selected for experimental genotyping; the validation rate exceeded 95%. The third study detected ENU-induced mutations (at 0.04% allele frequency) in 64,896 traces of 1,236 zebra fish. Our analysis of three large and diverse test datasets demonstrated that SNPdetector is an effective tool for genome-scale research and for large-sample clinical studies. SNPdetector runs on Unix/Linux platform and is available publicly (http://lpg.nci.nih.gov).

摘要

单核苷酸多态性(SNP)和突变的识别对于发现复杂疾病的遗传易感性很重要。PCR重测序是从头发现SNP的首选方法。然而,对推定SNP进行人工筛选一直是该方法应用于高通量筛选的主要瓶颈。因此,开发一种更灵敏、准确的自动化SNP检测计算方法至关重要。我们开发了一个软件工具SNPdetector,用于在基于荧光的重测序读数中自动识别SNP和突变。SNPdetector旨在模拟人类目视检查过程,具有非常低的假阳性和假阴性率。通过在三项大规模研究中将其结果与人工检查、PolyPhred(一种流行的SNP检测工具)以及独立基因型分析得出的结果进行比较,我们证明了SNPdetector在SNP和突变分析中的卓越性能。第一项研究识别并验证了属于小家鼠物种或西班牙小鼠物种的25个近交系小鼠品系的4650条序列中的亚种间和亚种内变异。在1167个小鼠SNP中,有两个在CAST/Ei品系中观察到意外的杂合性。第二项研究在人类基因组的五个ENCODE区域中识别出11241个候选SNP,覆盖2.5 Mb的基因组序列。大约50%的候选SNP被选用于实验基因分型;验证率超过95%。第三项研究在1236条斑马鱼的64896条序列中检测到ENU诱导的突变(等位基因频率为0.04%)。我们对三个大型且多样的测试数据集的分析表明,SNPdetector是基因组规模研究和大样本临床研究的有效工具。SNPdetector在Unix/Linux平台上运行,可公开获取(http://lpg.nci.nih.gov)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364c/1274293/cdb064a6388e/pcbi.0010053.g001.jpg

相似文献

1
SNPdetector: a software tool for sensitive and accurate SNP detection.
PLoS Comput Biol. 2005 Oct;1(5):e53. doi: 10.1371/journal.pcbi.0010053. Epub 2005 Oct 28.
2
InSNP: a tool for automated detection and visualization of SNPs and InDels.
Hum Mutat. 2005 Jul;26(1):11-9. doi: 10.1002/humu.20188.
3
SNP-VISTA: an interactive SNP visualization tool.
BMC Bioinformatics. 2005 Dec 8;6:292. doi: 10.1186/1471-2105-6-292.
4
Mining SNPs from EST sequences using filters and ensemble classifiers.
Genet Mol Res. 2010 May 4;9(2):820-34. doi: 10.4238/vol9-2gmr765.
5
SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries.
Nat Methods. 2008 Mar;5(3):247-52. doi: 10.1038/nmeth.1185. Epub 2008 Feb 24.
6
7
PolyScan: an automatic indel and SNP detection approach to the analysis of human resequencing data.
Genome Res. 2007 May;17(5):659-66. doi: 10.1101/gr.6151507. Epub 2007 Apr 6.
8
A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.
Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25.
9
Automating sequence-based detection and genotyping of SNPs from diploid samples.
Nat Genet. 2006 Mar;38(3):375-81. doi: 10.1038/ng1746. Epub 2006 Feb 19.

引用本文的文献

1
Comprehensive genomic analysis reveals molecular heterogeneity in pediatric ALK-positive anaplastic large cell lymphoma.
Leukemia. 2025 Jan;39(1):199-210. doi: 10.1038/s41375-024-02468-4. Epub 2024 Nov 26.
3
Chromosomal and gonadal factors regulate microglial sex effects in the aging brain.
Brain Res Bull. 2023 Apr;195:157-171. doi: 10.1016/j.brainresbull.2023.02.008. Epub 2023 Feb 15.
4
Early reactivation of clustered genes on the inactive X chromosome during somatic cell reprogramming.
Stem Cell Reports. 2022 Jan 11;17(1):53-67. doi: 10.1016/j.stemcr.2021.11.008. Epub 2021 Dec 16.
5
Genomic subtyping and therapeutic targeting of acute erythroleukemia.
Nat Genet. 2019 Apr;51(4):694-704. doi: 10.1038/s41588-019-0375-1. Epub 2019 Mar 29.
6
MLH1-rheMac hereditary nonpolyposis colorectal cancer syndrome in rhesus macaques.
Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2806-2811. doi: 10.1073/pnas.1722106115. Epub 2018 Feb 28.
7
Current Progresses of Single Cell DNA Sequencing in Breast Cancer Research.
Int J Biol Sci. 2017 Jul 18;13(8):949-960. doi: 10.7150/ijbs.19627. eCollection 2017.
8
A Bayesian Model for SNP Discovery Based on Next-Generation Sequencing Data.
IEEE Int Workshop Genomic Signal Process Stat. 2012 Dec;2012:42-45. doi: 10.1109/GENSIPS.2012.6507722.
9
PAX5 is a tumor suppressor in mouse mutagenesis models of acute lymphoblastic leukemia.
Blood. 2015 Jun 4;125(23):3609-17. doi: 10.1182/blood-2015-02-626127. Epub 2015 Apr 8.
10
The landscape of somatic mutations in infant MLL-rearranged acute lymphoblastic leukemias.
Nat Genet. 2015 Apr;47(4):330-7. doi: 10.1038/ng.3230. Epub 2015 Mar 2.

本文引用的文献

1
The patterns of natural variation in human genes.
Annu Rev Genomics Hum Genet. 2005;6:287-312. doi: 10.1146/annurev.genom.6.080604.162309.
2
Enzymatic mutation detection technologies.
Biotechniques. 2005 May;38(5):749-58. doi: 10.2144/05385RV01.
3
InSNP: a tool for automated detection and visualization of SNPs and InDels.
Hum Mutat. 2005 Jul;26(1):11-9. doi: 10.1002/humu.20188.
4
novoSNP, a novel computational tool for sequence variation discovery.
Genome Res. 2005 Mar;15(3):436-42. doi: 10.1101/gr.2754005.
6
The ENCODE (ENCyclopedia Of DNA Elements) Project.
Science. 2004 Oct 22;306(5696):636-40. doi: 10.1126/science.1105136.
8
The International HapMap Project.
Nature. 2003 Dec 18;426(6968):789-96. doi: 10.1038/nature02168.
9
Statistical significance for genomewide studies.
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5. doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.
10
An exhaustive DNA micro-satellite map of the human genome using high performance computing.
Genomics. 2003 Jul;82(1):10-9. doi: 10.1016/s0888-7543(03)00076-4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验