• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SNiPer-HD:通过用于高密度单核苷酸多态性(SNP)阵列的期望最大化算法提高基因型分型准确性。

SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays.

作者信息

Hua Jianping, Craig David W, Brun Marcel, Webster Jennifer, Zismann Victoria, Tembe Waibhav, Joshipura Keta, Huentelman Matthew J, Dougherty Edward R, Stephan Dietrich A

机构信息

Computational Biology Division Phoenix, 445 N 5th Street, Phoenix, AZ, USA.

出版信息

Bioinformatics. 2007 Jan 1;23(1):57-63. doi: 10.1093/bioinformatics/btl536. Epub 2006 Oct 24.

DOI:10.1093/bioinformatics/btl536
PMID:17062589
Abstract

MOTIVATION

The technology to genotype single nucleotide polymorphisms (SNPs) at extremely high densities provides for hypothesis-free genome-wide scans for common polymorphisms associated with complex disease. However, we find that some errors introduced by commonly employed genotyping algorithms may lead to inflation of false associations between markers and phenotype.

RESULTS

We have developed a novel SNP genotype calling program, SNiPer-High Density (SNiPer-HD), for highly accurate genotype calling across hundreds of thousands of SNPs. The program employs an expectation-maximization (EM) algorithm with parameters based on a training sample set. The algorithm choice allows for highly accurate genotyping for most SNPs. Also, we introduce a quality control metric for each assayed SNP, such that poor-behaving SNPs can be filtered using a metric correlating to genotype class separation in the calling algorithm. SNiPer-HD is superior to the standard dynamic modeling algorithm and is complementary and non-redundant to other algorithms, such as BRLMM. Implementing multiple algorithms together may provide highly accurate genotyping calls, without inflation of false positives due to systematically miss-called SNPs. A reliable and accurate set of SNP genotypes for increasingly dense panels will eliminate some false association signals and false negative signals, allowing for rapid identification of disease susceptibility loci for complex traits.

AVAILABILITY

SNiPer-HD is available at TGen's website: http://www.tgen.org/neurogenomics/data.

摘要

动机

以极高密度对单核苷酸多态性(SNP)进行基因分型的技术,为无假设的全基因组扫描常见复杂疾病相关多态性提供了可能。然而,我们发现常用基因分型算法引入的一些错误可能会导致标记与表型之间的错误关联增加。

结果

我们开发了一种新型SNP基因分型程序,即高密度SNP分型程序(SNiPer-HD),用于对数十万SNP进行高精度基因分型。该程序采用基于训练样本集参数的期望最大化(EM)算法。这种算法选择能对大多数SNP进行高精度基因分型。此外,我们为每个检测的SNP引入了一个质量控制指标,这样行为不佳的SNP可以使用与分型算法中基因型类别分离相关的指标进行过滤。SNiPer-HD优于标准动态建模算法,并且与其他算法(如BRLMM)互补且不冗余。一起使用多种算法可能会提供高精度的基因分型结果,而不会因系统性误判的SNP导致假阳性增加。对于越来越密集的面板,一套可靠且准确的SNP基因型将消除一些错误关联信号和假阴性信号,从而能够快速识别复杂性状的疾病易感位点。

可用性

SNiPer-HD可在TGen网站获取:http://www.tgen.org/neurogenomics/data 。

相似文献

1
SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays.SNiPer-HD:通过用于高密度单核苷酸多态性(SNP)阵列的期望最大化算法提高基因型分型准确性。
Bioinformatics. 2007 Jan 1;23(1):57-63. doi: 10.1093/bioinformatics/btl536. Epub 2006 Oct 24.
2
SNiPer: improved SNP genotype calling for Affymetrix 10K GeneChip microarray data.SNiPer:改进对Affymetrix 10K基因芯片微阵列数据的单核苷酸多态性(SNP)基因型分型
BMC Genomics. 2005 Oct 31;6:149. doi: 10.1186/1471-2164-6-149.
3
Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples.使用270个HapMap样本评估基因分型算法BRLMM对Affymetrix GeneChip Human Mapping 500 K芯片组的批次效应。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S17. doi: 10.1186/1471-2105-9-S9-S17.
4
A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.一种用于Affymetrix SNP微阵列的多阵列多SNP基因分型算法。
Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25.
5
A genotype calling algorithm for affymetrix SNP arrays.一种用于Affymetrix SNP阵列的基因型分型算法。
Bioinformatics. 2006 Jan 1;22(1):7-12. doi: 10.1093/bioinformatics/bti741. Epub 2005 Nov 2.
6
A genotype calling algorithm for the Illumina BeadArray platform.一种适用于Illumina BeadArray平台的基因型分型算法。
Bioinformatics. 2007 Oct 15;23(20):2741-6. doi: 10.1093/bioinformatics/btm443. Epub 2007 Sep 10.
7
Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays.基于动态模型的寡核苷酸微阵列上100K以上单核苷酸多态性(SNP)筛选和基因分型算法
Bioinformatics. 2005 May 1;21(9):1958-63. doi: 10.1093/bioinformatics/bti275. Epub 2005 Jan 18.
8
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.基于APEX微阵列数据的SNP基因型自动分型中的动态变量选择
BMC Bioinformatics. 2006 Nov 30;7:521. doi: 10.1186/1471-2105-7-521.
9
Evaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individuals.评估Affymetrix SNP Array 6.0平台在400名日本个体中的性能。
BMC Genomics. 2008 Sep 22;9:431. doi: 10.1186/1471-2164-9-431.
10
Assessing consistency between versions of genotype-calling algorithm Birdseed for the Genome-Wide Human SNP Array 6.0 using HapMap samples.使用 HapMap 样本评估 Genome-Wide Human SNP Array 6.0 的基因型调用算法 Birdseed 的不同版本之间的一致性。
Adv Exp Med Biol. 2010;680:355-60. doi: 10.1007/978-1-4419-5913-3_40.

引用本文的文献

1
A Novel Quality-Control Procedure to Improve the Accuracy of Rare Variant Calling in SNP Arrays.一种提高SNP阵列中罕见变异检测准确性的新型质量控制程序。
Front Genet. 2021 Oct 26;12:736390. doi: 10.3389/fgene.2021.736390. eCollection 2021.
2
BCRgt: a Bayesian cluster regression-based genotyping algorithm for the samples with copy number alterations.BCRgt:一种基于贝叶斯聚类回归的基因分型算法,用于具有拷贝数改变的样本。
BMC Bioinformatics. 2014 Mar 15;15:74. doi: 10.1186/1471-2105-15-74.
3
SITDEM: a simulation tool for disease/endpoint models of association studies based on single nucleotide polymorphism genotypes.
SITDEM:基于单核苷酸多态性基因型的关联研究疾病/终点模型的仿真工具。
Comput Biol Med. 2014 Feb;45:136-42. doi: 10.1016/j.compbiomed.2013.11.021. Epub 2013 Dec 19.
4
Using the R Package crlmm for Genotyping and Copy Number Estimation.使用R软件包crlmm进行基因分型和拷贝数估计。
J Stat Softw. 2011 May 1;40(12):1-32.
5
A review of software for microarray genotyping.微阵列基因分型软件综述。
Hum Genomics. 2011 May;5(4):304-9. doi: 10.1186/1479-7364-5-4-304.
6
Whole genome association analysis shows that ACE is a risk factor for Alzheimer's disease and fails to replicate most candidates from Meta-analysis.全基因组关联分析表明,血管紧张素转换酶是阿尔茨海默病的一个风险因素,并且未能重复荟萃分析中的大多数候选基因。
Int J Mol Epidemiol Genet. 2010;1(1):19-30. Epub 2009 Jul 26.
7
SAQC: SNP array quality control.SAQC:SNP 芯片质量控制。
BMC Bioinformatics. 2011 Apr 18;12:100. doi: 10.1186/1471-2105-12-100.
8
A knowledge-based weighting framework to boost the power of genome-wide association studies.基于知识的加权框架,提高全基因组关联研究的效力。
PLoS One. 2010 Dec 31;5(12):e14480. doi: 10.1371/journal.pone.0014480.
9
Clustering algorithms: on learning, validation, performance, and applications to genomics.聚类算法:关于学习、验证、性能以及在基因组学中的应用。
Curr Genomics. 2009 Sep;10(6):430-45. doi: 10.2174/138920209789177601.
10
PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data.PICNIC:一种利用微阵列癌症数据预测绝对等位基因拷贝数变异的算法。
Biostatistics. 2010 Jan;11(1):164-75. doi: 10.1093/biostatistics/kxp045. Epub 2009 Oct 15.