• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种解决大规模关联研究中基因分型差异偏倚的方法。

A method to address differential bias in genotyping in large-scale association studies.

作者信息

Plagnol Vincent, Cooper Jason D, Todd John A, Clayton David G

机构信息

Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, United Kingdom.

出版信息

PLoS Genet. 2007 May 18;3(5):e74. doi: 10.1371/journal.pgen.0030074. Epub 2007 Apr 5.

DOI:10.1371/journal.pgen.0030074
PMID:17511519
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1868951/
Abstract

In a previous paper we have shown that, when DNA samples for cases and controls are prepared in different laboratories prior to high-throughput genotyping, scoring inaccuracies can lead to differential misclassification and, consequently, to increased false-positive rates. Different DNA sourcing is often unavoidable in large-scale disease association studies of multiple case and control sets. Here, we describe methodological improvements to minimise such biases. These fall into two categories: improvements to the basic clustering methods for identifying genotypes from fluorescence intensities, and use of "fuzzy" calls in association tests in order to make appropriate allowance for call uncertainty. We find that the main improvement is a modification of the calling algorithm that links the clustering of cases and controls while allowing for different DNA sourcing. We also find that, in the presence of different DNA sourcing, biases associated with missing data can increase the false-positive rate. Therefore, we propose the use of "fuzzy" calls to deal with uncertain genotypes that would otherwise be labeled as missing.

摘要

在之前的一篇论文中我们已经表明,当病例组和对照组的DNA样本在高通量基因分型之前于不同实验室制备时,评分不准确会导致差异性错误分类,进而导致假阳性率增加。在多个病例组和对照组的大规模疾病关联研究中,不同的DNA来源往往不可避免。在此,我们描述了将此类偏差降至最低的方法改进。这些改进分为两类:对从荧光强度识别基因型的基本聚类方法的改进,以及在关联测试中使用“模糊”调用以便适当考虑调用不确定性。我们发现主要的改进是对调用算法的修改,该算法在允许不同DNA来源的同时将病例组和对照组的聚类联系起来。我们还发现,在存在不同DNA来源的情况下,与缺失数据相关的偏差会增加假阳性率。因此,我们建议使用“模糊”调用处理否则会被标记为缺失的不确定基因型。

相似文献

1
A method to address differential bias in genotyping in large-scale association studies.一种解决大规模关联研究中基因分型差异偏倚的方法。
PLoS Genet. 2007 May 18;3(5):e74. doi: 10.1371/journal.pgen.0030074. Epub 2007 Apr 5.
2
Population structure, differential bias and genomic control in a large-scale, case-control association study.一项大规模病例对照关联研究中的群体结构、差异偏倚与基因组控制
Nat Genet. 2005 Nov;37(11):1243-6. doi: 10.1038/ng1653. Epub 2005 Oct 9.
3
Missing call bias in high-throughput genotyping.高通量基因分型中的缺失呼叫偏差。
BMC Genomics. 2009 Mar 13;10:106. doi: 10.1186/1471-2164-10-106.
4
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
5
Smarter clustering methods for SNP genotype calling.用于单核苷酸多态性(SNP)基因分型的更智能聚类方法。
Bioinformatics. 2008 Dec 1;24(23):2665-71. doi: 10.1093/bioinformatics/btn509. Epub 2008 Sep 29.
6
Bias Characterization in Probabilistic Genotype Data and Improved Signal Detection with Multiple Imputation.概率基因型数据中的偏差特征分析与多重填补改进信号检测
PLoS Genet. 2016 Jun 16;12(6):e1006091. doi: 10.1371/journal.pgen.1006091. eCollection 2016 Jun.
7
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.基于APEX微阵列数据的SNP基因型自动分型中的动态变量选择
BMC Bioinformatics. 2006 Nov 30;7:521. doi: 10.1186/1471-2105-7-521.
8
SNiPer: improved SNP genotype calling for Affymetrix 10K GeneChip microarray data.SNiPer:改进对Affymetrix 10K基因芯片微阵列数据的单核苷酸多态性(SNP)基因型分型
BMC Genomics. 2005 Oct 31;6:149. doi: 10.1186/1471-2164-6-149.
9
M(3)-S: a genotype calling method incorporating information from samples with known genotypes.M(3)-S:一种整合来自具有已知基因型样本信息的基因型分型方法。
BMC Bioinformatics. 2015 Dec 3;16:403. doi: 10.1186/s12859-015-0824-5.
10
A new expectation-maximization statistical test for case-control association studies considering rare variants obtained by high-throughput sequencing.一种用于病例对照关联研究的新期望最大化统计检验,该研究考虑通过高通量测序获得的罕见变异。
Hum Hered. 2011;71(2):113-25. doi: 10.1159/000325590. Epub 2011 Jul 6.

引用本文的文献

1
Next-Generation Sequencing Data-Based Association Testing of a Group of Genetic Markers for Complex Responses Using a Generalized Linear Model Framework.使用广义线性模型框架基于下一代测序数据对一组遗传标记进行复杂反应的关联测试。
Mathematics (Basel). 2023 Jun 1;11(11). doi: 10.3390/math11112560. Epub 2023 Jun 2.
2
A hybrid qPCR/SNP array approach allows cost efficient assessment of KIR gene copy numbers in large samples.一种混合定量聚合酶链反应/单核苷酸多态性阵列方法能够对大量样本中的杀伤细胞免疫球蛋白样受体(KIR)基因拷贝数进行经济高效的评估。
BMC Genomics. 2014 Apr 11;15:274. doi: 10.1186/1471-2164-15-274.
3
Association claims in the sequencing era.

本文引用的文献

1
Optimal genotype determination in highly multiplexed SNP data.高度多重单核苷酸多态性(SNP)数据中的最佳基因型确定
Eur J Hum Genet. 2006 Feb;14(2):207-15. doi: 10.1038/sj.ejhg.5201528.
2
A haplotype map of the human genome.人类基因组单倍型图谱。
Nature. 2005 Oct 27;437(7063):1299-320. doi: 10.1038/nature04226.
3
Population structure, differential bias and genomic control in a large-scale, case-control association study.一项大规模病例对照关联研究中的群体结构、差异偏倚与基因组控制
测序时代的关联主张。
Genes (Basel). 2014 Mar 11;5(1):196-213. doi: 10.3390/genes5010196.
4
Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error.对下一代测序基因关联进行单变量和多变量趋势检验,对测序错误具有稳健性。
Hum Hered. 2012;74(3-4):172-83. doi: 10.1159/000346824. Epub 2013 Apr 11.
5
ETHNOPRED: a novel machine learning method for accurate continental and sub-continental ancestry identification and population stratification correction.ETHNOPRED:一种用于准确进行大陆和次大陆祖先鉴定和群体分层校正的新型机器学习方法。
BMC Bioinformatics. 2013 Feb 22;14:61. doi: 10.1186/1471-2105-14-61.
6
Haplotypes with copy number and single nucleotide polymorphisms in CYP2A6 locus are associated with smoking quantity in a Japanese population.CYP2A6 基因座的拷贝数和单核苷酸多态性单体型与日本人群的吸烟量有关。
PLoS One. 2012;7(9):e44507. doi: 10.1371/journal.pone.0044507. Epub 2012 Sep 25.
7
Incorporating genotype uncertainties into the genotypic TDT for main effects and gene-environment interactions.将基因型不确定性纳入主效应和基因-环境相互作用的基因型 TDT 中。
Genet Epidemiol. 2012 Apr;36(3):225-34. doi: 10.1002/gepi.21615.
8
Blood pressure loci identified with a gene-centric array.基于基因芯片鉴定的血压相关基因座
Am J Hum Genet. 2011 Dec 9;89(6):688-700. doi: 10.1016/j.ajhg.2011.10.013. Epub 2011 Nov 17.
9
A review of software for microarray genotyping.微阵列基因分型软件综述。
Hum Genomics. 2011 May;5(4):304-9. doi: 10.1186/1479-7364-5-4-304.
10
Data quality control in genetic case-control association studies.遗传病例对照关联研究中的数据质量控制。
Nat Protoc. 2010 Sep;5(9):1564-73. doi: 10.1038/nprot.2010.116. Epub 2010 Aug 26.
Nat Genet. 2005 Nov;37(11):1243-6. doi: 10.1038/ng1653. Epub 2005 Oct 9.
4
Cohort profile: 1958 British birth cohort (National Child Development Study).队列简介:1958年英国出生队列(全国儿童发展研究)。
Int J Epidemiol. 2006 Feb;35(1):34-41. doi: 10.1093/ije/dyi183. Epub 2005 Sep 9.
5
Genome-wide association studies: theoretical and practical concerns.全基因组关联研究:理论与实际问题
Nat Rev Genet. 2005 Feb;6(2):109-18. doi: 10.1038/nrg1522.
6
Highly multiplexed molecular inversion probe genotyping: over 10,000 targeted SNPs genotyped in a single tube assay.高度多重分子倒置探针基因分型:在单管检测中对超过10,000个靶向单核苷酸多态性进行基因分型。
Genome Res. 2005 Feb;15(2):269-75. doi: 10.1101/gr.3185605.
7
Incorporating genotyping uncertainty in haplotype inference for single-nucleotide polymorphisms.在单核苷酸多态性的单倍型推断中纳入基因分型不确定性。
Am J Hum Genet. 2004 Mar;74(3):495-510. doi: 10.1086/382284. Epub 2004 Feb 13.
8
Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power.利用单倍型标签检测由连锁不平衡引起的疾病关联:一类检验方法及统计效能的决定因素。
Hum Hered. 2003;56(1-3):18-31. doi: 10.1159/000073729.
9
Multiplexed genotyping with sequence-tagged molecular inversion probes.使用序列标签分子倒置探针进行多重基因分型。
Nat Biotechnol. 2003 Jun;21(6):673-8. doi: 10.1038/nbt821. Epub 2003 May 5.