• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在基因关联研究中检测样本误认

Detecting sample misidentifications in genetic association studies.

作者信息

Ekstrøm Claus T, Feenstra Bjarke

机构信息

University of Southern Denmark, Biostatistics, Faculty of Health Sciences, Denmark.

出版信息

Stat Appl Genet Mol Biol. 2012;11(3):Article 13. doi: 10.1515/1544-6115.1772.

DOI:10.1515/1544-6115.1772
PMID:22611595
Abstract

Genetic association studies require that the genotype data from a given person can be correctly linked to the phenotype data from the same person. However, sample misidentification errors sometimes happen, whereby the link becomes invalid for some of the subjects in a study. This can have substantial consequences in terms of power to detect truly associated variants. In family-based studies, Mendelian inconsistencies can be used to detect sample misidentification. Genome-wide association studies (GWAS), however, typically use unrelated individuals, making error detection more problematic. Here we present a method for identifying potential sample misidentifications in GWAS and other genetic association studies building on ideas from forensic sciences. A widely used ad-hoc method for error detection is to check if the sex of an individual matches its X-linked genotype. We generalize this idea to less stringent associations between known genotypes and phenotypes, and show that if several known associations are combined, the power to detect misidentifications increases substantially. Individuals with an unlikely set of phenotypes given their genotypes are flagged as potential errors. We provide analytical and simulation results comparing the odds that the genotype and phenotype are both from the same individual for different numbers of available genotype-p henotype associations and for different information content of the associations. Our method has good sensitivity and specificity with as few as ten moderately informative genotype-phenotype associations. We apply the method to GWAS data from the Danish National Birth Cohort.

摘要

基因关联研究要求来自特定个体的基因型数据能够正确地与来自同一个体的表型数据相关联。然而,样本误识别错误有时会发生,从而导致研究中的某些受试者的这种关联变得无效。这在检测真正相关变异的效能方面可能会产生重大后果。在基于家系的研究中,孟德尔不一致性可用于检测样本误识别。然而,全基因组关联研究(GWAS)通常使用无亲缘关系的个体,这使得错误检测变得更具问题。在此,我们基于法医学的理念,提出一种在GWAS和其他基因关联研究中识别潜在样本误识别的方法。一种广泛使用的临时错误检测方法是检查个体的性别是否与其X连锁基因型匹配。我们将这一理念推广到已知基因型和表型之间不太严格的关联,并表明如果将几个已知关联结合起来,检测误识别的效能会大幅提高。根据其基因型具有一组不太可能的表型的个体被标记为潜在错误。我们提供了分析和模拟结果,比较了对于不同数量的可用基因型 - 表型关联以及不同关联信息含量,基因型和表型均来自同一个体的概率。我们的方法在仅有十个中等信息量的基因型 - 表型关联时就具有良好的敏感性和特异性。我们将该方法应用于丹麦国家出生队列的GWAS数据。

相似文献

1
Detecting sample misidentifications in genetic association studies.在基因关联研究中检测样本误认
Stat Appl Genet Mol Biol. 2012;11(3):Article 13. doi: 10.1515/1544-6115.1772.
2
Backward genotype-transcript-phenotype association mapping.反向基因型-转录本-表型关联映射。
Methods. 2017 Oct 1;129:18-23. doi: 10.1016/j.ymeth.2017.09.004. Epub 2017 Sep 14.
3
A novel association test for multiple secondary phenotypes from a case-control GWAS.一种针对病例对照全基因组关联研究中多个次要表型的新型关联测试。
Genet Epidemiol. 2017 Jul;41(5):413-426. doi: 10.1002/gepi.22045. Epub 2017 Apr 10.
4
PSEA: Phenotype Set Enrichment Analysis--a new method for analysis of multiple phenotypes.PSEA:表型集富集分析——一种分析多种表型的新方法。
Genet Epidemiol. 2012 Apr;36(3):244-52. doi: 10.1002/gepi.21617.
5
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
6
Covariate selection for association screening in multiphenotype genetic studies.多表型遗传研究中关联筛选的协变量选择
Nat Genet. 2017 Dec;49(12):1789-1795. doi: 10.1038/ng.3975. Epub 2017 Oct 16.
7
Extreme-phenotype genome-wide association study (XP-GWAS): a method for identifying trait-associated variants by sequencing pools of individuals selected from a diversity panel.极端表型全基因组关联研究(XP-GWAS):一种通过对从多样性面板中选择的个体群体进行测序来识别与特征相关变体的方法。
Plant J. 2015 Nov;84(3):587-96. doi: 10.1111/tpj.13029.
8
Model-based assessment of replicability for genome-wide association meta-analysis.基于模型的全基因组关联荟萃分析可重复性评估。
Nat Commun. 2021 Mar 30;12(1):1964. doi: 10.1038/s41467-021-21226-z.
9
MARS: leveraging allelic heterogeneity to increase power of association testing.MARS:利用等位基因异质性提高关联测试的功效。
Genome Biol. 2021 Apr 30;22(1):128. doi: 10.1186/s13059-021-02353-8.
10
Simulating autosomal genotypes with realistic linkage disequilibrium and a spiked-in genetic effect.模拟具有真实连锁不平衡和插入遗传效应的常染色体基因型。
BMC Bioinformatics. 2018 Jan 2;19(1):2. doi: 10.1186/s12859-017-2004-2.

引用本文的文献

1
reGenotyper: Detecting mislabeled samples in genetic data.reGenotyper:检测基因数据中标记错误的样本。
PLoS One. 2017 Feb 13;12(2):e0171324. doi: 10.1371/journal.pone.0171324. eCollection 2017.
2
Identification and Correction of Sample Mix-Ups in Expression Genetic Data: A Case Study.表达遗传数据中样本混淆的识别与纠正:一个案例研究
G3 (Bethesda). 2015 Aug 19;5(10):2177-86. doi: 10.1534/g3.115.019778.