• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过对具有信息性缺失的单核苷酸多态性(SNP)基因型数据进行建模来减少等位基因频率估计的偏差。

Reducing bias of allele frequency estimates by modeling SNP genotype data with informative missingness.

作者信息

Lin Wan-Yu, Liu Nianjun

机构信息

Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University Taipei, Taiwan.

出版信息

Front Genet. 2012 Jun 18;3:107. doi: 10.3389/fgene.2012.00107. eCollection 2012.

DOI:10.3389/fgene.2012.00107
PMID:22719749
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3376470/
Abstract

The presence of missing single-nucleotide polymorphism (SNP) genotypes is common in genetic studies. For studies with low-density SNPs, the most commonly used approach to dealing with genotype missingness is to simply remove the observations with missing genotypes from the analyses. This naïve method is straightforward but is valid only when the missingness is random. However, a given assay often has a different capability in genotyping heterozygotes and homozygotes, causing the phenomenon of "differential dropout" in the sense that the missing rates of heterozygotes and homozygotes are different. In practice, differential dropout among genotypes exists in even carefully designed studies, such as the data from the HapMap project and the Wellcome Trust Case Control Consortium. Under the assumption of Hardy-Weinberg equilibrium and no genotyping error, we here propose a statistical method to model the differential dropout among different genotypes. Compared with the naïve method, our method provides more accurate allele frequency estimates when the differential dropout is present. To demonstrate its practical use, we further apply our method to the HapMap data and a scleroderma data set.

摘要

在基因研究中,单核苷酸多态性(SNP)基因型缺失的情况很常见。对于低密度SNP的研究,处理基因型缺失最常用的方法是在分析中简单地剔除基因型缺失的观测值。这种简单的方法很直接,但仅在缺失是随机的情况下才有效。然而,给定的检测方法在对杂合子和纯合子进行基因分型时往往具有不同的能力,从而导致“差异缺失”现象,即杂合子和纯合子的缺失率不同。实际上,即使在精心设计的研究中,如国际人类基因组单体型图计划(HapMap)项目和威康信托病例对照协会的数据,基因型之间也存在差异缺失。在哈迪-温伯格平衡和无基因分型错误的假设下,我们在此提出一种统计方法来模拟不同基因型之间的差异缺失。与简单方法相比,当存在差异缺失时,我们的方法能提供更准确的等位基因频率估计。为了展示其实际应用,我们进一步将我们的方法应用于HapMap数据和一个硬皮病数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d51/3376470/0db2f0e45f9a/fgene-03-00107-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d51/3376470/0db2f0e45f9a/fgene-03-00107-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d51/3376470/0db2f0e45f9a/fgene-03-00107-g001.jpg

相似文献

1
Reducing bias of allele frequency estimates by modeling SNP genotype data with informative missingness.通过对具有信息性缺失的单核苷酸多态性(SNP)基因型数据进行建模来减少等位基因频率估计的偏差。
Front Genet. 2012 Jun 18;3:107. doi: 10.3389/fgene.2012.00107. eCollection 2012.
2
Differential dropout among SNP genotypes and impacts on association tests.单核苷酸多态性(SNP)基因型之间的差异缺失及其对关联测试的影响。
Hum Hered. 2007;63(3-4):219-28. doi: 10.1159/000100480. Epub 2007 Mar 7.
3
A maximum-likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes.一种无重复基因型的微卫星数据等位基因缺失校正的最大似然法。
Genetics. 2012 Oct;192(2):651-69. doi: 10.1534/genetics.112.139519. Epub 2012 Jul 30.
4
Missingness in the T1DGC MHC fine-mapping SNP data: association with HLA genotype and potential influence on genetic association studies.青少年糖尿病研究组(T1DGC)MHC精细定位单核苷酸多态性(SNP)数据中的缺失值:与HLA基因型的关联以及对基因关联研究的潜在影响。
Diabetes Obes Metab. 2009 Feb;11 Suppl 1(Suppl 1):101-7. doi: 10.1111/j.1463-1326.2008.01010.x.
5
A novel test of informative missingness using inconsistent linkage disequilibrium signals between case-parent triads and incomplete data.一种利用病例-父母三体型之间不一致的连锁不平衡信号来测试信息缺失的新方法,同时考虑不完全数据。
J Hum Genet. 2012 Sep;57(9):601-9. doi: 10.1038/jhg.2012.78. Epub 2012 Jun 28.
6
Simple SNP-based minimal marker genotyping for Humulus lupulus L. identification and variety validation.基于单核苷酸多态性(SNP)的简易最小标记基因分型用于啤酒花鉴定和品种验证。
BMC Res Notes. 2015 Oct 6;8:542. doi: 10.1186/s13104-015-1492-2.
7
Statistical inference for Hardy-Weinberg proportions in the presence of missing genotype information.存在缺失基因型信息时 Hardy-Weinberg 比例的统计推断。
PLoS One. 2013 Dec 31;8(12):e83316. doi: 10.1371/journal.pone.0083316. eCollection 2013.
8
Departure from Hardy Weinberg Equilibrium and Genotyping Error.偏离哈迪-温伯格平衡与基因分型错误。
Front Genet. 2017 Oct 31;8:167. doi: 10.3389/fgene.2017.00167. eCollection 2017.
9
Exact Inference for Hardy-Weinberg Proportions with Missing Genotypes: Single and Multiple Imputation.针对存在缺失基因型的哈迪-温伯格比例的精确推断:单重和多重填补
G3 (Bethesda). 2015 Sep 15;5(11):2365-73. doi: 10.1534/g3.115.022111.
10
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.

引用本文的文献

1
Machine Learning Methods for Classifying Multiple Sclerosis and Alzheimer's Disease Using Genomic Data.使用基因组数据对多发性硬化症和阿尔茨海默病进行分类的机器学习方法
Int J Mol Sci. 2025 Feb 27;26(5):2085. doi: 10.3390/ijms26052085.
2
Leveraging informative missing data to learn about acute respiratory distress syndrome and mortality in long-term hospitalized COVID-19 patients throughout the years of the pandemic.利用有信息价值的缺失数据了解大流行期间长期住院的 COVID-19 患者的急性呼吸窘迫综合征和死亡率。
AMIA Annu Symp Proc. 2024 Jan 11;2023:942-950. eCollection 2023.
3
Leveraging informative missing data to learn about acute respiratory distress syndrome and mortality in long-term hospitalized COVID-19 patients throughout the years of the pandemic.

本文引用的文献

1
Practical Consideration of Genotype Imputation: Sample Size, Window Size, Reference Choice, and Untyped Rate.基因型填充的实际考量:样本量、窗口大小、参考选择及未分型率
Stat Interface. 2011;4(3):339-352. doi: 10.4310/sii.2011.v4.n3.a8.
2
Confirmation of association of the macrophage migration inhibitory factor gene with systemic sclerosis in a large European population.确认巨噬细胞移动抑制因子基因与欧洲大样本系统性硬化症的相关性。
Rheumatology (Oxford). 2011 Nov;50(11):1976-81. doi: 10.1093/rheumatology/ker259. Epub 2011 Aug 28.
3
Imputation of missing genotypes from sparse to high density using long-range phasing.
在新冠疫情的数年中,利用信息性缺失数据了解长期住院的新冠患者的急性呼吸窘迫综合征和死亡率。
medRxiv. 2023 Dec 19:2023.12.18.23300181. doi: 10.1101/2023.12.18.23300181.
4
Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?信息缺失:从电子健康记录中缺失实验室数据的模式中我们能学到什么?
J Biomed Inform. 2023 Mar;139:104306. doi: 10.1016/j.jbi.2023.104306. Epub 2023 Feb 3.
5
Systematically Prioritizing Candidates in Genome-Based Drug Repurposing.在基于基因组的药物重新利用中系统地对候选药物进行优先级排序。
Assay Drug Dev Technol. 2019 Nov/Dec;17(8):352-363. doi: 10.1089/adt.2019.950. Epub 2019 Nov 26.
利用长程定相对稀疏至高密度缺失基因型进行推断。
Genetics. 2011 Sep;189(1):317-27. doi: 10.1534/genetics.111.128082. Epub 2011 Jul 29.
4
Epidemiology of systemic sclerosis.系统性硬化症的流行病学。
Best Pract Res Clin Rheumatol. 2010 Dec;24(6):857-69. doi: 10.1016/j.berh.2010.10.007.
5
Imputation of missing single nucleotide polymorphism genotypes using a multivariate mixed model framework.使用多元混合模型框架对缺失的单核苷酸多态性基因型进行推断。
J Anim Sci. 2011 Jul;89(7):2042-9. doi: 10.2527/jas.2010-3297. Epub 2011 Feb 25.
6
Imputation of genotypes from different single nucleotide polymorphism panels in dairy cattle.从奶牛不同单核苷酸多态性面板中推断基因型。
J Dairy Sci. 2010 Nov;93(11):5443-54. doi: 10.3168/jds.2010-3255.
7
Modeling Informatively Missing Genotypes in Haplotype Analysis.单倍型分析中信息性缺失基因型的建模
Commun Stat Theory Methods. 2009;38(18):3445-3460. doi: 10.1080/03610920802696588.
8
Bayesian methods for examining Hardy-Weinberg equilibrium.用于检验哈迪-温伯格平衡的贝叶斯方法。
Biometrics. 2010 Mar;66(1):257-65. doi: 10.1111/j.1541-0420.2009.01267.x. Epub 2009 May 12.
9
Genotyping error detection in samples of unrelated individuals without replicate genotyping.在无重复基因分型的无关个体样本中进行基因分型错误检测。
Hum Hered. 2009;67(3):154-62. doi: 10.1159/000181153. Epub 2008 Dec 15.
10
Imputation of missing genotypes: an empirical evaluation of IMPUTE.缺失基因型的插补:IMPUTE的实证评估
BMC Genet. 2008 Dec 12;9:85. doi: 10.1186/1471-2156-9-85.