• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Modeling Informatively Missing Genotypes in Haplotype Analysis.单倍型分析中信息性缺失基因型的建模
Commun Stat Theory Methods. 2009;38(18):3445-3460. doi: 10.1080/03610920802696588.
2
Haplotype analysis in the presence of informatively missing genotype data.存在信息性缺失基因型数据时的单倍型分析。
Genet Epidemiol. 2006 May;30(4):290-300. doi: 10.1002/gepi.20144.
3
Quantifying the amount of missing information in genetic association studies.量化基因关联研究中缺失信息的数量。
Genet Epidemiol. 2006 Dec;30(8):703-17. doi: 10.1002/gepi.20181.
4
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
5
Haplotype frequency estimation error analysis in the presence of missing genotype data.存在缺失基因型数据时的单倍型频率估计误差分析
BMC Bioinformatics. 2004 Dec 1;5:188. doi: 10.1186/1471-2105-5-188.
6
Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data.通过针对未分型二倍体基因型数据的期望最大化算法,对等位基因位点单倍型频率估计的准确性。
Am J Hum Genet. 2000 Oct;67(4):947-59. doi: 10.1086/303069. Epub 2000 Aug 22.
7
The APL test: extension to general nuclear families and haplotypes and examination of its robustness.APL 检验:扩展至一般核心家庭和单倍型及其稳健性检验
Hum Hered. 2006;61(4):189-99. doi: 10.1159/000094774. Epub 2006 Jul 27.
8
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.HAPLORE:一个用于在无重组的一般家系中进行单倍型重建的程序。
Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1.
9
The use of family relationships and linkage disequilibrium to impute phase and missing genotypes in up to whole-genome sequence density genotypic data.利用家族关系和连锁不平衡信息,对全基因组序列密度基因型数据进行相位推断和缺失基因型填补。
Genetics. 2010 Aug;185(4):1441-9. doi: 10.1534/genetics.110.113936. Epub 2010 May 17.
10
Characterisation of SNP haplotype structure in chemokine and chemokine receptor genes using CEPH pedigrees and statistical estimation.利用CEPH家系和统计估计对趋化因子和趋化因子受体基因中的单核苷酸多态性单倍型结构进行表征。
Hum Genomics. 2004 Mar;1(3):195-207. doi: 10.1186/1479-7364-1-3-195.

引用本文的文献

1
Reducing bias of allele frequency estimates by modeling SNP genotype data with informative missingness.通过对具有信息性缺失的单核苷酸多态性(SNP)基因型数据进行建模来减少等位基因频率估计的偏差。
Front Genet. 2012 Jun 18;3:107. doi: 10.3389/fgene.2012.00107. eCollection 2012.
2
A powerful test of parent-of-origin effects for quantitative traits using haplotypes.利用单倍型对数量性状的母源效应进行强有力的检验。
PLoS One. 2011;6(12):e28909. doi: 10.1371/journal.pone.0028909. Epub 2011 Dec 13.

本文引用的文献

1
Methods to impute missing genotypes for population data.用于推算群体数据中缺失基因型的方法。
Hum Genet. 2007 Dec;122(5):495-504. doi: 10.1007/s00439-007-0427-y. Epub 2007 Sep 13.
2
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.对14000例七种常见疾病患者及3000例共享对照进行全基因组关联研究。
Nature. 2007 Jun 7;447(7145):661-78. doi: 10.1038/nature05911.
3
Sequential haplotype scan methods for association analysis.用于关联分析的序列单倍型扫描方法。
Genet Epidemiol. 2007 Sep;31(6):553-64. doi: 10.1002/gepi.20228.
4
Differential dropout among SNP genotypes and impacts on association tests.单核苷酸多态性(SNP)基因型之间的差异缺失及其对关联测试的影响。
Hum Hered. 2007;63(3-4):219-28. doi: 10.1159/000100480. Epub 2007 Mar 7.
5
An Arabidopsis example of association mapping in structured samples.一个拟南芥在结构化样本中进行关联作图的例子。
PLoS Genet. 2007 Jan 19;3(1):e4. doi: 10.1371/journal.pgen.0030004. Epub 2006 Nov 22.
6
Macrophage migration inhibitory factor promoter polymorphisms and the clinical expression of scleroderma.巨噬细胞移动抑制因子启动子多态性与硬皮病的临床表型
Arthritis Rheum. 2006 Nov;54(11):3661-9. doi: 10.1002/art.22179.
7
GEL: a novel genotype calling algorithm using empirical likelihood.GEL:一种使用经验似然的新型基因型分型算法。
Bioinformatics. 2006 Aug 15;22(16):1942-7. doi: 10.1093/bioinformatics/btl341. Epub 2006 Jun 29.
8
Haplotype analysis in the presence of informatively missing genotype data.存在信息性缺失基因型数据时的单倍型分析。
Genet Epidemiol. 2006 May;30(4):290-300. doi: 10.1002/gepi.20144.
9
Maximum likelihood estimation of haplotype effects and haplotype-environment interactions in association studies.关联研究中单体型效应及单体型-环境相互作用的最大似然估计
Genet Epidemiol. 2005 Dec;29(4):299-312. doi: 10.1002/gepi.20098.
10
Genetics of scleroderma: update on single nucleotide polymorphism analysis and microarrays.硬皮病的遗传学:单核苷酸多态性分析与微阵列研究进展
Curr Opin Rheumatol. 2005 Nov;17(6):761-7. doi: 10.1097/01.bor.0000179943.27777.b1.

单倍型分析中信息性缺失基因型的建模

Modeling Informatively Missing Genotypes in Haplotype Analysis.

作者信息

Liu Nianjun, Bucala Richard, Zhao Hongyu

机构信息

Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL.

出版信息

Commun Stat Theory Methods. 2009;38(18):3445-3460. doi: 10.1080/03610920802696588.

DOI:10.1080/03610920802696588
PMID:20052310
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2801447/
Abstract

It is common to have missing genotypes in practical genetic studies. The majority of the existing statistical methods, including those on haplotype analysis, assume that genotypes are missing at random-that is, at a given marker, different genotypes and different alleles are missing with the same probability. In our previous work, we have demonstrated that the violation of this assumption may lead to serious bias in haplotype frequency estimates and haplotype association analysis. We have proposed a general missing data model to simultaneously characterize missing data patterns across a set of two or more biallelic markers. We have proved that haplotype frequencies and missing data probabilities are identifiable if and only if there is linkage disequilibrium between these markers under the general missing data model. In this study, we extend our work to multi-allelic markers and observe a similar finding. Simulation studies on the analysis of haplotypes consisting of two markers illustrate that our proposed model can reduce the bias for haplotype frequency estimates due to incorrect assumptions on the missing data mechanism. Finally, we illustrate the utilities of our method through its application to a real data set from a study of scleroderma.

摘要

在实际的遗传学研究中,基因型缺失的情况很常见。现有的大多数统计方法,包括那些用于单倍型分析的方法,都假定基因型是随机缺失的——也就是说,在给定的标记位点,不同的基因型和不同的等位基因以相同的概率缺失。在我们之前的工作中,我们已经证明,违反这一假设可能会导致单倍型频率估计和单倍型关联分析中出现严重偏差。我们提出了一个通用的缺失数据模型,以同时刻画一组两个或更多双等位基因标记位点上的缺失数据模式。我们已经证明,在通用缺失数据模型下,当且仅当这些标记位点之间存在连锁不平衡时,单倍型频率和缺失数据概率才是可识别的。在本研究中,我们将工作扩展到多等位基因标记位点,并观察到了类似的结果。对由两个标记位点组成的单倍型进行分析的模拟研究表明,我们提出的模型可以减少由于对缺失数据机制的错误假设而导致的单倍型频率估计偏差。最后,我们通过将我们的方法应用于一项硬皮病研究的真实数据集来说明其效用。