Suppr超能文献

针对有缺失基因型数据的核心家庭和无关个体的基于似然性的关联分析。

Likelihood-based association analysis for nuclear families and unrelated subjects with missing genotype data.

作者信息

Dudbridge Frank

机构信息

MRC Biostatistics Unit, Cambridge, UK.

出版信息

Hum Hered. 2008;66(2):87-98. doi: 10.1159/000119108. Epub 2008 Mar 31.

Abstract

Missing data occur in genetic association studies for several reasons including missing family members and uncertain haplotype phase. Maximum likelihood is a commonly used approach to accommodate missing data, but it can be difficult to apply to family-based association studies, because of possible loss of robustness to confounding by population stratification. Here a novel likelihood for nuclear families is proposed, in which distinct sets of association parameters are used to model the parental genotypes and the offspring genotypes. This approach is robust to population structure when the data are complete, and has only minor loss of robustness when there are missing data. It also allows a novel conditioning step that gives valid analysis for multiple offspring in the presence of linkage. Unrelated subjects are included by regarding them as the children of two missing parents. Simulations and theory indicate similar operating characteristics to TRANSMIT, but with no bias with missing data in the presence of linkage. In comparison with FBAT and PCPH, the proposed model is slightly less robust to population structure but has greater power to detect strong effects. In comparison to APL and MITDT, the model is more robust to stratification and can accommodate sibships of any size. The methods are implemented for binary and continuous traits in software, UNPHASED, available from the author.

摘要

在基因关联研究中,缺失数据的出现有多种原因,包括家庭成员缺失和单倍型相位不确定。最大似然法是处理缺失数据常用的方法,但由于可能会因群体分层而失去对混杂因素的稳健性,所以难以应用于基于家系的关联研究。本文提出了一种针对核心家系的新似然法,其中使用不同的关联参数集来对父母基因型和后代基因型进行建模。当数据完整时,该方法对群体结构具有稳健性,而在存在缺失数据时,稳健性仅有轻微损失。它还允许一个新的条件步骤,在存在连锁的情况下对多个后代进行有效分析。通过将无关个体视为两个缺失父母的子女来纳入分析。模拟和理论表明,该方法与TRANSMIT具有相似的操作特性,但在存在连锁和缺失数据的情况下没有偏差。与FBAT和PCPH相比,所提出的模型对群体结构的稳健性略低,但检测强效应的能力更强。与APL和MITDT相比,该模型对分层更具稳健性,并且可以处理任何规模的同胞关系。这些方法已在作者提供的软件UNPHASED中针对二元性状和连续性状实现。

相似文献

2
A flexible model for association analysis in sibships with missing genotype data.
Ann Hum Genet. 2011 May;75(3):428-38. doi: 10.1111/j.1469-1809.2010.00636.x. Epub 2011 Jan 17.
4
Informative missingness in genetic association studies: case-parent designs.
Am J Hum Genet. 2003 Mar;72(3):671-80. doi: 10.1086/368276. Epub 2003 Feb 14.
7
Maximum-likelihood estimation of haplotype frequencies in nuclear families.
Genet Epidemiol. 2004 Jul;27(1):21-32. doi: 10.1002/gepi.10323.
8
9
Handling missing data in transmission disequilibrium test in nuclear families with one affected offspring.
PLoS One. 2012;7(10):e46100. doi: 10.1371/journal.pone.0046100. Epub 2012 Oct 8.
10
Accounting for linkage in family-based tests of association with missing parental genotypes.
Am J Hum Genet. 2003 Nov;73(5):1016-26. doi: 10.1086/378779. Epub 2003 Oct 9.

引用本文的文献

1
Genetic Variations in , , and Increase the Risk of Extreme Obesity.
J Obes. 2024 Oct 24;2024:3813621. doi: 10.1155/2024/3813621. eCollection 2024.
2
7
UBASH3A Interacts with PTPN22 to Regulate Expression and Risk for Type 1 Diabetes.
Int J Mol Sci. 2023 May 12;24(10):8671. doi: 10.3390/ijms24108671.
8
Analysis of neurotransmitters validates the importance of the dopaminergic system in autism spectrum disorder.
World J Pediatr. 2023 Aug;19(8):770-781. doi: 10.1007/s12519-023-00702-0. Epub 2023 Feb 27.
10
A three-pronged analysis confirms the association of the serotoninergic system with attention deficit hyperactivity disorder.
World J Pediatr. 2022 Dec;18(12):825-834. doi: 10.1007/s12519-022-00614-5. Epub 2022 Sep 19.

本文引用的文献

1
Dealing with missing data in family-based association studies: a multiple imputation approach.
Hum Hered. 2007;63(3-4):229-38. doi: 10.1159/000100481. Epub 2007 Mar 7.
2
Efficient association mapping of quantitative trait loci with selective genotyping.
Am J Hum Genet. 2007 Mar;80(3):567-76. doi: 10.1086/512727. Epub 2007 Jan 30.
6
WHAP: haplotype-based association analysis.
Bioinformatics. 2007 Jan 15;23(2):255-6. doi: 10.1093/bioinformatics/btl580. Epub 2006 Nov 21.
7
A tutorial on statistical methods for population association studies.
Nat Rev Genet. 2006 Oct;7(10):781-91. doi: 10.1038/nrg1916.
8
Efficient study designs for test of genetic association using sibship data and unrelated cases and controls.
Am J Hum Genet. 2006 May;78(5):778-792. doi: 10.1086/503711. Epub 2006 Mar 20.
9
Family-based designs in the age of large-scale gene-association studies.
Nat Rev Genet. 2006 May;7(5):385-94. doi: 10.1038/nrg1839.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验