使用整数线性规划进行最大似然家系重建。

Maximum likelihood pedigree reconstruction using integer linear programming.

机构信息

Department of Computer Science, University of York, York, North Yorkshire, United Kingdom.

出版信息

Genet Epidemiol. 2013 Jan;37(1):69-83. doi: 10.1002/gepi.21686. Epub 2012 Oct 3.

Abstract

Large population biobanks of unrelated individuals have been highly successful in detecting common genetic variants affecting diseases of public health concern. However, they lack the statistical power to detect more modest gene-gene and gene-environment interaction effects or the effects of rare variants for which related individuals are ideally required. In reality, most large population studies will undoubtedly contain sets of undeclared relatives, or pedigrees. Although a crude measure of relatedness might sometimes suffice, having a good estimate of the true pedigree would be much more informative if this could be obtained efficiently. Relatives are more likely to share longer haplotypes around disease susceptibility loci and are hence biologically more informative for rare variants than unrelated cases and controls. Distant relatives are arguably more useful for detecting variants with small effects because they are less likely to share masking environmental effects. Moreover, the identification of relatives enables appropriate adjustments of statistical analyses that typically assume unrelatedness. We propose to exploit an integer linear programming optimisation approach to pedigree learning, which is adapted to find valid pedigrees by imposing appropriate constraints. Our method is not restricted to small pedigrees and is guaranteed to return a maximum likelihood pedigree. With additional constraints, we can also search for multiple high-probability pedigrees and thus account for the inherent uncertainty in any particular pedigree reconstruction. The true pedigree is found very quickly by comparison with other methods when all individuals are observed. Extensions to more complex problems seem feasible.

摘要

大型无关个体人群生物库在检测影响公众健康关注的疾病的常见遗传变异方面非常成功。然而，它们缺乏检测适度基因-基因和基因-环境相互作用效应或稀有变异效应的统计能力，而相关个体是检测这些效应的理想选择。实际上，大多数大型人群研究无疑会包含一系列未申报的亲属或家系。虽然有时粗略的亲缘关系测量可能就足够了，但如果能够有效地获得，则对真实家系进行良好估计将更具信息量。亲属在疾病易感基因座周围更有可能共享更长的单倍型，因此对于稀有变异，他们比无关的病例和对照更具生物学信息。由于遥远的亲属不太可能共享掩蔽环境效应，因此对于检测小效应的变体，他们可能更有用。此外，识别亲属可以对统计分析进行适当调整，这些分析通常假设不存在亲缘关系。我们建议利用整数线性规划优化方法进行系谱学习，该方法通过施加适当的约束来找到有效的系谱。我们的方法不仅限于小系谱，并且保证返回最大似然系谱。通过附加约束，我们还可以搜索多个高概率系谱，从而考虑到任何特定系谱重建中的固有不确定性。当所有个体都被观察到时，与其他方法相比，通过比较可以快速找到真实的系谱。扩展到更复杂的问题似乎是可行的。

相似文献

Maximum likelihood pedigree reconstruction using integer linear programming.使用整数线性规划进行最大似然家系重建。

Genet Epidemiol. 2013 Jan;37(1):69-83. doi: 10.1002/gepi.21686. Epub 2012 Oct 3.

Improved maximum likelihood reconstruction of complex multi-generational pedigrees.复杂多代系谱的改进最大似然重建。

Theor Popul Biol. 2014 Nov;97:11-9. doi: 10.1016/j.tpb.2014.07.002. Epub 2014 Aug 10.

Maximum likelihood haplotyping for general pedigrees.一般家系的最大似然单倍型分型

Hum Hered. 2005;59(1):41-60. doi: 10.1159/000084736.

Incorporating genotyping uncertainty in haplotype frequency estimation in pedigree studies.在系谱研究中，将基因分型不确定性纳入单倍型频率估计。

Hum Hered. 2007;64(3):172-81. doi: 10.1159/000102990. Epub 2007 May 25.

Detecting familial aggregation.检测家族聚集性。

Methods Mol Biol. 2012;850:119-50. doi: 10.1007/978-1-61779-555-8_8.

Pedigree reconstruction in wild cichlid fish populations.野生丽鱼科鱼类种群的谱系重建

Mol Ecol. 2008 Oct;17(20):4500-11. doi: 10.1111/j.1365-294X.2008.03925.x.

Relationship uncertainty linkage statistics (RULS): affected relative pair statistics that model relationship uncertainty.关系不确定性关联统计（RULS）：对关系不确定性进行建模的受影响亲属对统计。

Genet Epidemiol. 2008 May;32(4):313-24. doi: 10.1002/gepi.20306.

Likelihood approach for detecting imprinting and in utero maternal effects using general pedigrees from prospective family-based association studies.利用前瞻性家庭关联研究中的一般系谱检测印记和子宫内母体效应的似然方法。

Biometrics. 2012 Jun;68(2):477-85. doi: 10.1111/j.1541-0420.2011.01695.x. Epub 2011 Oct 18.

Efficient maximum likelihood pedigree reconstruction.高效最大似然系谱重建

Theor Popul Biol. 2009 Dec;76(4):285-91. doi: 10.1016/j.tpb.2009.09.002. Epub 2009 Sep 23.

Validation of DNA-based identification software by computation of pedigree likelihood ratios.通过计算系谱似然比验证基于 DNA 的识别软件。

Forensic Sci Int Genet. 2011 Aug;5(4):308-15. doi: 10.1016/j.fsigen.2010.06.005. Epub 2010 Aug 21.

引用本文的文献

Consistent Second-Order Conic Integer Programming for Learning Bayesian Networks.用于学习贝叶斯网络的一致性二阶锥整数规划

J Mach Learn Res. 2023;24.

Integer Programming for Learning Directed Acyclic Graphs from Continuous Data.用于从连续数据学习有向无环图的整数规划

INFORMS J Optim. 2021 Winter;3(1):46-73. doi: 10.1287/ijoo.2019.0040. Epub 2020 Nov 3.

Bonsai: An efficient method for inferring large human pedigrees from genotype data.盆景：一种从基因型数据推断大型人类家系的有效方法。

Am J Hum Genet. 2021 Nov 4;108(11):2052-2070. doi: 10.1016/j.ajhg.2021.09.013.

Joint Estimation of Pedigrees and Effective Population Size Using Markov Chain Monte Carlo.使用马尔可夫链蒙特卡罗方法联合估计家系和有效种群大小。

Genetics. 2019 Jul;212(3):855-868. doi: 10.1534/genetics.119.302280. Epub 2019 May 22.

Constrained likelihood for reconstructing a directed acyclic Gaussian graph.用于重建有向无环高斯图的约束似然法。

Biometrika. 2019 Mar;106(1):109-125. doi: 10.1093/biomet/asy057. Epub 2018 Dec 13.

Composite likelihood method for inferring local pedigrees.用于推断局部谱系的复合似然法。

PLoS Genet. 2017 Aug 21;13(8):e1006963. doi: 10.1371/journal.pgen.1006963. eCollection 2017 Aug.

Strategies for determining kinship in wild populations using genetic data.利用遗传数据确定野生种群亲缘关系的策略。

Ecol Evol. 2016 Jul 29;6(17):6107-20. doi: 10.1002/ece3.2346. eCollection 2016 Sep.

Family tree and ancestry inference: is there a need for a 'generational' consent?家族谱系与血统推断：是否需要“代际”同意？

BMC Med Ethics. 2015 Dec 9;16(1):87. doi: 10.1186/s12910-015-0080-2.

PRIMUS: rapid reconstruction of pedigrees from genome-wide estimates of identity by descent.PRIMUS：通过全基因组的同源性估计快速重建家系。

Am J Hum Genet. 2014 Nov 6;95(5):553-64. doi: 10.1016/j.ajhg.2014.10.005. Epub 2014 Oct 30.

Historical pedigree reconstruction from extant populations using PArtitioning of RElatives (PREPARE).利用亲属关系划分法（PREPARE）从现存群体重建历史谱系。

PLoS Comput Biol. 2014 Jun 19;10(6):e1003610. doi: 10.1371/journal.pcbi.1003610. eCollection 2014 Jun.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用整数线性规划进行最大似然家系重建。

Maximum likelihood pedigree reconstruction using integer linear programming.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献