从标记数据中重建系谱的最大似然法的改进。

An improvement on the maximum likelihood reconstruction of pedigrees from marker data.

机构信息

Institute of Zoology, Zoological Society of London, London, UK.

出版信息

Heredity (Edinb). 2013 Aug;111(2):165-74. doi: 10.1038/hdy.2013.34. Epub 2013 Apr 24.

DOI:10.1038/hdy.2013.34

PMID:23612692

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3716262/

Abstract

Many methods have been proposed to reconstruct the pedigree of a sample of individuals from their multilocus marker genotypes. These methods, like those in other fields of statistical inferences, may suffer from both type I (falsely related) and type II (falsely unrelated) errors. In sibship reconstruction, type I errors come from the spurious fusion of two or more small sibships into a single sibship, and type II errors originate from the spurious splitting of a large sibship into two or more small sibships. In this study I investigate the tendencies of both types of errors made by the likelihood methods in sibship reconstruction, using both analytical and simulation approaches. I propose an improvement on the likelihood methods to reduce sibship splitting, and thus type II errors by downscaling the number of inferred siblings sharing the same genotype at a locus. Simulations are then conducted to compare the accuracy of the original and improved likelihood methods in sibship reconstruction of a large sample of individuals in full-sib families of the same small size, the same large size and highly variable sizes, using a variable number of loci with a variable number of alleles per locus. The methods were also applied to the analysis of a salmon data set. I show that my scaling scheme prevents effectively the splitting of large sibships, and reduces type II errors greatly with little increase in type I errors. As a result, it improves the overall accuracy of sibship assignments, except when sibships are expected to be uniformly small or marker information is unrealistically scarce.

摘要

许多方法已经被提出，用于从个体的多位点标记基因型中重建样本的系谱。这些方法，与统计推断的其他领域的方法一样，可能会受到Ⅰ类（错误相关）和Ⅱ类（错误不相关）错误的影响。在亲缘关系重建中，Ⅰ类错误源于两个或更多小亲缘关系融合成单个亲缘关系，Ⅱ类错误源于大亲缘关系错误地分裂成两个或更多小亲缘关系。在本研究中，我使用分析和模拟方法研究了亲缘关系重建中似然法产生的这两种错误的趋势。我提出了一种对似然法的改进，通过缩小推断出在一个位点上具有相同基因型的共享同一基因型的兄弟姐妹数量，从而减少亲缘关系分裂，进而减少Ⅱ类错误。然后，使用不同数量的具有不同等位基因数量的位点，对大小相同、大小高度可变的全同胞家庭中大量个体的亲缘关系重建进行模拟，比较原始和改进的似然法在亲缘关系重建中的准确性。该方法还应用于鲑鱼数据集的分析。我表明，我的缩放方案有效地防止了大亲缘关系的分裂，并大大减少了Ⅱ类错误，同时Ⅰ类错误略有增加。因此，它提高了亲缘关系分配的整体准确性，除非预期亲缘关系均匀较小或标记信息极不丰富。

相似文献

An improvement on the maximum likelihood reconstruction of pedigrees from marker data.从标记数据中重建系谱的最大似然法的改进。

Heredity (Edinb). 2013 Aug;111(2):165-74. doi: 10.1038/hdy.2013.34. Epub 2013 Apr 24.

Sibship reconstruction from genetic data with typing errors.基于存在分型错误的遗传数据进行同胞关系重建。

Genetics. 2004 Apr;166(4):1963-79. doi: 10.1534/genetics.166.4.1963.

Estimating quantitative genetic parameters using sibships reconstructed from marker data.利用从标记数据重建的同胞关系估计数量遗传参数。

Genetics. 2000 Aug;155(4):1961-72. doi: 10.1093/genetics/155.4.1961.

Likelihood-ratio affected sib-pair tests applied to multiply affected sibships: issues of power and type I error rate.应用于多个患病同胞对的似然比患病同胞对检验：效能和I型错误率问题

Genet Epidemiol. 2001 Jan;20(1):44-56. doi: 10.1002/1098-2272(200101)20:1<44::AID-GEPI5>3.0.CO;2-E.

Parentage and sibship inference from multilocus genotype data under polygamy.一夫多妻制下基于多位点基因型数据的亲子关系和同胞关系推断

Genetics. 2009 Apr;181(4):1579-94. doi: 10.1534/genetics.108.100214. Epub 2009 Feb 16.

Parentage and sibship inference from markers in polyploids.多倍体中基于标记的亲本及亲缘关系推断

Mol Ecol Resour. 2014 May;14(3):541-53. doi: 10.1111/1755-0998.12210. Epub 2013 Dec 23.

Optimal weighting scheme for affected sib-pair analysis of sibship data.同胞关系数据的患病同胞对分析的最优加权方案。

Ann Hum Genet. 1997 Jan;61(Pt 1):61-9. doi: 10.1046/j.1469-1809.1997.6110059.x.

Accurate partition of individuals into full-sib families from genetic data without parental information.在没有亲本信息的情况下，根据遗传数据将个体准确划分到全同胞家系中。

Genetics. 2001 Jul;158(3):1329-38. doi: 10.1093/genetics/158.3.1329.

Robustness and power of the maximum-likelihood-binomial and maximum-likelihood-score methods, in multipoint linkage analysis of affected-sibship data.在受累同胞对数据的多点连锁分析中，最大似然二项式法和最大似然评分法的稳健性与效能

Am J Hum Genet. 1998 Aug;63(2):638-47. doi: 10.1086/301958.

Computationally efficient sibship and parentage assignment from multilocus marker data.基于多位点标记数据的高效同胞和亲子关系鉴定。

Genetics. 2012 May;191(1):183-94. doi: 10.1534/genetics.111.138149. Epub 2012 Feb 23.

引用本文的文献

Devastating disease can cause increased breeding effort and success that improves population resilience.毁灭性疾病可能会导致繁殖努力增加和繁殖成功率提高，从而增强种群恢复力。

Open Biol. 2025 May;15(5):240385. doi: 10.1098/rsob.240385. Epub 2025 May 28.

Pedigree reconstruction and distant pairwise relatedness estimation from genome sequence data: A demonstration in a population of rhesus macaques (Macaca mulatta).从基因组序列数据中进行家系重建和远缘亲缘关系估计：恒河猴（Macaca mulatta）群体中的演示。

Mol Ecol Resour. 2021 May;21(4):1333-1346. doi: 10.1111/1755-0998.13317. Epub 2021 Jan 27.

Restored river habitat provides a natural spawning area for a critically endangered landlocked Atlantic salmon population.恢复的河流生境为濒临灭绝的大西洋鲑鱼种群提供了天然的产卵区。

PLoS One. 2020 May 21;15(5):e0232723. doi: 10.1371/journal.pone.0232723. eCollection 2020.

Sibship assignment to the founders of a Bangladeshi Catla catla breeding population.将孟加拉国有须鲫养殖群体的创始个体分配给同一家系。

Genet Sel Evol. 2019 Apr 29;51(1):17. doi: 10.1186/s12711-019-0454-x.

Genomic analysis of morphometric traits in bighorn sheep using the Ovine Infinium HD SNP BeadChip.使用绵羊Infinium HD SNP基因分型芯片对大角羊形态特征进行基因组分析。

PeerJ. 2018 Feb 12;6:e4364. doi: 10.7717/peerj.4364. eCollection 2018.

Pedigree reconstruction from SNP data: parentage assignment, sibship clustering and beyond.从 SNP 数据进行系谱重建：亲子关系分配、亲缘聚类及其他。

Mol Ecol Resour. 2017 Sep;17(5):1009-1024. doi: 10.1111/1755-0998.12665. Epub 2017 Apr 6.

A supergene determines highly divergent male reproductive morphs in the ruff.一个超级基因决定了流苏鹬雄性生殖形态的高度分化。

Nat Genet. 2016 Jan;48(1):79-83. doi: 10.1038/ng.3443. Epub 2015 Nov 16.

本文引用的文献

Computationally efficient sibship and parentage assignment from multilocus marker data.基于多位点标记数据的高效同胞和亲子关系鉴定。

Genetics. 2012 May;191(1):183-94. doi: 10.1534/genetics.111.138149. Epub 2012 Feb 23.

A new version of PRT software for sibling groups reconstruction with comments regarding several issues in the sibling reconstruction problem.PRT 软件的新版本，用于兄弟姐妹群组重建，并对兄弟姐妹重建问题中的几个问题进行了评论。

Mol Ecol Resour. 2012 Jan;12(1):164-78. doi: 10.1111/j.1755-0998.2011.03061.x. Epub 2011 Aug 26.

Parentage and sibship inference from multilocus genotype data under polygamy.一夫多妻制下基于多位点基因型数据的亲子关系和同胞关系推断

Genetics. 2009 Apr;181(4):1579-94. doi: 10.1534/genetics.108.100214. Epub 2009 Feb 16.

Wild pedigrees: the way forward.野生谱系：前进的道路。

Proc Biol Sci. 2008 Mar 22;275(1635):613-21. doi: 10.1098/rspb.2007.1531.

Reconstructing sibling relationships in wild populations.重建野生种群中的同胞关系。

Bioinformatics. 2007 Jul 1;23(13):i49-56. doi: 10.1093/bioinformatics/btm219.

Parentage and sibship exclusions: higher statistical power with more family members.亲子关系和同胞关系排除：家庭成员越多，统计效力越高。

Heredity (Edinb). 2007 Aug;99(2):205-17. doi: 10.1038/sj.hdy.6800984. Epub 2007 May 9.

Accuracy, efficiency and robustness of four algorithms allowing full sibship reconstruction from DNA marker data.四种可根据DNA标记数据进行全同胞关系重建的算法的准确性、效率和稳健性。

Mol Ecol. 2004 Jun;13(6):1589-600. doi: 10.1111/j.1365-294X.2004.02152.x.

Sibship reconstruction from genetic data with typing errors.基于存在分型错误的遗传数据进行同胞关系重建。

Genetics. 2004 Apr;166(4):1963-79. doi: 10.1534/genetics.166.4.1963.

Sibship reconstruction in hierarchical population structures using Markov chain Monte Carlo techniques.使用马尔可夫链蒙特卡罗技术在分层群体结构中进行亲缘关系重建。

Genet Res. 2002 Jun;79(3):227-34. doi: 10.1017/s0016672302005669.

Relationship inference from trios of individuals, in the presence of typing error.存在分型错误情况下个体三人组之间的关系推断。

Am J Hum Genet. 2002 Jan;70(1):170-80. doi: 10.1086/338444. Epub 2001 Nov 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验