Suppr超能文献

关于单倍型频率最大似然估计的注释

Notes on the maximum likelihood estimation of haplotype frequencies.

作者信息

Mano S, Yasuda N, Katoh T, Tounai K, Inoko H, Imanishi T, Tamiya G, Gojobori T

机构信息

Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan.

出版信息

Ann Hum Genet. 2004 May;68(Pt 3):257-64. doi: 10.1046/j.1529-8817.2003.00088.x.

Abstract

The maximum likelihood estimation (MLE) is one of the most popular ways to estimate haplotype frequencies of a population with genotype data whose linkage phases are unknown. The MLE is commonly implemented in the use of the Expectation-Maximization (EM) algorithm. It is known that the EM algorithm carries the risk that an estimator may converge erroneously to one of the local maxima or saddle points of the likelihood surface, resulting in serious errors in the MLE of haplotype frequencies. In this note, by theoretical treatments we present the necessary and sufficient conditions that the local maxima or saddle points on the likelihood surface appear. As a rule of thumb, that the difference between the coupling and repulsive haplotype frequencies in phase known individuals is 3/2 times larger than the frequency of phase ambiguous individuals is the sufficient condition that the likelihood surface is unimodal. Moreover, we present the analytic solution to the biallelic two-locus problem, and construct a general algorithm to obtain the global maximum.

摘要

最大似然估计(MLE)是估计连锁相未知的基因型数据群体单倍型频率最常用的方法之一。最大似然估计通常通过期望最大化(EM)算法来实现。已知EM算法存在一种风险,即估计值可能会错误地收敛到似然曲面的局部最大值或鞍点之一,从而导致单倍型频率的最大似然估计出现严重误差。在本笔记中,通过理论分析,我们给出了似然曲面上出现局部最大值或鞍点的充分必要条件。根据经验法则,已知相个体中耦合单倍型频率与排斥单倍型频率之差比相位模糊个体的频率大3/2倍,是似然曲面为单峰的充分条件。此外,我们给出了双等位基因双位点问题的解析解,并构建了一种获得全局最大值的通用算法。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验