Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, 650091, China.
Genetics. 2013 Aug;194(4):927-36. doi: 10.1534/genetics.113.151571. Epub 2013 May 1.
Most studies of mutation rates implicitly assume that they remain constant throughout development of the germline. However, researchers recently used a novel statistical framework to reveal that mutation rates differ dramatically during sperm development in Drosophila melanogaster. Here a general framework is described for the inference of germline mutation patterns, generated from either mutation screening experiments or DNA sequence polymorphism data, that enables analysis of more than two mutations per family. The inference is made more rigorous and flexible by providing a better approximation of the probabilities of patterns of mutations and an improved coalescent algorithm within a single host with realistic assumptions. The properties of the inference framework, both the estimation and the hypothesis testing, were investigated by simulation. The refined inference framework is shown to provide (1) nearly unbiased maximum-likelihood estimates of mutation rates and (2) robust hypothesis testing using the standard asymptotic distribution of the likelihood-ratio tests. It is readily applicable to data sets in which multiple mutations in the same family are common.
大多数突变率的研究都隐含地假设它们在生殖细胞系的发育过程中保持不变。然而,研究人员最近使用一种新颖的统计框架揭示了果蝇精子发育过程中突变率的显著差异。本文描述了一个一般的框架,用于推断生殖系突变模式,该框架可以从突变筛选实验或 DNA 序列多态性数据中生成,能够分析每个家族中的多个突变。通过提供对突变模式概率的更好近似和改进的同系发生算法,以及在具有现实假设的单个宿主中进行的改进,使推断更加严格和灵活。通过模拟研究了推断框架的性质,包括估计和假设检验。结果表明,经过改进的推断框架能够(1)提供突变率的几乎无偏最大似然估计,(2)使用似然比检验的标准渐近分布进行稳健的假设检验。它可以很容易地应用于在同一家族中存在多个突变的数据集。