Fine Jason P, Zou Fei, Yandell Brian S
Department of Statistics, University of Wisconsin, Madison, WI 53706, USA.
Biostatistics. 2004 Oct;5(4):501-13. doi: 10.1093/biostatistics/kxh004.
Interval mapping of quantitative trait loci from breeding experiments plays an important role in understanding the mechanisms of disease, both in humans and other organisms. Standard approaches to estimation involve parametric assumptions for the component distributions and may be sensitive to model misspecification. Some nonparametric tests have been studied. However, nonparametric estimation of the phenotypic distributions has not been considered in the genetics literature, even though such methods might provide essential nonparametric summaries for comparing different loci. We develop a sufficient condition for identifiability of the phenotypic distributions. Simple nonparametric estimators for the distributions are proposed for uncensored and right censored data. They have a closed form and their small and large sample properties are readily established. Their practical utility as numerical summaries which complement nonparametric tests is demonstrated on two recent genetics examples.
育种实验中数量性状基因座的区间定位在理解人类和其他生物体疾病机制方面发挥着重要作用。标准的估计方法涉及对成分分布的参数假设,并且可能对模型误设敏感。已经研究了一些非参数检验。然而,即使此类方法可能为比较不同基因座提供重要的非参数总结,但遗传学文献中尚未考虑对表型分布进行非参数估计。我们为表型分布的可识别性建立了一个充分条件。针对未删失和右删失数据,提出了分布的简单非参数估计量。它们具有封闭形式,并且其小样本和大样本性质很容易确定。在最近的两个遗传学实例中展示了它们作为补充非参数检验的数值总结的实际效用。