Suppr超能文献

参数连锁分析中p值的计算

On computation of p-values in parametric linkage analysis.

作者信息

Kurbasic Azra, Hössjer Ola

机构信息

Division of Mathematical Statistics, Centre for Mathematical Sciences, Lund University, Sweden.

出版信息

Hum Hered. 2004;57(4):207-19. doi: 10.1159/000081448.

Abstract

Parametric linkage analysis is usually used to find chromosomal regions linked to a disease (phenotype) that is described with a specific genetic model. This is done by investigating the relations between the disease and genetic markers, that is, well-characterized loci of known position with a clear Mendelian mode of inheritance. Assume we have found an interesting region on a chromosome that we suspect is linked to the disease. Then we want to test the hypothesis of no linkage versus the alternative one of linkage. As a measure we use the maximal lod score Z(max). It is well known that the maximal lod score has asymptotically a (2 ln 10)(-1) x (1/2 chi2(0) + 1/2 chi2(1)) distribution under the null hypothesis of no linkage when only one point (one marker) on the chromosome is studied. In this paper, we show, both by simulations and theoretical arguments, that the null hypothesis distribution of Zmax has no simple form when more than one marker is used (multipoint analysis). In fact, the distribution of Zmax depends on the number of families, their structure, the assumed genetic model, marker denseness, and marker informativity. This means that a constant critical limit of Zmax leads to tests associated with different significance levels. Because of the above-mentioned problems, from the statistical point of view the maximal lod score should be supplemented by a p-value when results are reported.

摘要

参数连锁分析通常用于寻找与某种疾病(表型)相关的染色体区域,该疾病由特定的遗传模型描述。这是通过研究疾病与遗传标记之间的关系来实现的,即已知位置且具有明确孟德尔遗传模式的特征明确的基因座。假设我们在一条染色体上发现了一个有趣的区域,怀疑它与该疾病有关。然后我们要检验无连锁假设与连锁备择假设。作为一种衡量方法,我们使用最大对数优势分数Z(max)。众所周知,在无连锁的零假设下,当只研究染色体上的一个点(一个标记)时,最大对数优势分数渐近地具有(2 ln 10)(-1) x (1/2 chi2(0) + 1/2 chi2(1))分布。在本文中,我们通过模拟和理论论证表明,当使用多个标记(多点分析)时,Zmax的零假设分布没有简单的形式。事实上,Zmax的分布取决于家系数量、家系结构、假定的遗传模型、标记密度和标记信息性。这意味着Zmax的恒定临界值会导致与不同显著性水平相关的检验。由于上述问题,从统计学角度来看,在报告结果时,最大对数优势分数应该辅以p值。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验