Morris J K, Wald N J
Wolfson Institute of Preventive Medicine, Barts and the London Queen Mary's School of Medicine and Dentistry, London EC1M 6BQ, UK.
J Med Screen. 2005;12(3):155-60. doi: 10.1258/0969141054855283.
The screening performance of tests involving multiple markers is usually presented visually as two Gaussian relative frequency distributions of risk, one curve relating to affected and the other to unaffected individuals. If the distribution of the underlying screening markers is approximately Gaussian, risk estimates based on the same markers will usually also be approximately Gaussian. However, this approximation sometimes fails. Here we examine the circumstances when this occurs.
A theoretical statistical analysis.
Hypothetical log Gaussian relative distributions of affected and unaffected individuals were generated for three antenatal screening markers for Down's syndrome. Log likelihood ratios were calculated for each marker value and plots of the relative frequency distributions were compared with plots of Gaussian distributions based on the means and standard deviations of these log likelihood ratios.
When the standard deviations of the distributions of a perfectly Gaussian screening marker are similar in affected and unaffected individuals, the distributions of risk estimates are also approximately Gaussian. If the standard deviations differ materially, incorrectly assuming that the distributions of the risk estimates are Gaussian creates a graphical anomaly in which the distributions of risk in affected and unaffected individuals plotted on a continuous risk scale intersect in two places. This is theoretically impossible. Plotting the risk distributions empirically reveals that all individuals have an estimated risk above a specified value. For individuals with more extreme marker values, the risk estimates reverse and increase instead of continuing to decrease.
It is useful to check whether a Gaussian approximation for the distribution of risk estimates based on a screening marker is valid. If the value of the marker level at which risk reversal occurs lies within the set truncation limits, these may need to be reset, and a Gaussian model may be inappropriate to illustrate the risk distributions.
涉及多个标志物的检测的筛查性能通常以风险的两个高斯相对频率分布直观呈现,一条曲线与患病个体相关,另一条与未患病个体相关。如果基础筛查标志物的分布近似高斯分布,基于相同标志物的风险估计通常也会近似高斯分布。然而,这种近似有时会失败。在此我们研究这种情况发生的情形。
理论统计分析。
针对唐氏综合征的三种产前筛查标志物生成患病和未患病个体的假设对数高斯相对分布。计算每个标志物值的对数似然比,并将相对频率分布的图与基于这些对数似然比的均值和标准差的高斯分布图进行比较。
当完全高斯分布的筛查标志物在患病和未患病个体中的分布标准差相似时,风险估计的分布也近似高斯分布。如果标准差有显著差异,错误地假设风险估计的分布是高斯分布会产生一种图形异常,即在连续风险尺度上绘制的患病和未患病个体的风险分布在两个地方相交。这在理论上是不可能的。根据经验绘制风险分布表明,所有个体的估计风险都高于指定值。对于标志物值更极端的个体,风险估计会反转并增加,而不是继续降低。
检查基于筛查标志物的风险估计分布的高斯近似是否有效是有用的。如果发生风险反转的标志物水平值在设定的截断范围内,可能需要重新设置这些范围,并且高斯模型可能不适用于说明风险分布。