Department of Biostatistics and Data Science, University of Kansas Medical Center, Kansas City, Kansas, USA.
Department of Statistics and Actuarial-Financial Mathematics, University of the Aegean, Samos, Greece.
Stat Med. 2021 Mar 30;40(7):1767-1789. doi: 10.1002/sim.8869. Epub 2021 Feb 2.
During the early stage of biomarker discovery, high throughput technologies allow for simultaneous input of thousands of biomarkers that attempt to discriminate between healthy and diseased subjects. In such cases, proper ranking of biomarkers is highly important. Common measures, such as the area under the receiver operating characteristic (ROC) curve (AUC), as well as affordable sensitivity and specificity levels, are often taken into consideration. Strictly speaking, such measures are appropriate under a stochastic ordering assumption, which implies, without loss of generality, that higher measurements are more indicative for the disease. Such an assumption is not always plausible and may lead to rejection of extremely useful biomarkers at this early discovery stage. We explore the length of a smooth ROC curve as a measure for biomarker ranking, which is not subject to directionality. We show that the length corresponds to a divergence, is identical to the corresponding length of the optimal (likelihood ratio) ROC curve, and is an appropriate measure for ranking biomarkers. We explore the relationship between the length measure and the AUC of the optimal ROC curve. We then provide a complete framework for the evaluation of a biomarker in terms of sensitivity and specificity through a proposed ROC analogue for use in improper settings. In the absence of any clinical insight regarding the appropriate cutoffs, we estimate the sensitivity and specificity under a two-cutoff extension of the Youden index and we further take into account the implied costs. We apply our approaches on two biomarker studies that relate to pancreatic and esophageal cancer.
在生物标志物发现的早期阶段,高通量技术允许同时输入数千个试图区分健康和患病受试者的生物标志物。在这种情况下,对生物标志物进行适当的排序非常重要。通常会考虑常见的措施,例如接收器操作特性(ROC)曲线下的面积(AUC),以及可承受的灵敏度和特异性水平。严格来说,这些措施在随机排序假设下是合适的,这意味着更高的测量值更能指示疾病。这种假设并不总是合理的,并且可能导致在这个早期发现阶段拒绝非常有用的生物标志物。我们探索了平滑 ROC 曲线的长度作为生物标志物排序的度量,它不受方向性的影响。我们表明,该长度对应于一个散度,与最优(似然比)ROC 曲线的相应长度相同,并且是一种用于对生物标志物进行排序的适当度量。我们探讨了长度度量与最优 ROC 曲线 AUC 之间的关系。然后,我们通过提出的用于不当设置的 ROC 模拟,提供了一种根据灵敏度和特异性评估生物标志物的完整框架。在没有任何关于适当截止值的临床洞察力的情况下,我们根据 Youden 指数的两个截止值扩展来估计灵敏度和特异性,并进一步考虑隐含的成本。我们将我们的方法应用于两个与胰腺癌和食管癌相关的生物标志物研究。