Division of Clinical and Translational Sciences, Department of Internal Medicine, The University of Texas McGovern Medical School at Houston, Houston, Texas, USA.
Department of Biostatistics and Data Science, The University of Texas School of Public Health, Dallas, Texas, USA.
Stat Med. 2023 Apr 30;42(9):1398-1411. doi: 10.1002/sim.9677. Epub 2023 Feb 2.
Incorporating promising biomarkers into cancer screening practices for early-detection is increasingly appealing because of the unsatisfactory performance of current cancer screening strategies. The matched case-control design is commonly adopted in biomarker development studies to evaluate the discriminative power of biomarker candidates, with an intention to eliminate confounding effects. Data from matched case-control studies have been routinely analyzed by the conditional logistic regression, although the assumed logit link between biomarker combinations and disease risk may not always hold. We propose a conditional concordance-assisted learning method, which is distribution-free, for identifying an optimal combination of biomarkers to discriminate cases and controls. We are particularly interested in combinations with a clinically and practically meaningful specificity to prevent disease-free subjects from unnecessary and possibly intrusive diagnostic procedures, which is a top priority for cancer population screening. We establish asymptotic properties for the derived combination and confirm its favorable finite sample performance in simulations. We apply the proposed method to the prostate cancer data from the carotene and retinol efficacy trial (CARET).
将有前途的生物标志物纳入癌症筛查实践以进行早期检测越来越受到关注,因为当前癌症筛查策略的性能并不令人满意。匹配病例对照设计通常用于生物标志物开发研究中,以评估生物标志物候选物的区分能力,旨在消除混杂效应。虽然生物标志物组合与疾病风险之间的假定对数联系并不总是成立,但匹配病例对照研究的数据通常通过条件逻辑回归进行分析。我们提出了一种条件一致性辅助学习方法,该方法是无分布的,用于确定最佳的生物标志物组合来区分病例和对照。我们特别关注具有临床和实际意义的特异性的组合,以防止无疾病的个体接受不必要的、可能具有侵入性的诊断程序,这是癌症人群筛查的首要任务。我们为推导出的组合建立了渐近性质,并在模拟中确认了其在有限样本中的良好性能。我们将所提出的方法应用于来自胡萝卜素和视黄醇疗效试验 (CARET) 的前列腺癌数据。