Institute of Family Medicine and Public Health, University of Tartu, Ravila 19, 50411, Tartu, Estonia.
Johan Skytte Institute of Political Studies, University of Tartu, Tartu, Estonia.
Sci Rep. 2024 Oct 19;14(1):24589. doi: 10.1038/s41598-024-75697-3.
Transitioning to an individualized risk-based approach can significantly enhance cervical cancer screening programs. We aimed to derive and internally validate a prediction model for assessing the risk of cervical intraepithelial neoplasia grade 3 or higher (CIN3+) and cancer in women eligible for screening. This retrospective study utilized data from the Estonian electronic health records, including 517,884 women from the health insurance database and linked health registries. We employed Cox proportional hazard regression, incorporating reproductive and medical history variables (14 covariates), and utilized the least absolute shrinkage and selection operator (LASSO) for variable selection. A 10-fold cross-validation for internal validation of the model was used. The main outcomes were the performance of discrimination and calibration. Over the 8-year follow-up, we identified 1326 women with cervical cancer and 5929 with CIN3+, with absolute risks of 0.3% and 1.1%, respectively. The prediction model for CIN3 + and cervical cancer had good discriminative power and was well calibrated Harrell's C of 0.74 (0.73-0.74) (calibration slope 1.00 (0.97-1.02) and 0.67 (0.66-0.69) (calibration slope 0.92 (0.84-1.00) respectively. A developed model based on nationwide electronic health data showed potential utility for risk stratification to supplement screening efforts. This work was supported through grants number PRG2218 from the Estonian Research Council, and EMP416 from the EEA (European Economic Area) and Norway Grants.
向基于个体风险的方法转变可以显著增强宫颈癌筛查计划。我们旨在为有筛查资格的女性建立并内部验证一个用于评估宫颈上皮内瘤变 3 级或更高(CIN3+)及癌症风险的预测模型。这项回顾性研究利用了爱沙尼亚电子健康记录中的数据,包括来自健康保险数据库和健康登记的 517884 名女性。我们采用了 Cox 比例风险回归,纳入了生殖和医学史变量(14 个协变量),并使用最小绝对收缩和选择算子(LASSO)进行变量选择。我们使用 10 倍交叉验证对内验证模型。主要结果是判别和校准的性能。在 8 年的随访期间,我们发现了 1326 名宫颈癌患者和 5929 名 CIN3+患者,绝对风险分别为 0.3%和 1.1%。CIN3+和宫颈癌的预测模型具有良好的判别能力,校准良好,Harrell 的 C 值为 0.74(0.73-0.74)(校准斜率为 1.00(0.97-1.02)和 0.67(0.66-0.69)(校准斜率为 0.92(0.84-1.00))。基于全国性电子健康数据建立的模型显示出用于风险分层以补充筛查工作的潜在效用。这项工作得到了爱沙尼亚研究理事会的 PRG2218 号拨款和 EEA(欧洲经济区)和挪威赠款的 EMP416 号拨款的支持。