Suppr超能文献

一种用于在基层医疗中识别转移性结直肠癌的机器学习工具。

A machine learning tool for identifying metastatic colorectal cancer in primary care.

作者信息

Abedi Eliya, Ewing Marcela, Nemlander Elinor, Hasselström Jan, Sjövall Annika, Carlsson Axel C, Rosenblad Andreas

机构信息

Division of Family Medicine and Primary Care, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Solna, Sweden.

Academic Primary Health Care Centre, Region Stockholm, Stockholm, Sweden.

出版信息

Scand J Prim Health Care. 2025 Mar 13:1-9. doi: 10.1080/02813432.2025.2477155.

Abstract

BACKGROUND

Detection of colorectal cancer (CRC) is mainly achieved by clinical assessment. As new treatments become available for metastatic CRC (MCRC), it is important to accurately identify these patients.

AIM

To develop a predictive model for identifying MCRC in primary health care patients using diagnostic data analysed with machine learning.

DESIGN AND SETTING

A case-control study utilising data on primary health care visits for 146 patients >18 years old diagnosed with MCRC in the Västra Götaland Region, Sweden during 2011, and 577 sex-, age, and primary health care centre-matched controls.

METHOD

Stochastic gradient boosting was used to construct a model for predicting the presence of MCRC based on diagnostic codes from primary health care consultations during the year before index (diagnosis) date and number of consultations. Variable importance was estimated using the normalised relative influence (NRI) score. Risks of having MCRC were calculated using odds ratios of marginal effects (OR).

RESULTS

The optimal model included 76 variables with non-zero influence, had an area under the curve of 76.5%, a sensitivity of 77.8%, and a specificity of 69.2%. The 10 most important variables had a combined NRI of 61.0%. Number of consultations during the year before index date had the highest NRI at 19.2%, with an OR of 3.3.

CONCLUSION

A machine learning method based on primary health care consultation frequency and diagnoses may be used to identify important variables for predicting presence of MCRC. Both primary health care consultations and associated diagnostic codes need to be taken into consideration.

摘要

背景

结直肠癌(CRC)的检测主要通过临床评估来实现。随着转移性结直肠癌(MCRC)的新治疗方法不断涌现,准确识别这些患者至关重要。

目的

利用机器学习分析的诊断数据,开发一种用于在初级卫生保健患者中识别MCRC的预测模型。

设计与背景

一项病例对照研究,利用了2011年在瑞典韦斯特罗斯-哥德兰地区诊断为MCRC的146名18岁以上初级卫生保健就诊患者的数据,以及577名性别、年龄和初级卫生保健中心匹配的对照。

方法

使用随机梯度提升法,基于索引(诊断)日期前一年初级卫生保健会诊的诊断代码和会诊次数构建预测MCRC存在的模型。使用归一化相对影响(NRI)分数估计变量重要性。使用边际效应比值比(OR)计算患MCRC的风险。

结果

最佳模型包括76个具有非零影响的变量,曲线下面积为76.5%,灵敏度为77.8%,特异性为69.2%。10个最重要的变量的综合NRI为61.0%。索引日期前一年的会诊次数NRI最高,为19.2%,OR为3.3。

结论

基于初级卫生保健会诊频率和诊断的机器学习方法可用于识别预测MCRC存在的重要变量。初级卫生保健会诊和相关诊断代码都需要考虑在内。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验