Department of Colorectal Disease, Victoria Hospital Kirkcaldy, Kirkcaldy, UK.
School of Medicine, University of St Andrews, St Andrews, UK.
Scott Med J. 2024 Aug;69(3):61-71. doi: 10.1177/00369330241266080. Epub 2024 Jul 23.
Pressured healthcare resources make risk stratification and patient prioritisation fundamental issues for the investigation of colorectal cancer (CRC) in symptomatic patients. The present study uses machine learning algorithms and decision strategies to improve the appropriate use of colonoscopy.
All symptomatic patients in a single health board (2018-2021) proceeding to colonoscopy to investigate for CRC were included. Machine learning algorithms (NeuralNetwork, randomForest, Logistic regression, Naïve-Bayes and Adaboost) were used to risk-stratify patients for CRC using demographics, symptoms, quantitative faecal immunochemical test (qFIT) and haematological tests. Decision curve analyses were performed to determine the optimal decision strategies.
3776 patients were included (median age, 65; M:F,0.9:1.0) and CRC was identified in 217 patients (5.7%). qFIT > 400 μg Hb/g was the most important variable (%IncMSE = 78.5). RandomForrest had the highest area under curve (0.91) and accuracy (0.80) for CRC. When utilising decision curve analysis (DCA), 30%, 46% and 54% of colonoscopies were saved at accepted CRC probabilities of 1%, 2% and 3%, respectively. RandomForrest modelling had superior net clinical benefit compared to default colonoscopy strategies.
MLA-derived decision strategies that account for patient and referrer risk preference reduce colonoscopy demand and carry net clinical benefit compared to default colonoscopy strategies.
在有症状的结直肠癌(CRC)患者中,医疗资源紧张使得风险分层和患者优先排序成为调查的基本问题。本研究使用机器学习算法和决策策略来改善结肠镜检查的合理应用。
纳入单一健康委员会(2018-2021 年)中所有因 CRC 而接受结肠镜检查的有症状患者。使用机器学习算法(神经网络、随机森林、逻辑回归、朴素贝叶斯和自适应增强)根据人口统计学、症状、定量粪便免疫化学测试(qFIT)和血液学测试对患者进行 CRC 风险分层。进行决策曲线分析以确定最佳决策策略。
共纳入 3776 例患者(中位年龄 65 岁;男女比例为 0.9:1.0),217 例(5.7%)患者发现 CRC。qFIT>400 μg Hb/g 是最重要的变量(%IncMSE=78.5)。随机森林对 CRC 的曲线下面积最高(0.91),准确率最高(0.80)。利用决策曲线分析(DCA),在接受的 CRC 概率为 1%、2%和 3%时,分别有 30%、46%和 54%的结肠镜检查可被节省。随机森林模型比默认的结肠镜检查策略具有更高的净临床获益。
考虑患者和转诊医生风险偏好的 MLA 衍生决策策略可降低结肠镜检查的需求,并与默认的结肠镜检查策略相比具有净临床获益。