Department of Orthopaedics and Traumatology, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong.
Cancer Med. 2024 Nov;13(22):e70383. doi: 10.1002/cam4.70383.
The aim of this study was to find the most appropriate variables to input into machine learning algorithms to identify those patients with primary lung malignancy with high risk for metastasis to the bone.
Patients with either histological or radiological diagnoses of lung cancer were included in this study.
The patient cohort comprised 1864 patients diagnosed from 2016 to 2021. A total of 25 variables were considered as potential risk factors. These variables have been identified in previous studies as independent risk factors for bone metastasis. Treatment methods for lung cancer were taken into account during model development. The outcome variable was binary, (presence or absence of bone metastasis) with follow-up until death or 12-month survival, whichever is the sooner. Results showed that American Joint Committee on Cancer staging, the use of EGFR inhibitor, age, T-staging, and lymphovascular invasion were the five input features contributing the most to the model algorithm. High AJCC staging (OR 1.98; p < 0.05), the use of EGFR inhibitor (OR 6.14; p < 0.05), high T-staging (OR 1.47; p < 0.05), and the presence of lymphovascular invasion (OR 4.92; p < 0.05) increase predicted risk of bone metastasis. Conversely, older age reduces predicted bone metastasis risk (OR 0.98; p < 0.05).
The machine learning model developed in this study can be easily incorporated into the hospital's Clinical Management System so that input variables can be immediately utilized to give an accurate prediction of bone metastatic risk, therefore informing clinicians on the best treatment strategy for that individual patient.
本研究旨在找到最适合输入机器学习算法的变量,以识别那些患有原发性肺部恶性肿瘤且有高骨转移风险的患者。
本研究纳入了经组织学或影像学诊断为肺癌的患者。
患者队列包括了 2016 年至 2021 年间诊断的 1864 名患者。共考虑了 25 个变量作为潜在的危险因素。这些变量在以前的研究中被确定为骨转移的独立危险因素。在模型开发过程中考虑了肺癌的治疗方法。结局变量为二分类(存在或不存在骨转移),随访至死亡或 12 个月生存,以先发生者为准。结果表明,美国癌症联合委员会分期、EGFR 抑制剂的使用、年龄、T 分期和脉管侵犯是对模型算法贡献最大的五个输入特征。高 AJCC 分期(OR 1.98;p<0.05)、EGFR 抑制剂的使用(OR 6.14;p<0.05)、高 T 分期(OR 1.47;p<0.05)和脉管侵犯的存在(OR 4.92;p<0.05)增加了骨转移的预测风险。相反,年龄较大降低了骨转移的预测风险(OR 0.98;p<0.05)。
本研究开发的机器学习模型可以很容易地纳入医院的临床管理系统,以便立即利用输入变量对骨转移风险进行准确预测,从而为临床医生提供针对该患者的最佳治疗策略。