Laffafchi Soroor, Ebrahimi Ahmad, Kafan Samira
Department of Business Administration and Entrepreneurship, Faculty of Management and Economics, Science and Research Branch, Islamic Azad University, Daneshgah Blvd, Simon Bulivar Blvd, Tehran, Iran.
Department of Industrial and Technology Management, Faculty of Management and Economics, Science and Research Branch, Islamic Azad University, Daneshgah Blvd, Simon Bulivar Blvd, Tehran, Iran.
Health Inf Sci Syst. 2024 Mar 6;12(1):17. doi: 10.1007/s13755-024-00276-9. eCollection 2024 Dec.
Pulmonary Embolism (PE) is a life-threatening clinical disease with no specific clinical symptoms and Computed Tomography Angiography (CTA) is used for diagnosis. Clinical decision support scoring systems like Wells and rGeneva based on PE risk factors have been developed to estimate the pre-test probability but are underused, leading to continuous overuse of CTA imaging. This diagnostic study aimed to propose a novel approach for efficient management of PE diagnosis using a two-step interconnected machine learning framework directly by analyzing patients' Electronic Health Records data. First, we performed feature importance analysis according to the result of LightGBM superiority for PE prediction, then four state-of-the-art machine learning methods were applied for PE prediction based on the feature importance results, enabling swift and accurate pre-test diagnosis. Throughout the study patients' data from different departments were collected from Sina educational hospital, affiliated with the Tehran University of medical sciences in Iran. Generally, the Ridge classification method obtained the best performance with an F1 score of 0.96. Extensive experimental findings showed the effectiveness and simplicity of this diagnostic process of PE in comparison with the existing scoring systems. The main strength of this approach centered on PE disease management procedures, which would reduce avoidable invasive CTA imaging and be applied as a primary prognosis of PE, hence assisting the healthcare system, clinicians, and patients by reducing costs and promoting treatment quality and patient satisfaction.
肺栓塞(PE)是一种危及生命的临床疾病,没有特定的临床症状,计算机断层扫描血管造影(CTA)用于诊断。基于PE危险因素的Wells和rGeneva等临床决策支持评分系统已被开发出来以估计检测前概率,但未得到充分利用,导致CTA成像持续过度使用。这项诊断研究旨在通过直接分析患者的电子健康记录数据,提出一种使用两步互联机器学习框架有效管理PE诊断的新方法。首先,我们根据LightGBM在PE预测方面的优势结果进行特征重要性分析,然后基于特征重要性结果应用四种最先进的机器学习方法进行PE预测,实现快速准确的检测前诊断。在整个研究过程中,从伊朗德黑兰医科大学附属的新浪教育医院收集了不同科室患者的数据。一般来说,岭分类方法获得了最佳性能,F1分数为0.96。大量实验结果表明,与现有的评分系统相比,这种PE诊断过程具有有效性和简便性。这种方法的主要优势集中在PE疾病管理程序上,这将减少不必要的侵入性CTA成像,并作为PE的主要预后指标应用,从而通过降低成本、提高治疗质量和患者满意度来帮助医疗系统、临床医生和患者。