Du Jianchao, Ding Junyao, Wu Yuan, Chen Tianyan, Lian Jianqi, Shi Lei, Zhou Yun
School of Telecommunications Engineering, Xidian University, Xi'an, China.
Duke University Health System, Durham, NC, United States.
JMIR Form Res. 2024 Dec 9;8:e58423. doi: 10.2196/58423.
Fever of unknown origin (FUO) is a significant challenge for the medical community due to its association with a wide range of diseases, the complexity of diagnosis, and the likelihood of misdiagnosis. Machine learning can extract valuable information from the extensive data of patient indicators, aiding doctors in diagnosing the underlying cause of FUO.
The study aims to design a multipath hierarchical classification algorithm to diagnose FUO due to the hierarchical structure of the etiology of FUO. In addition, to improve the diagnostic performance of the model, a mechanism for feature selection is added to the model.
The case data of patients with FUO admitted to the First Affiliated Hospital of Xi'an Jiaotong University between 2011 and 2020 in China were used as the dataset for model training and validation. The hierarchical structure tree was then characterized according to etiology. The structure included 3 layers, with the top layer representing the FUO, the middle layer dividing the FUO into 5 categories of etiology (bacterial infection, viral infection, other infection, autoimmune diseases, and other noninfection), and the last layer further refining them to 16 etiologies. Finally, ablation experiments were set to determine the optimal structure of the proposed method, and comparison experiments were to verify the diagnostic performance.
According to ablation experiments, the model achieved the best performance with an accuracy of 76.08% when the number of middle paths was 3%, and 25% of the features were selected. According to comparison experiments, the proposed model outperformed the comparison methods, both from the perspective of feature selection methods and hierarchical classification methods. Specifically, brucellosis had an accuracy of 100%, and liver abscess, viral infection, and lymphoma all had an accuracy of more than 80%.
In this study, a novel multipath feature selection and hierarchical classification model was designed for the diagnosis of FUO and was adequately evaluated quantitatively. Despite some limitations, this model enriches the exploration of FUO in machine learning and assists physicians in their work.
不明原因发热(FUO)因其与多种疾病相关、诊断复杂且存在误诊可能性,对医学界构成重大挑战。机器学习可以从患者指标的大量数据中提取有价值的信息,帮助医生诊断FUO的潜在病因。
由于FUO病因的层次结构,本研究旨在设计一种多路径层次分类算法来诊断FUO。此外,为提高模型的诊断性能,在模型中添加了特征选择机制。
将2011年至2020年在中国西安交通大学第一附属医院收治的FUO患者的病例数据用作模型训练和验证的数据集。然后根据病因对层次结构树进行特征化。该结构包括3层,顶层代表FUO,中间层将FUO分为5类病因(细菌感染、病毒感染、其他感染、自身免疫性疾病和其他非感染性疾病),最后一层将它们进一步细化为16种病因。最后,设置消融实验以确定所提方法的最优结构,并进行对比实验以验证诊断性能。
根据消融实验,当中路径数量为3%且选择25%的特征时,模型取得了最佳性能,准确率为76.08%。根据对比实验,从特征选择方法和层次分类方法的角度来看,所提模型均优于对比方法。具体而言,布鲁氏菌病的准确率为100%,肝脓肿、病毒感染和淋巴瘤的准确率均超过80%。
在本研究中,设计了一种用于诊断FUO的新型多路径特征选择和层次分类模型,并对其进行了充分的定量评估。尽管存在一些局限性,但该模型丰富了机器学习中对FUO的探索,并有助于医生开展工作。