Liang Hengrui, Yang Tao, Liu Zihao, Jian Wenhua, Chen Yilong, Li Bingliang, Yan Zeping, Xu Weiqiang, Chen Luming, Qi Yifan, Wang Zhiwei, Liao Yajing, Lin Peixuan, Li Jiameng, Wang Wei, Li Li, Wang Meijia, Zhang YunHui, Deng Lizong, Jiang Taijiao, He Jianxing
Department of Thoracic Surgery China State Key Laboratory of Respiratory Disease & National Clinical Research Center for Respiratory Disease the Key laboratory of Advanced Interdisciplinary Studies Center the First Affiliated Hospital of Guangzhou Medical University Guangzhou China.
Guangzhou National Laboratory Guangzhou China.
MedComm (2020). 2025 Jan 12;6(1):e70043. doi: 10.1002/mco2.70043. eCollection 2025 Jan.
Respiratory diseases pose a significant global health burden, with challenges in early and accurate diagnosis due to overlapping clinical symptoms, which often leads to misdiagnosis or delayed treatment. To address this issue, we developed , an artificial intelligence (AI)-based diagnostic system that utilizes natural language processing (NLP) to extract key clinical features from electronic health records (EHRs) for the accurate classification of respiratory diseases. This study employed a large cohort of 31,267 EHRs from multiple centers for model training and internal testing. Additionally, prospective real-world validation was conducted using 1142 EHRs from three external centers. demonstrated superior diagnostic performance, achieving an F1 score of 0.711 for top 1 diagnosis and 0.927 for top 3 diagnoses. In real-world testing, outperformed both human experts and ChatGPT 4.0, achieving an F1 score of 0.651 for top 1 diagnosis. The study emphasizes the potential of as an effective tool to support physicians in diagnosing respiratory diseases more accurately and efficiently. Despite the promising results, further large-scale multicenter validation with larger sample sizes is still needed to confirm its clinical utility and generalizability.
呼吸系统疾病给全球健康带来了重大负担,由于临床症状重叠,早期准确诊断面临挑战,这常常导致误诊或治疗延误。为了解决这个问题,我们开发了一种基于人工智能(AI)的诊断系统,该系统利用自然语言处理(NLP)从电子健康记录(EHR)中提取关键临床特征,以准确分类呼吸系统疾病。本研究使用了来自多个中心的31267份电子健康记录的大型队列进行模型训练和内部测试。此外,还使用来自三个外部中心的1142份电子健康记录进行了前瞻性真实世界验证。该系统表现出卓越的诊断性能,在 top1诊断中F1分数达到0.711,在top3诊断中达到0.927。在真实世界测试中,该系统的表现优于人类专家和ChatGPT 4.0,在top1诊断中F1分数达到0.651。该研究强调了该系统作为一种有效工具的潜力,可支持医生更准确、高效地诊断呼吸系统疾病。尽管取得了令人鼓舞的结果,但仍需要进一步进行更大样本量的大规模多中心验证,以确认其临床实用性和可推广性。