Department of Medical Records & Statistics, Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu 221000, China.
Biomed Res Int. 2022 May 13;2022:2199317. doi: 10.1155/2022/2199317. eCollection 2022.
Short-term or long-term connections between different diseases have not been fully acknowledged. This study was aimed at exploring the network association pattern between disorders that occurred in the same individual by using the association rule mining technique.
Raw data were extracted from the large-scale electronic medical record database of the affiliated hospital of Xuzhou Medical University. 1551732 pieces of diagnosis information from 144207 patients were collected from 2015 to 2020. Clinic diagnoses were categorized according to "International Classification of Diseases, 10th revision". The Apriori algorithm was used to explore the association patterns among those diagnoses.
12889 rules were generated after running the algorithm at first. After threshold filtering and manual examination, 110 disease combinations (support ≥ 0.001, confidence ≥ 60%, lift > 1) with strong association strength were obtained eventually. Association rules about the circulatory system and metabolic diseases accounted for a significant part of the results.
This research elucidated the network associations between disorders from different body systems in the same individual and demonstrated the usefulness of the Apriori algorithm in comorbidity or multimorbidity studies. The mined combinations will be helpful in improving prevention strategies, early identification of high-risk populations, and reducing mortality.
不同疾病之间的短期或长期联系尚未得到充分认识。本研究旨在通过关联规则挖掘技术探索同一患者中发生的疾病之间的网络关联模式。
从徐州医科大学附属医院的大型电子病历数据库中提取原始数据。收集了 2015 年至 2020 年 144207 名患者的 1551732 份诊断信息。临床诊断根据“国际疾病分类,第 10 版”进行分类。使用 Apriori 算法来探索这些诊断之间的关联模式。
算法运行后首先生成了 12889 条规则。经过阈值过滤和人工检查,最终获得了 110 种具有强关联强度的疾病组合(支持度≥0.001,置信度≥60%,提升度>1)。与循环系统和代谢性疾病相关的关联规则占了很大一部分。
本研究阐明了同一患者不同身体系统疾病之间的网络关联,并证明了 Apriori 算法在共病或多病研究中的有效性。挖掘出的组合将有助于改善预防策略、早期识别高危人群和降低死亡率。