• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发适用于使用电子病历初始化学习健康系统单元的短暂性脑缺血发作风险预测模型。

Development of transient ischemic attack risk prediction model suitable for initializing a learning health system unit using electronic medical records.

作者信息

Wen Jian, Zhang Tianmei, Ye Shangrong, Li Cheng, Han Ruobing, Huang Ran, Shen Bairong, Chen Anjun, Li Qinghua

机构信息

Department of Neurology, Guilin Medical University Affiliated Hospital, 15 Lequn Road, Guilin, Guangxi, 541000, China.

West China Hospital, 2222 Xingchuan Road, Chengdu, Sichuan, 610212, China.

出版信息

BMC Med Inform Decis Mak. 2024 Dec 18;24(1):392. doi: 10.1186/s12911-024-02767-x.

DOI:10.1186/s12911-024-02767-x
PMID:39696228
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11657208/
Abstract

BACKGROUND

Patients with transient ischemic attack (TIA) face a significantly increased risk of stroke. However, TIA screening and early detection rates are low, especially in developing countries. This study aims to develop an inclusive and practical TIA risk prediction model using machine learning (ML) that performs well in both hospital and resource-limited clinic settings. This model is essential for initiating the first ML-enabled learning health system (LHS) unit designed for routine and equitable TIA screening and early detection across broad populations.

METHODS

Employing a novel protocol, this study first standardized data from a hospital's electronic medical records (EMR) to construct inclusive TIA risk prediction ML models using a data-centric approach. Subsequently, a quantitative distribution of TIA risk factors was applied in feature engineering to reduce the number of variables for a practical ML model. This refined model initiated a TIA ML-LHS unit that is capable of continuously updating with new EMR data from hospitals and clinics. Additionally, the practical model underwent external validation using data from another hospital.

RESULTS

The inclusive 150-variable ML models, derived from all available EMR variables for TIA, achieved a recall of 0.868 and an accuracy of 0.886 in predicting TIA risk. Further feature engineering produced a practical XGBoost model with 20 variables, maintaining acceptable performance of 0.855 recall and 0.796 accuracy. The initialized TIA ML-LHS unit, based on the practical model, achieved performance metrics of 0.830 recall, 0.726 precision, 0.816 ROC-AUC, and 0.812 accuracy. The model also performed well in external validation, confirming its effectiveness with patient data from different clinical settings.

CONCLUSIONS

This study developed the first inclusive and practical TIA XGBoost model from full hospital EHR and initiated the first TIA risk prediction ML-LHS unit. This TIA model, which requires only 20 variables, enables the ML-LHS to serve not only patients in hospitals but also those in resource-limited clinics. These results have significant implications for expanding risk-based TIA screening in community and rural clinics, thereby enhancing early detection of TIA among underserved populations and improving health equity. The novel protocol used in this study is also applicable for initiating ML-LHS units for various preventable diseases, providing a new system-level approach to responsible AI development and applications.

摘要

背景

短暂性脑缺血发作(TIA)患者面临的中风风险显著增加。然而,TIA筛查和早期检测率较低,尤其是在发展中国家。本研究旨在利用机器学习(ML)开发一种包容性强且实用的TIA风险预测模型,该模型在医院和资源有限的诊所环境中均能表现良好。此模型对于启动首个基于ML的学习健康系统(LHS)单元至关重要,该单元旨在对广泛人群进行常规且公平的TIA筛查和早期检测。

方法

本研究采用一种新颖的方案,首先对医院电子病历(EMR)中的数据进行标准化处理,以数据为中心的方法构建包容性的TIA风险预测ML模型。随后,在特征工程中应用TIA风险因素的定量分布,以减少实用ML模型的变量数量。这个优化后的模型启动了一个TIA ML-LHS单元,该单元能够根据医院和诊所的新EMR数据持续更新。此外,使用另一家医院的数据对实用模型进行了外部验证。

结果

从所有可用的TIA EMR变量中得出的包含150个变量的ML模型,在预测TIA风险时召回率达到0.868,准确率达到0.886。进一步的特征工程产生了一个包含20个变量的实用XGBoost模型,召回率保持在可接受的0.855,准确率为0.796。基于实用模型初始化的TIA ML-LHS单元,召回率为0.830,精确率为0.726,ROC-AUC为0.816,准确率为0.812。该模型在外部验证中也表现良好,证实了其在不同临床环境下患者数据中的有效性。

结论

本研究从完整的医院电子健康记录中开发出首个包容性强且实用的TIA XGBoost模型,并启动了首个TIA风险预测ML-LHS单元。这个仅需20个变量的TIA模型使ML-LHS不仅能为医院中的患者服务,也能为资源有限的诊所中的患者服务。这些结果对于在社区和农村诊所扩大基于风险的TIA筛查具有重要意义,从而提高在服务不足人群中TIA的早期检测率并改善健康公平性。本研究中使用的新颖方案也适用于启动针对各种可预防疾病的ML-LHS单元,为负责任的人工智能开发和应用提供了一种新的系统级方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/187311d7166d/12911_2024_2767_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/cca8811abbf4/12911_2024_2767_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/fc4ebb3d03b0/12911_2024_2767_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/6cca997b81bd/12911_2024_2767_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/187311d7166d/12911_2024_2767_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/cca8811abbf4/12911_2024_2767_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/fc4ebb3d03b0/12911_2024_2767_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/6cca997b81bd/12911_2024_2767_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/928e/11657208/187311d7166d/12911_2024_2767_Fig4_HTML.jpg

相似文献

1
Development of transient ischemic attack risk prediction model suitable for initializing a learning health system unit using electronic medical records.开发适用于使用电子病历初始化学习健康系统单元的短暂性脑缺血发作风险预测模型。
BMC Med Inform Decis Mak. 2024 Dec 18;24(1):392. doi: 10.1186/s12911-024-02767-x.
2
Development of Lung Cancer Risk Prediction Machine Learning Models for Equitable Learning Health System: Retrospective Study.用于公平学习健康系统的肺癌风险预测机器学习模型的开发:回顾性研究
JMIR AI. 2024 Sep 11;3:e56590. doi: 10.2196/56590.
3
Predicting cerebral infarction and transient ischemic attack in healthy individuals and those with dysmetabolism: a machine learning approach combined with routine blood tests.预测健康个体和代谢异常者的脑梗死和短暂性脑缺血发作:一种结合常规血液检测的机器学习方法
Sci Rep. 2025 Apr 16;15(1):13044. doi: 10.1038/s41598-025-94682-y.
4
Machine learning is an effective method to predict the 90-day prognosis of patients with transient ischemic attack and minor stroke.机器学习是预测短暂性脑缺血发作和小卒中患者 90 天预后的有效方法。
BMC Med Res Methodol. 2022 Jul 16;22(1):195. doi: 10.1186/s12874-022-01672-z.
5
Simulation of a machine learning enabled learning health system for risk prediction using synthetic patient data.使用合成患者数据模拟机器学习支持的学习健康系统进行风险预测。
Sci Rep. 2022 Oct 26;12(1):17917. doi: 10.1038/s41598-022-23011-4.
6
Quantitative patient graph analysis for transient ischemic attack risk factor distribution based on electronic medical records.基于电子病历的短暂性脑缺血发作危险因素分布的定量患者图形分析
Heliyon. 2023 Nov 25;10(1):e22766. doi: 10.1016/j.heliyon.2023.e22766. eCollection 2024 Jan 15.
7
Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation.机器学习预测纽约市新冠肺炎患者队列中的死亡率和危急事件:模型开发与验证
J Med Internet Res. 2020 Nov 6;22(11):e24018. doi: 10.2196/24018.
8
Prediction of adverse pregnancy outcomes using machine learning techniques: evidence from analysis of electronic medical records data in Rwanda.使用机器学习技术预测不良妊娠结局:来自卢旺达电子病历数据分析的证据。
BMC Med Inform Decis Mak. 2025 Feb 12;25(1):76. doi: 10.1186/s12911-025-02921-z.
9
Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study.基于电子健康记录的社区获得性急性肾损伤住院风险预测的机器学习模型:开发和验证研究。
J Med Internet Res. 2020 Aug 4;22(8):e16903. doi: 10.2196/16903.
10
Identifying transient ischemic attack (TIA) patients at high-risk of adverse outcomes: development and validation of an approach using electronic health record data.识别发生不良结局风险较高的短暂性脑缺血发作(TIA)患者:一种基于电子健康记录数据的方法的建立和验证。
BMC Neurol. 2022 Jul 12;22(1):256. doi: 10.1186/s12883-022-02776-1.

引用本文的文献

1
Predicting cerebral infarction and transient ischemic attack in healthy individuals and those with dysmetabolism: a machine learning approach combined with routine blood tests.预测健康个体和代谢异常者的脑梗死和短暂性脑缺血发作:一种结合常规血液检测的机器学习方法
Sci Rep. 2025 Apr 16;15(1):13044. doi: 10.1038/s41598-025-94682-y.

本文引用的文献

1
Quantitative patient graph analysis for transient ischemic attack risk factor distribution based on electronic medical records.基于电子病历的短暂性脑缺血发作危险因素分布的定量患者图形分析
Heliyon. 2023 Nov 25;10(1):e22766. doi: 10.1016/j.heliyon.2023.e22766. eCollection 2024 Jan 15.
2
Building Practical Risk Prediction Models for Nasopharyngeal Carcinoma Screening with Patient Graph Analysis and Machine Learning.基于患者图谱分析和机器学习构建用于鼻咽癌筛查的实用风险预测模型。
Cancer Epidemiol Biomarkers Prev. 2023 Feb 6;32(2):274-280. doi: 10.1158/1055-9965.EPI-22-0792.
3
A multi-step approach to managing missing data in time and patient variant electronic health records.
一种管理时间和患者变异电子健康记录中缺失数据的多步骤方法。
BMC Res Notes. 2022 Feb 17;15(1):64. doi: 10.1186/s13104-022-05911-w.
4
Heart Disease and Stroke Statistics-2022 Update: A Report From the American Heart Association.《心脏病与卒中统计-2022 更新:美国心脏协会报告》。
Circulation. 2022 Feb 22;145(8):e153-e639. doi: 10.1161/CIR.0000000000001052. Epub 2022 Jan 26.
5
Risk of Subsequent Stroke Among Patients Receiving Outpatient vs Inpatient Care for Transient Ischemic Attack: A Systematic Review and Meta-analysis.门诊与住院治疗短暂性脑缺血发作患者的后续卒中风险:系统评价和荟萃分析。
JAMA Netw Open. 2022 Jan 4;5(1):e2136644. doi: 10.1001/jamanetworkopen.2021.36644.
6
"Lessons Learned" Preventing Recurrent Ischemic Strokes through Secondary Prevention Programs: A Systematic Review.“经验教训”:通过二级预防项目预防复发性缺血性中风:一项系统综述
J Clin Med. 2021 Sep 17;10(18):4209. doi: 10.3390/jcm10184209.
7
Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019.1990—2019年全球、区域和国家的卒中负担及其风险因素:全球疾病负担研究2019的系统分析
Lancet Neurol. 2021 Oct;20(10):795-820. doi: 10.1016/S1474-4422(21)00252-0. Epub 2021 Sep 3.
8
Improving Stroke Risk Prediction in the General Population: A Comparative Assessment of Common Clinical Rules, a New Multimorbid Index, and Machine-Learning-Based Algorithms.提高普通人群中风风险预测能力:常见临床规则、新的多病指数和基于机器学习算法的比较评估。
Thromb Haemost. 2022 Jan;122(1):142-150. doi: 10.1055/a-1467-2993. Epub 2021 May 28.
9
Prospective validation of Canadian TIA Score and comparison with ABCD2 and ABCD2i for subsequent stroke risk after transient ischaemic attack: multicentre prospective cohort study.前瞻性验证加拿大 TIA 评分,并与 ABCD2 和 ABCD2i 进行比较,以评估短暂性脑缺血发作后后续卒中风险:多中心前瞻性队列研究。
BMJ. 2021 Feb 4;372:n49. doi: 10.1136/bmj.n49.
10
Incidence of Transient Ischemic Attack and Association With Long-term Risk of Stroke.短暂性脑缺血发作的发生率及其与长期卒中风险的关系。
JAMA. 2021 Jan 26;325(4):373-381. doi: 10.1001/jama.2020.25071.