Natural language processing for the assessment of cardiovascular disease comorbidities: The cardio-Canary comorbidity project.

Affiliations

Cardiovascular Division, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.

Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.

Publication information

Clin Cardiol. 2021 Sep;44(9):1296-1304. doi: 10.1002/clc.23687. Epub 2021 Aug 4.

DOI:10.1002/clc.23687
PMID:34347314
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8428009/
Abstract

OBJECTIVE

Accurate ascertainment of comorbidities is paramount in clinical research. While manual adjudication is labor-intensive and expensive, the adoption of electronic health records enables computational analysis of free-text documentation using natural language processing (NLP) tools.

HYPOTHESIS

We sought to develop highly accurate NLP modules to assess for the presence of five key cardiovascular comorbidities in a large electronic health record system.

METHODS

One thousand clinical notes were randomly selected from a cardiovascular registry at Mass General Brigham. Trained physicians manually adjudicated these notes for the following five comorbidities: hypertension, dyslipidemia, diabetes, coronary artery disease, and stroke/transient ischemic attack. Using the open-source Canary NLP system, five separate NLP modules were designed based on 800 "training-set" notes and validated on 200 "test-set" notes.
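The 800/200 partition described above can be illustrated with a short sketch. The abstract does not say how the notes were allocated, so the seeded shuffle, the function name `split_notes`, and the integer note identifiers below are all assumptions made for illustration, not the authors' procedure.

```python
import random


def split_notes(note_ids, n_train=800, seed=0):
    """Shuffle note IDs reproducibly and split them into training and test sets."""
    ids = list(note_ids)
    if n_train > len(ids):
        raise ValueError("n_train exceeds the number of notes")
    rng = random.Random(seed)  # fixed seed so the split can be reproduced
    rng.shuffle(ids)
    return ids[:n_train], ids[n_train:]


# Hypothetical identifiers standing in for the 1000 sampled notes.
train_ids, test_ids = split_notes(range(1000))
print(len(train_ids), len(test_ids))
```

Holding the test notes out of module design is what lets the reported accuracy figures be read as validation results rather than as a fit to the training data.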

RESULTS

Across the five NLP modules, sentence-level and note-level sensitivity, specificity, and positive predictive value were always greater than 85% and most often greater than 90%. Accuracy tended to be highest for conditions with greater diagnostic clarity (e.g., diabetes and hypertension) and slightly lower for conditions whose greater diagnostic challenges (e.g., myocardial infarction and embolic stroke) may lead to less definitive documentation.
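For readers unfamiliar with the three reported metrics, the following is a minimal sketch of how they are computed from physician (gold) labels and module output. The toy labels are invented for illustration and are not study data. The rule that a note counts as positive when any of its sentences is flagged is an assumption; the abstract does not state how sentence-level output was rolled up to note level.

```python
def diagnostic_metrics(gold, predicted):
    """Return sensitivity, specificity, and PPV for paired boolean labels."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g and p)          # true positives
    fp = sum(1 for g, p in pairs if not g and p)      # false positives
    fn = sum(1 for g, p in pairs if g and not p)      # false negatives
    tn = sum(1 for g, p in pairs if not g and not p)  # true negatives

    def ratio(num, den):
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),  # share of true cases detected
        "specificity": ratio(tn, tn + fp),  # share of non-cases correctly cleared
        "ppv": ratio(tp, tp + fp),          # share of flagged cases that are true
    }


def note_level(sentence_flags_by_note):
    """Assumed roll-up: a note is positive if any sentence in it is flagged."""
    return [any(flags) for flags in sentence_flags_by_note]


# Invented toy data: five notes, gold labels plus per-sentence module flags.
gold_notes = [True, True, False, False, True]
sentence_predictions = [[False, True], [False], [False, False], [True], [True, True]]

predicted_notes = note_level(sentence_predictions)
metrics = diagnostic_metrics(gold_notes, predicted_notes)
print(metrics)
```

In this toy example two of three true cases are detected and one of two non-cases is wrongly flagged, so sensitivity and PPV are both 2/3 and specificity is 1/2.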

CONCLUSION

We designed five open-source and highly accurate NLP modules that can be used to assess for the presence of important cardiovascular comorbidities in free-text health records. These modules have been placed in the public domain and can be used for clinical research, trial recruitment, and population management at any institution, as well as serve as the basis for further development of cardiovascular NLP tools.
