• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过癌症自然语言处理的范围综述评估癌症研究和患者护理的电子健康记录。

Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.

机构信息

Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN.

Department of Hematology/Oncology, Mayo Clinic, Scottsdale, AZ.

出版信息

JCO Clin Cancer Inform. 2022 Jul;6:e2200006. doi: 10.1200/CCI.22.00006.

DOI:10.1200/CCI.22.00006
PMID:35917480
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9470142/
Abstract

PURPOSE

The advancement of natural language processing (NLP) has promoted the use of detailed textual data in electronic health records (EHRs) to support cancer research and to facilitate patient care. In this review, we aim to assess EHR for cancer research and patient care by using the Minimal Common Oncology Data Elements (mCODE), which is a community-driven effort to define a minimal set of data elements for cancer research and practice. Specifically, we aim to assess the alignment of NLP-extracted data elements with mCODE and review existing NLP methodologies for extracting said data elements.

METHODS

Published literature studies were searched to retrieve cancer-related NLP articles that were written in English and published between January 2010 and September 2020 from main literature databases. After the retrieval, articles with EHRs as the data source were manually identified. A charting form was developed for relevant study analysis and used to categorize data including four main topics: metadata, EHR data and targeted cancer types, NLP methodology, and oncology data elements and standards.

RESULTS

A total of 123 publications were selected finally and included in our analysis. We found that cancer research and patient care require some data elements beyond mCODE as expected. Transparency and reproductivity are not sufficient in NLP methods, and inconsistency in NLP evaluation exists.

CONCLUSION

We conducted a comprehensive review of cancer NLP for research and patient care using EHRs data. Issues and barriers for wide adoption of cancer NLP were identified and discussed.

摘要

目的

自然语言处理(NLP)的进步促进了电子健康记录(EHR)中详细文本数据的使用,以支持癌症研究和促进患者护理。在本综述中,我们旨在使用最小共同肿瘤学数据元素(mCODE)评估 EHR 在癌症研究和患者护理中的应用,mCODE 是一个社区驱动的努力,旨在定义一组用于癌症研究和实践的最小数据元素。具体来说,我们旨在评估 NLP 提取的数据元素与 mCODE 的一致性,并审查用于提取这些数据元素的现有 NLP 方法。

方法

从主要文献数据库中检索了 2010 年 1 月至 2020 年 9 月期间发表的以英语撰写的与癌症相关的 NLP 文章,以进行文献检索。检索后,手动确定了以 EHR 为数据源的文章。为相关研究分析开发了一个图表表格,用于对包括四个主要主题的数据进行分类:元数据、EHR 数据和目标癌症类型、NLP 方法以及肿瘤学数据元素和标准。

结果

最终共选择了 123 篇出版物并纳入我们的分析。我们发现,癌症研究和患者护理需要一些超出 mCODE 的数据元素。NLP 方法的透明度和可重复性不足,并且存在 NLP 评估的不一致性。

结论

我们对使用 EHR 数据进行癌症 NLP 研究和患者护理进行了全面综述。确定并讨论了癌症 NLP 广泛应用的问题和障碍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/6e5fd0e84e49/cci-6-e2200006-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/161d191aab49/cci-6-e2200006-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/f0e5773ec8c7/cci-6-e2200006-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/32babc11b5ec/cci-6-e2200006-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/addaedd49577/cci-6-e2200006-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/be0abc8b1186/cci-6-e2200006-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/6e5fd0e84e49/cci-6-e2200006-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/161d191aab49/cci-6-e2200006-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/f0e5773ec8c7/cci-6-e2200006-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/32babc11b5ec/cci-6-e2200006-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/addaedd49577/cci-6-e2200006-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/be0abc8b1186/cci-6-e2200006-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/9470142/6e5fd0e84e49/cci-6-e2200006-g012.jpg

相似文献

1
Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.通过癌症自然语言处理的范围综述评估癌症研究和患者护理的电子健康记录。
JCO Clin Cancer Inform. 2022 Jul;6:e2200006. doi: 10.1200/CCI.22.00006.
2
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.电子健康记录中自由文本叙述的症状的自然语言处理:系统评价。
J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173.
3
A frame semantic overview of NLP-based information extraction for cancer-related EHR notes.基于框架语义的自然语言处理信息抽取在癌症相关电子病历中的应用综述。
J Biomed Inform. 2019 Dec;100:103301. doi: 10.1016/j.jbi.2019.103301. Epub 2019 Oct 4.
4
A scoping review of publicly available language tasks in clinical natural language processing.临床自然语言处理中公开可用语言任务的范围综述
J Am Med Inform Assoc. 2022 Sep 12;29(10):1797-1806. doi: 10.1093/jamia/ocac127.
5
Practical use case of natural language processing for observational clinical research data retrieval from electronic health records: AssistMED project.自然语言处理在观察性临床研究电子病历数据检索中的实际应用案例:AssistMED 项目。
Pol Arch Intern Med. 2024 May 28;134(5). doi: 10.20452/pamw.16704. Epub 2024 Mar 19.
6
Natural language processing with machine learning methods to analyze unstructured patient-reported outcomes derived from electronic health records: A systematic review.使用机器学习方法进行自然语言处理,以分析来自电子健康记录的非结构化患者报告结局:系统评价。
Artif Intell Med. 2023 Dec;146:102701. doi: 10.1016/j.artmed.2023.102701. Epub 2023 Nov 1.
7
NLP for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review.用于癌症研究中分析电子健康记录和临床笔记的自然语言处理:综述
J Pain Symptom Manage. 2025 May;69(5):e374-e394. doi: 10.1016/j.jpainsymman.2025.01.019. Epub 2025 Jan 31.
8
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
9
The Growing Impact of Natural Language Processing in Healthcare and Public Health.自然语言处理在医疗保健和公共卫生领域的影响日益扩大。
Inquiry. 2024 Jan-Dec;61:469580241290095. doi: 10.1177/00469580241290095.
10
Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP.使用CLAMP开发用于病理报告的可定制癌症信息提取模块。
Stud Health Technol Inform. 2019 Aug 21;264:1041-1045. doi: 10.3233/SHTI190383.

引用本文的文献

1
Natural language processing for local, regional, and distant breast cancer relapse identification in pathology reports.用于在病理报告中识别局部、区域和远处乳腺癌复发的自然语言处理技术。
Breast Cancer Res Treat. 2025 Sep 2. doi: 10.1007/s10549-025-07801-8.
2
Deep Learning Model for Natural Language to Assess Effectiveness of Patients With Non-Muscle Invasive Bladder Cancer Receiving Intravesical Bacillus Calmette-Guérin Therapy.用于评估非肌层浸润性膀胱癌患者接受膀胱内卡介苗治疗有效性的自然语言深度学习模型。
JCO Clin Cancer Inform. 2025 Jun;9:e2400249. doi: 10.1200/CCI-24-00249. Epub 2025 Jun 27.
3
Using Electronic Health Records to Classify Cancer Site and Metastasis.

本文引用的文献

1
Quantitating and assessing interoperability between electronic health records.定量评估电子健康记录之间的互操作性。
J Am Med Inform Assoc. 2022 Apr 13;29(5):753-760. doi: 10.1093/jamia/ocab289.
2
Challenges for assessing replicability in preclinical cancer biology.评估临床前癌症生物学可重复性面临的挑战。
Elife. 2021 Dec 7;10:e67995. doi: 10.7554/eLife.67995.
3
Improving Cancer Data Interoperability: The Promise of the Minimal Common Oncology Data Elements (mCODE) Initiative.改善癌症数据互操作性:最小共同肿瘤学数据元素 (mCODE) 倡议的承诺。
利用电子健康记录对癌症部位和转移进行分类。
Appl Clin Inform. 2025 May;16(3):556-568. doi: 10.1055/a-2544-3117. Epub 2025 Jun 18.
4
Personalized Medical Approach in Gastrointestinal Surgical Oncology: Current Trends and Future Perspectives.胃肠外科肿瘤学中的个性化医疗方法:当前趋势与未来展望
J Pers Med. 2025 Apr 27;15(5):175. doi: 10.3390/jpm15050175.
5
From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.从手动临床标准到机器学习算法:比较源自不同电子健康记录数据模式的结局终点。
PLOS Digit Health. 2025 May 14;4(5):e0000755. doi: 10.1371/journal.pdig.0000755. eCollection 2025 May.
6
A Narrative Review on the Application of Large Language Models to Support Cancer Care and Research.关于应用大语言模型支持癌症护理与研究的叙述性综述。
Yearb Med Inform. 2024 Aug;33(1):90-98. doi: 10.1055/s-0044-1800726. Epub 2025 Apr 8.
7
A scoping review of OMOP CDM adoption for cancer research using real world data.一项关于使用真实世界数据将观察性医疗结果合作组织通用数据模型(OMOP CDM)应用于癌症研究的范围综述。
NPJ Digit Med. 2025 Apr 7;8(1):189. doi: 10.1038/s41746-025-01581-7.
8
Real-World Evidence on Adverse Events and Healthcare Resource Utilization in Patients with Chronic Lymphocytic Leukaemia in Spain Using Natural Language Processing: The SRealCLL Study.利用自然语言处理技术获取的西班牙慢性淋巴细胞白血病患者不良事件和医疗资源利用情况的真实世界证据:SRealCLL研究
Cancers (Basel). 2024 Nov 29;16(23):4004. doi: 10.3390/cancers16234004.
9
Usage of the National Cancer Institute Cancer Research Data Commons by Researchers: A Scoping Review of the Literature.研究人员对国立癌症研究所癌症研究数据共享中心的使用情况:文献的范围综述。
JCO Clin Cancer Inform. 2024 Nov;8:e2400116. doi: 10.1200/CCI.24.00116. Epub 2024 Nov 13.
10
Adoption of the OMOP CDM for Cancer Research using Real-world Data: Current Status and Opportunities.使用真实世界数据采用OMOP通用数据模型进行癌症研究:现状与机遇
medRxiv. 2024 Aug 23:2024.08.23.24311950. doi: 10.1101/2024.08.23.24311950.
JCO Clin Cancer Inform. 2020 Oct;4:993-1001. doi: 10.1200/CCI.20.00059.
4
Phenotyping severity of patient-centered outcomes using clinical notes: A prostate cancer use case.利用临床记录对以患者为中心的结局的严重程度进行表型分析:一个前列腺癌的应用案例。
Learn Health Syst. 2020 Jul 17;4(4):e10237. doi: 10.1002/lrh2.10237. eCollection 2020 Oct.
5
Novel Method to Flag Cardiac Implantable Device Infections by Integrating Text Mining With Structured Data in the Veterans Health Administration's Electronic Medical Record.利用文本挖掘技术与退伍军人事务部电子病历中的结构化数据相结合,为心脏植入式电子设备感染标记的新方法。
JAMA Netw Open. 2020 Sep 1;3(9):e2012264. doi: 10.1001/jamanetworkopen.2020.12264.
6
Implementation of an Artificial Intelligence-Based Double Read System in Capturing Pulmonary Nodule Discrepancy in CT Studies.基于人工智能的双读系统在 CT 研究中捕获肺部结节差异的实现。
Curr Probl Diagn Radiol. 2021 Mar-Apr;50(2):119-122. doi: 10.1067/j.cpradiol.2020.07.006. Epub 2020 Jul 27.
7
Automatic Incident Triage in Radiation Oncology Incident Learning System.放射肿瘤学事件学习系统中的自动事件分类
Healthcare (Basel). 2020 Aug 14;8(3):272. doi: 10.3390/healthcare8030272.
8
Thyroid Ultrasound Reports: Will the Thyroid Imaging, Reporting, and Data System Improve Natural Language Processing Capture of Critical Thyroid Nodule Features?甲状腺超声报告:甲状腺成像报告和数据系统是否会改善关键甲状腺结节特征的自然语言处理捕获?
J Surg Res. 2020 Dec;256:557-563. doi: 10.1016/j.jss.2020.07.015. Epub 2020 Aug 13.
9
Clinical concept extraction: A methodology review.临床概念提取:方法学综述。
J Biomed Inform. 2020 Sep;109:103526. doi: 10.1016/j.jbi.2020.103526. Epub 2020 Aug 6.
10
Natural Language Processing to Ascertain Cancer Outcomes From Medical Oncologist Notes.自然语言处理从肿瘤医生的病历中确定癌症结果。
JCO Clin Cancer Inform. 2020 Aug;4:680-690. doi: 10.1200/CCI.20.00020.