• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自然语言处理从中文电子病历中提取有临床价值的信息。

Using natural language processing to extract clinically useful information from Chinese electronic medical records.

机构信息

Department of Hepatobiliary Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, PR China.

Key Laboratory of Molecular Biology for Infectious Diseases (Ministry of Education), Institute for Viral Hepatitis, Department of Infectious Diseases, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, PR China.

出版信息

Int J Med Inform. 2019 Apr;124:6-12. doi: 10.1016/j.ijmedinf.2019.01.004. Epub 2019 Jan 7.

DOI:10.1016/j.ijmedinf.2019.01.004
PMID:30784428
Abstract

AIMS

To develop a natural language processing (NLP)-based algorithm for extracting clinically useful information for patients with hepatocellular carcinoma (HCC) from Chinese electronic medical records (EMRs) and use these data for the assessment of HCC staging.

MATERIALS AND METHODS

Clinical documents, including operation notes, radiology and pathology reports, of 92 HCC patients were collected from Chinese EMRs. We randomly grouped these patients into training (n = 60) and testing (n = 32) datasets. Rule-based and hybrid methods for extracting information were developed using the training set of manually-annotated operation notes. The method with better performance was used to process other documents. The performance of the algorithm was assessed via calculating the precision, recall and F-score for exact-boundary and partial-boundary matching strategies. The utility of clinically useful information for the HCC staging was assessed in comparison with that manually reviewed.

RESULTS

For operation notes, the rule-based and hybrid methods had a precision, recall and F-score ≥80% when the exact-boundary and partial-boundary matching strategies were applied to the testing dataset. By using the rule-based method (which has better performance than the hybrid method), three other types of documents also obtained good performance. When the extracted clinically useful information was applied for the HCC staging, the concordance rate with the manual review was 75%.

CONCLUSION

A NLP system was developed for clinical information extraction and HCC staging based on EMRs, and the results indicate that Chinese NLP has potential utility in clinical research.

摘要

目的

开发一种基于自然语言处理(NLP)的算法,从中文电子病历(EMR)中提取肝细胞癌(HCC)患者的临床有用信息,并利用这些数据评估 HCC 分期。

材料和方法

从中国 EMR 中收集了 92 例 HCC 患者的临床文档,包括手术记录、影像学和病理学报告。我们将这些患者随机分为训练集(n=60)和测试集(n=32)。使用训练集中手动标注的手术记录开发了基于规则和混合方法来提取信息。使用性能更好的方法来处理其他文档。通过计算精确边界和部分边界匹配策略的精度、召回率和 F 分数来评估算法的性能。将临床有用信息用于 HCC 分期的效用与手动审查进行了比较。

结果

对于手术记录,当在测试数据集上应用精确边界和部分边界匹配策略时,基于规则和混合方法的精度、召回率和 F 分数均≥80%。通过使用基于规则的方法(性能优于混合方法),还可以从其他三种类型的文档中获得良好的性能。当应用提取的临床有用信息进行 HCC 分期时,与手动审查的一致性率为 75%。

结论

开发了一种基于 EMR 的临床信息提取和 HCC 分期的 NLP 系统,结果表明中文 NLP 在临床研究中有潜在的应用价值。

相似文献

1
Using natural language processing to extract clinically useful information from Chinese electronic medical records.利用自然语言处理从中文电子病历中提取有临床价值的信息。
Int J Med Inform. 2019 Apr;124:6-12. doi: 10.1016/j.ijmedinf.2019.01.004. Epub 2019 Jan 7.
2
Extracting important information from Chinese Operation Notes with natural language processing methods.运用自然语言处理方法从中文手术记录中提取重要信息。
J Biomed Inform. 2014 Apr;48:130-6. doi: 10.1016/j.jbi.2013.12.017. Epub 2014 Jan 31.
3
[A customized method for information extraction from unstructured text data in the electronic medical records].[一种从电子病历非结构化文本数据中提取信息的定制方法]
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
4
Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing.使用自然语言处理技术从行政数据和电子健康记录中验证肝细胞癌病例发现算法
Med Care. 2016 Feb;54(2):e9-14. doi: 10.1097/MLR.0b013e3182a30373.
5
Extraction of sleep information from clinical notes of Alzheimer's disease patients using natural language processing.使用自然语言处理从阿尔茨海默病患者的临床记录中提取睡眠信息。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2217-2227. doi: 10.1093/jamia/ocae177.
6
CMedTEX: A Rule-based Temporal Expression Extraction and Normalization System for Chinese Clinical Notes.CMedTEX:一个基于规则的中文临床病历时间表达式提取与规范化系统。
AMIA Annu Symp Proc. 2017 Feb 10;2016:818-826. eCollection 2016.
7
Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.从非结构化临床记录中提取症状的任务定义、标注数据集和监督自然语言处理模型。
J Biomed Inform. 2020 Feb;102:103354. doi: 10.1016/j.jbi.2019.103354. Epub 2019 Dec 12.
8
Data for registry and quality review can be retrospectively collected using natural language processing from unstructured charts of arthroplasty patients.可以使用自然语言处理从关节置换患者的非结构化图表中回顾性地收集注册和质量审查数据。
Bone Joint J. 2020 Jul;102-B(7_Supple_B):99-104. doi: 10.1302/0301-620X.102B7.BJJ-2019-1574.R1.
9
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
10
Natural language processing of radiology reports for identification of skeletal site-specific fractures.放射科报告的自然语言处理以识别骨骼部位特异性骨折。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):73. doi: 10.1186/s12911-019-0780-5.

引用本文的文献

1
Performance of Natural Language Processing for Information Extraction From Electronic Health Records Within Cancer: Systematic Review.自然语言处理在癌症电子健康记录信息提取中的性能:系统评价
JMIR Med Inform. 2025 Sep 12;13:e68707. doi: 10.2196/68707.
2
Improving Clinical Documentation with Artificial Intelligence: A Systematic Review.利用人工智能改善临床文档记录:一项系统综述。
Perspect Health Inf Manag. 2024 Jun 1;21(2):1d. eCollection 2024 Summer-Fall.
3
Enhancing Thoracic Surgery with AI: A Review of Current Practices and Emerging Trends.
人工智能在胸外科中的应用:现状与新兴趋势综述。
Curr Oncol. 2024 Oct 17;31(10):6232-6244. doi: 10.3390/curroncol31100464.
4
Health Care Language Models and Their Fine-Tuning for Information Extraction: Scoping Review.医疗保健语言模型及其在信息提取方面的微调:范围综述。
JMIR Med Inform. 2024 Oct 21;12:e60164. doi: 10.2196/60164.
5
Prediction of candidemia with machine learning techniques: state of the art.机器学习技术预测念珠菌血症:现状。
Future Microbiol. 2024 Jul 2;19(10):931-940. doi: 10.2217/fmb-2023-0269. Epub 2024 May 20.
6
Evaluation of an artificial intelligence-based clinical trial matching system in Chinese patients with hepatocellular carcinoma: a retrospective study.基于人工智能的临床试验匹配系统在中国肝细胞癌患者中的评估:一项回顾性研究。
BMC Cancer. 2024 Feb 22;24(1):246. doi: 10.1186/s12885-024-11959-7.
7
Automatic Detection of Distant Metastasis Mentions in Radiology Reports in Spanish.自动检测西班牙语放射学报告中的远处转移提及。
JCO Clin Cancer Inform. 2024 Jan;8:e2300130. doi: 10.1200/CCI.23.00130.
8
Extraction and Quantification of Words Representing Degrees of Diseases: Combining the Fuzzy C-Means Method and Gaussian Membership.表示疾病程度的词汇提取与量化:结合模糊C均值法与高斯隶属度
JMIR Form Res. 2022 Nov 18;6(11):e38677. doi: 10.2196/38677.
9
Advancing the development of real-world data for healthcare research in China: challenges and opportunities.推进中国医疗保健研究中真实世界数据的发展:挑战与机遇。
BMJ Open. 2022 Jul 29;12(7):e063139. doi: 10.1136/bmjopen-2022-063139.
10
Using Natural Language Processing and Machine Learning to Preoperatively Predict Lymph Node Metastasis for Non-Small Cell Lung Cancer With Electronic Medical Records: Development and Validation Study.利用自然语言处理和机器学习,通过电子病历术前预测非小细胞肺癌的淋巴结转移:开发与验证研究
JMIR Med Inform. 2022 Apr 25;10(4):e35475. doi: 10.2196/35475.