• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医学报告的关键词提取与结构化

Keyword extraction and structuralization of medical reports.

作者信息

Wu Pei-Hao, Yu Avon, Tsai Ching-Wei, Koh Jia-Ling, Kuo Chin-Chi, Chen Arbee L P

机构信息

1Department of Computer Science and Information Engineering, National Taiwan Normal University, Taipei, Taiwan.

2Big Data Center and Nephrology Division, China Medical University Hospital and College of Medicine, China Medical University, Taichung, Taiwan.

出版信息

Health Inf Sci Syst. 2020 Apr 3;8(1):18. doi: 10.1007/s13755-020-00108-6. eCollection 2020 Dec.

DOI:10.1007/s13755-020-00108-6
PMID:32269770
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7125292/
Abstract

PURPOSE

In recent years, patients usually accept more accurate and detailed examinations because of the rapid advances in medical technology. Many of the examination reports are not represented in numerical data, but text documents written by the medical examiners based on the observations from the instruments and biochemical tests. If the above-mentioned unstructured data can be organized as a report in a structured form, it will help doctors to understand a patient's status of the various examinations more efficiently. Besides, further association analysis on the structuralized data can be performed to identify potential factors that affect a disease.

METHODS

In this paper, from the pathology examination reports of renal diseases, we applied the POS tagging results of natural language analysis to automatically extract the keyword phrases. Then a medical dictionary for various examination items in an examination report is established, which is used as the basic information for retrieving the terms to construct a structured form of the report. Moreover, a topical probability modeling method is applied to automatically discover the candidate keyword phrases of the examination items from the reports. Finally, a system is implemented to generate the structured form for the various examination items in a report according to the constructed medical dictionary.

RESULTS AND CONCLUSION

The results of the experiments showed that the methods proposed in this paper can effectively construct a structural form of examination reports. Furthermore, the keywords of the popular examination items can be extracted correctly. The above techniques will help automatic processing and analysis of medical text reports.

摘要

目的

近年来,由于医学技术的飞速发展,患者通常会接受更准确、更详细的检查。许多检查报告并非以数值数据呈现,而是医学检查人员根据仪器观察和生化检测结果撰写的文本文件。如果上述非结构化数据能够以结构化形式整理成报告,将有助于医生更高效地了解患者各项检查的状况。此外,还可以对结构化数据进行进一步的关联分析,以识别影响疾病的潜在因素。

方法

在本文中,我们从肾脏疾病的病理检查报告中,应用自然语言分析的词性标注结果自动提取关键词组。然后建立一份检查报告中各类检查项目的医学词典,将其作为检索术语的基本信息,以构建报告的结构化形式。此外,应用主题概率建模方法从报告中自动发现检查项目的候选关键词组。最后,实现一个系统,根据构建的医学词典生成报告中各类检查项目的结构化形式。

结果与结论

实验结果表明,本文提出的方法能够有效地构建检查报告的结构化形式。此外,能够正确提取常见检查项目的关键词。上述技术将有助于医学文本报告的自动处理与分析。

相似文献

1
Keyword extraction and structuralization of medical reports.医学报告的关键词提取与结构化
Health Inf Sci Syst. 2020 Apr 3;8(1):18. doi: 10.1007/s13755-020-00108-6. eCollection 2020 Dec.
2
Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.设计一个基于 openEHR 的管道,使用自然语言处理提取和标准化非结构化临床数据。
Methods Inf Med. 2020 Dec;59(S 02):e64-e78. doi: 10.1055/s-0040-1716403. Epub 2020 Oct 14.
3
Structuring Legacy Pathology Reports by openEHR Archetypes to Enable Semantic Querying.通过openEHR原型构建传统病理报告以实现语义查询。
Methods Inf Med. 2017 May 18;56(3):230-237. doi: 10.3414/ME16-01-0073. Epub 2017 Feb 28.
4
A general text mining method to extract echocardiography measurement results from echocardiography documents.一种从超声心动图文档中提取超声心动图测量结果的通用文本挖掘方法。
Artif Intell Med. 2023 Sep;143:102584. doi: 10.1016/j.artmed.2023.102584. Epub 2023 May 20.
5
Word synonym relationships for text analysis: A graph-based approach.基于图的文本分析词同义词关系方法。
PLoS One. 2021 Jul 27;16(7):e0255127. doi: 10.1371/journal.pone.0255127. eCollection 2021.
6
[A customized method for information extraction from unstructured text data in the electronic medical records].[一种从电子病历非结构化文本数据中提取信息的定制方法]
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
7
A method for extracting tumor events from clinical CT examination reports.一种从临床 CT 检查报告中提取肿瘤事件的方法。
J Biomed Inform. 2023 Jun;142:104371. doi: 10.1016/j.jbi.2023.104371. Epub 2023 May 5.
8
Validation of deep learning natural language processing algorithm for keyword extraction from pathology reports in electronic health records.深度学习自然语言处理算法在电子病历中从病理报告中提取关键词的验证。
Sci Rep. 2020 Nov 20;10(1):20265. doi: 10.1038/s41598-020-77258-w.
9
Automatic RadLex coding of Chinese structured radiology reports based on text similarity ensemble.基于文本相似度集成的中文结构化放射学报告的自动 RadLex 编码。
BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):247. doi: 10.1186/s12911-021-01604-9.
10
[Technologies for Complex Intelligent Clinical Data Analysis].复杂智能临床数据分析技术
Vestn Ross Akad Med Nauk. 2016(2):160-71. doi: 10.15690/vramn663.

引用本文的文献

1
An Entity Extraction Pipeline for Medical Text Records Using Large Language Models: Analytical Study.基于大型语言模型的医疗文本记录实体抽取流水线:分析研究。
J Med Internet Res. 2024 Mar 29;26:e54580. doi: 10.2196/54580.
2
A heterogeneous multi-modal medical data fusion framework supporting hybrid data exploration.一个支持混合数据探索的异构多模态医学数据融合框架。
Health Inf Sci Syst. 2022 Aug 26;10(1):22. doi: 10.1007/s13755-022-00183-x. eCollection 2022 Dec.

本文引用的文献

1
Clinical information extraction applications: A literature review.临床信息提取应用:文献综述。
J Biomed Inform. 2018 Jan;77:34-49. doi: 10.1016/j.jbi.2017.11.011. Epub 2017 Nov 21.
2
Medical Question Answering for Clinical Decision Support.用于临床决策支持的医学问答
Proc ACM Int Conf Inf Knowl Manag. 2016 Oct;2016:297-306. doi: 10.1145/2983323.2983819.
3
Mayo Clinic/Renal Pathology Society Consensus Report on Pathologic Classification, Diagnosis, and Reporting of GN.梅奥诊所/肾脏病理学会关于肾小球肾炎病理分类、诊断及报告的共识报告
J Am Soc Nephrol. 2016 May;27(5):1278-87. doi: 10.1681/ASN.2015060612. Epub 2015 Nov 13.
4
Unfolding Physiological State: Mortality Modelling in Intensive Care Units.展开生理状态:重症监护病房的死亡率建模
KDD. 2014 Aug 24;2014:75-84. doi: 10.1145/2623330.2623742.
5
Risk stratification of ICU patients using topic models inferred from unstructured progress notes.利用从未结构化病程记录中推断出的主题模型对重症监护病房患者进行风险分层。
AMIA Annu Symp Proc. 2012;2012:505-11. Epub 2012 Nov 3.
6
Discovering peripheral arterial disease cases from radiology notes using natural language processing.使用自然语言处理技术从放射学记录中发现外周动脉疾病病例。
AMIA Annu Symp Proc. 2010 Nov 13;2010:722-6.
7
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
8
Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.生物医学文本到UMLS元词表的有效映射:MetaMap程序
Proc AMIA Symp. 2001:17-21.
9
Automatic structuring of radiology free-text reports.放射学自由文本报告的自动结构化
Radiographics. 2001 Jan-Feb;21(1):237-45. doi: 10.1148/radiographics.21.1.g01ja18237.