Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach

Authors

Pan Xiaoyi, Chen Boyu, Weng Heng, Gong Yongyi, Qu Yingying

Affiliations

School of Information Science and Technology, Guangdong University of Foreign Studies, Guangzhou, China.

Department of Big Data Research of Medicine, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China.

Publication information

JMIR Med Inform. 2020 Jul 27;8(7):e17652. doi: 10.2196/17652.

DOI: 10.2196/17652
PMID: 32716307
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7418025/
Abstract

BACKGROUND

Temporal information appears frequently in descriptions of disease progression, prescriptions, medication, surgical procedures, and discharge summaries in narrative clinical text. Accurate extraction and normalization of temporal expressions can improve the analysis and understanding of narrative clinical texts and thereby support clinical research and practice.

OBJECTIVE

The goal of the study was to propose a novel approach for extracting and normalizing temporal expressions from Chinese narrative clinical text.

METHODS

TNorm, a rule-based and pattern learning-based approach, was developed for automatic temporal expression extraction and normalization from unstructured Chinese clinical text. TNorm consists of three stages: extraction, classification, and normalization. It first applies a set of heuristic rules and automatically generated patterns to identify and extract temporal expressions from clinical texts. It then collects features of the extracted temporal expressions and uses machine learning algorithms to predict and classify their temporal types. Finally, the features are combined with rule-based and pattern learning-based methods to normalize the extracted temporal expressions.
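The three stages can be pictured with a minimal Python sketch. This is not TNorm itself: the two regular expressions, the type labels, the function names, and the sample sentence are hypothetical, and a simple rule stands in for the machine-learned type prediction that the abstract describes. Relative expressions are resolved against an assumed anchor date, such as an admission date.

```python
import re
from datetime import date, timedelta

# Hypothetical patterns for illustration only; not TNorm's actual rules.
ABSOLUTE = re.compile(r"(\d{4})年(\d{1,2})月(\d{1,2})日")  # e.g. 2019年3月5日
RELATIVE = re.compile(r"(\d+)(天|周)前")                    # e.g. 3天前 (3 days ago)


def extract(text):
    """Stage 1: find candidate temporal expressions, returned in text order."""
    hits = []
    for pattern in (ABSOLUTE, RELATIVE):
        for m in pattern.finditer(text):
            hits.append((m.start(), m.group(0), m))
    return sorted(hits, key=lambda h: h[0])


def classify(match):
    """Stage 2: assign a temporal type.

    A rule stands in here for the machine learning classifier in the abstract.
    """
    return "ABSOLUTE_DATE" if match.re is ABSOLUTE else "RELATIVE_DATE"


def normalize(match, kind, anchor):
    """Stage 3: map the expression to an ISO 8601 date string."""
    if kind == "ABSOLUTE_DATE":
        year, month, day = (int(g) for g in match.groups())
        return date(year, month, day).isoformat()  # raises ValueError on impossible dates
    amount, unit = int(match.group(1)), match.group(2)
    days = amount * (7 if unit == "周" else 1)
    return (anchor - timedelta(days=days)).isoformat()


def tnorm_sketch(text, anchor):
    """Run the three stages and return (expression, normalized value) pairs."""
    return [(s, normalize(m, classify(m), anchor)) for _, s, m in extract(text)]


if __name__ == "__main__":
    sample = "患者于2019年3月5日入院,3天前出现发热。"
    print(tnorm_sketch(sample, anchor=date(2019, 3, 5)))
```

In this toy version the type label only selects the normalization rule; in a learned setting the predicted type would come from features of the expression and its context.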

RESULTS

The evaluation dataset consists of 1459 Chinese discharge summaries from a domestic Grade A Class 3 hospital. The results show that TNorm, combining temporal expression extraction and temporal type prediction, achieves a precision of 0.8491, a recall of 0.8328, and an F1 score of 0.8409 in temporal expression normalization.
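As a consistency check, the reported F1 score can be recomputed from the reported precision and recall, assuming the standard definition of F1 as their harmonic mean. The function name below is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (the standard F1 definition)."""
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Precision and recall as reported in the abstract.
    print(round(f1_score(0.8491, 0.8328), 4))  # prints 0.8409
```

The recomputed value agrees with the reported F1 score to four decimal places.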

CONCLUSIONS

This study presents TNorm, an automatic approach that extracts and normalizes temporal expressions from Chinese narrative clinical texts. TNorm was evaluated on discharge summary data, and the results demonstrate its effectiveness in temporal expression normalization.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/799ea46b1db3/medinform_v8i7e17652_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/89ce57bf124b/medinform_v8i7e17652_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/7aaa33f505a7/medinform_v8i7e17652_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/bfd687df815e/medinform_v8i7e17652_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/ee7a60bc3d22/medinform_v8i7e17652_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/50da84dc9e41/medinform_v8i7e17652_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/5ae453378e8a/medinform_v8i7e17652_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/319e/7418025/a31a73479c06/medinform_v8i7e17652_fig8.jpg
