• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

临床叙述中相对和不完整时间表达的规范化。

Normalization of relative and incomplete temporal expressions in clinical narratives.

作者信息

Sun Weiyi, Rumshisky Anna, Uzuner Ozlem

机构信息

Department of Informatics, University at Albany, SUNY. Albany, NY

Department of Computer Science, University of Massachusetts Lowell. Lowell, MA.

出版信息

J Am Med Inform Assoc. 2015 Sep;22(5):1001-8. doi: 10.1093/jamia/ocu004. Epub 2015 Apr 12.

DOI:10.1093/jamia/ocu004
PMID:25868462
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4986666/
Abstract

OBJECTIVE

To improve the normalization of relative and incomplete temporal expressions (RI-TIMEXes) in clinical narratives.

METHODS

We analyzed the RI-TIMEXes in temporally annotated corpora and propose two hypotheses regarding the normalization of RI-TIMEXes in the clinical narrative domain: the anchor point hypothesis and the anchor relation hypothesis. We annotated the RI-TIMEXes in three corpora to study the characteristics of RI-TMEXes in different domains. This informed the design of our RI-TIMEX normalization system for the clinical domain, which consists of an anchor point classifier, an anchor relation classifier, and a rule-based RI-TIMEX text span parser. We experimented with different feature sets and performed an error analysis for each system component.

RESULTS

The annotation confirmed the hypotheses that we can simplify the RI-TIMEXes normalization task using two multi-label classifiers. Our system achieves anchor point classification, anchor relation classification, and rule-based parsing accuracy of 74.68%, 87.71%, and 57.2% (82.09% under relaxed matching criteria), respectively, on the held-out test set of the 2012 i2b2 temporal relation challenge.

DISCUSSION

Experiments with feature sets reveal some interesting findings, such as: the verbal tense feature does not inform the anchor relation classification in clinical narratives as much as the tokens near the RI-TIMEX. Error analysis showed that underrepresented anchor point and anchor relation classes are difficult to detect.

CONCLUSIONS

We formulate the RI-TIMEX normalization problem as a pair of multi-label classification problems. Considering only RI-TIMEX extraction and normalization, the system achieves statistically significant improvement over the RI-TIMEX results of the best systems in the 2012 i2b2 challenge.

摘要

目的

提高临床叙述中相对和不完整时间表达(RI-TIMEXes)的规范化程度。

方法

我们分析了带有时间标注的语料库中的RI-TIMEXes,并提出了关于临床叙述领域中RI-TIMEXes规范化的两个假设:锚点假设和锚点关系假设。我们在三个语料库中对RI-TIMEXes进行了标注,以研究不同领域中RI-TIMEXes的特征。这为我们针对临床领域的RI-TIMEX规范化系统的设计提供了依据,该系统由一个锚点分类器、一个锚点关系分类器和一个基于规则的RI-TIMEX文本跨度解析器组成。我们对不同的特征集进行了实验,并对每个系统组件进行了错误分析。

结果

标注证实了我们可以使用两个多标签分类器简化RI-TIMEXes规范化任务的假设。在2012年i2b2时间关系挑战赛的留出测试集上,我们的系统分别实现了74.68%、87.71%和57.2%(在宽松匹配标准下为82.09%)的锚点分类、锚点关系分类和基于规则的解析准确率。

讨论

对特征集的实验揭示了一些有趣的发现,例如:动词时态特征在临床叙述中对锚点关系分类的作用不如RI-TIMEX附近的词元。错误分析表明,代表性不足的锚点和锚点关系类别难以检测。

结论

我们将RI-TIMEX规范化问题表述为一对多标签分类问题。仅考虑RI-TIMEX提取和规范化,该系统相对于2012年i2b2挑战赛中最佳系统的RI-TIMEX结果有统计学上的显著改进。

相似文献

1
Normalization of relative and incomplete temporal expressions in clinical narratives.临床叙述中相对和不完整时间表达的规范化。
J Am Med Inform Assoc. 2015 Sep;22(5):1001-8. doi: 10.1093/jamia/ocu004. Epub 2015 Apr 12.
2
Short-Term Memory Impairment短期记忆障碍
3
Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.评估临床文本中的时间关系:2012 i2b2 挑战赛。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):806-13. doi: 10.1136/amiajnl-2013-001628. Epub 2013 Apr 5.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Multicriteria Optimization of Language Models for Heart Failure With Preserved Ejection Fraction Symptom Detection in Spanish Electronic Health Records: Comparative Modeling Study.西班牙电子健康记录中射血分数保留的心力衰竭症状检测语言模型的多标准优化:比较建模研究
J Med Internet Res. 2025 Jul 17;27:e76433. doi: 10.2196/76433.
6
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
7
Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods.使用基于转换器的自然语言处理方法识别与糖尿病视网膜病变相关的临床概念及其属性。
BMC Med Inform Decis Mak. 2022 Sep 27;22(Suppl 3):255. doi: 10.1186/s12911-022-01996-2.
8
Clinical judgement by primary care physicians for the diagnosis of all-cause dementia or cognitive impairment in symptomatic people.初级保健医生对有症状人群进行全因痴呆或认知障碍诊断的临床判断。
Cochrane Database Syst Rev. 2022 Jun 16;6(6):CD012558. doi: 10.1002/14651858.CD012558.pub2.
9
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
10
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

引用本文的文献

1
Defining Phenotypes from Clinical Data to Drive Genomic Research.从临床数据定义表型以推动基因组研究。
Annu Rev Biomed Data Sci. 2018 Jul;1:69-92. doi: 10.1146/annurev-biodatasci-080917-013335. Epub 2018 Apr 25.
2
Temporal Expression Classification and Normalization From Chinese Narrative Clinical Texts: Pattern Learning Approach.基于中文叙事临床文本的时间表达分类与归一化:模式学习方法
JMIR Med Inform. 2020 Jul 27;8(7):e17652. doi: 10.2196/17652.
3
Temporal information extraction from mental health records to identify duration of untreated psychosis.从心理健康记录中提取时间信息,以确定未治疗精神病的持续时间。
J Biomed Semantics. 2020 Mar 10;11(1):2. doi: 10.1186/s13326-020-00220-2.
4
A bibliometric analysis of natural language processing in medical research.自然语言处理在医学研究中的文献计量分析。
BMC Med Inform Decis Mak. 2018 Mar 22;18(Suppl 1):14. doi: 10.1186/s12911-018-0594-x.
5
Clinical Natural Language Processing in 2015: Leveraging the Variety of Texts of Clinical Interest.2015年的临床自然语言处理:利用各类具有临床意义的文本
Yearb Med Inform. 2016 Nov 10(1):234-239. doi: 10.15265/IY-2016-049.
6
An information model for computable cancer phenotypes.一种可计算癌症表型的信息模型。
BMC Med Inform Decis Mak. 2016 Sep 15;16(1):121. doi: 10.1186/s12911-016-0358-4.
7
Optimizing annotation resources for natural language de-identification via a game theoretic framework.通过博弈论框架优化用于自然语言去识别的注释资源。
J Biomed Inform. 2016 Jun;61:97-109. doi: 10.1016/j.jbi.2016.03.019. Epub 2016 Mar 25.

本文引用的文献

1
Classifying temporal relations in clinical data: a hybrid, knowledge-rich approach.临床数据中的时间关系分类:一种混合的、知识丰富的方法。
J Biomed Inform. 2013 Dec;46 Suppl(0):S29-S39. doi: 10.1016/j.jbi.2013.08.003. Epub 2013 Aug 14.
2
MedTime: a temporal information extraction system for clinical narratives.MedTime:一个用于临床叙述的时间信息提取系统。
J Biomed Inform. 2013 Dec;46 Suppl:S20-S28. doi: 10.1016/j.jbi.2013.07.012. Epub 2013 Jul 31.
3
Annotating temporal information in clinical narratives.标注临床叙述中的时间信息。
J Biomed Inform. 2013 Dec;46 Suppl(0):S5-S12. doi: 10.1016/j.jbi.2013.07.004. Epub 2013 Jul 19.
4
A flexible framework for recognizing events, temporal expressions, and temporal relations in clinical text.一种用于识别临床文本中的事件、时间表达式和时间关系的灵活框架。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):867-75. doi: 10.1136/amiajnl-2013-001619. Epub 2013 May 18.
5
Temporal reasoning over clinical text: the state of the art.临床文本的时间推理:现状。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):814-9. doi: 10.1136/amiajnl-2013-001760. Epub 2013 May 15.
6
Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives.从临床叙述中提取时间表达式和事件的规则与机器学习相结合。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):859-66. doi: 10.1136/amiajnl-2013-001625. Epub 2013 Apr 20.
7
A hybrid system for temporal information extraction from clinical text.一种从临床文本中提取时间信息的混合系统。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):828-35. doi: 10.1136/amiajnl-2013-001635. Epub 2013 Apr 9.
8
Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.评估临床文本中的时间关系:2012 i2b2 挑战赛。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):806-13. doi: 10.1136/amiajnl-2013-001628. Epub 2013 Apr 5.
9
Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification.从临床文本中进行全面的时间信息检测:医学事件、时间和 TLINK 识别。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):836-42. doi: 10.1136/amiajnl-2013-001622. Epub 2013 Apr 4.
10
An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge.一个用于识别出院小结中时间关系的端到端系统:2012 年 i2b2 挑战赛。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):849-58. doi: 10.1136/amiajnl-2012-001607. Epub 2013 Mar 6.