• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从泌尿生殖肿瘤学的多样化电子记录数据集提取肿瘤预后因素。

Extracting tumour prognostic factors from a diverse electronic record dataset in genito-urinary oncology.

机构信息

Peter MacCallum Cancer Centre, Department of Radiation Oncology, Melbourne, Australia; University of Melbourne, Sir Peter MacCallum Department of Oncology, Melbourne, Australia; Austin Health, Department of Radiation Oncology, Melbourne, Australia.

The Australian e-Health Research Centre, CSIRO, Brisbane, Australia.

出版信息

Int J Med Inform. 2019 Jan;121:53-57. doi: 10.1016/j.ijmedinf.2018.10.008. Epub 2018 Oct 23.

DOI:10.1016/j.ijmedinf.2018.10.008
PMID:30545489
Abstract

OBJECTIVES

To implement a system for unsupervised extraction of tumor stage and prognostic data in patients with genitourinary cancers using clinicopathological and radiology text.

METHODS

A corpus of 1054 electronic notes (clinician notes, radiology reports and pathology reports) was annotated for tumor stage, prostate specific antigen (PSA) and Gleason grade. Annotations from five clinicians were reconciled to form a gold standard dataset. A training dataset of 386 documents was sequestered. The Medtex algorithm was adapted using the training dataset.

RESULTS

Adapted Medtex equaled or exceeded human performance in most annotations, except for implicit M stage (F-measure of 0.69 vs 0.84) and PSA (0.92 vs 0.96). Overall Medtex performed with an F-measure of 0.86 compared to human annotations of 0.92. There was significant inter-observer variability when comparing human annotators to the gold standard.

CONCLUSIONS

The Medtex algorithm performed similarly to human annotators for extracting stage and prognostic data from varied clinical texts.

摘要

目的

利用临床病理和放射学文本,实现一种用于从泌尿生殖系统癌症患者中提取肿瘤分期和预后数据的无监督系统。

方法

对 1054 份电子病历(临床医生记录、放射学报告和病理学报告)进行了肿瘤分期、前列腺特异性抗原(PSA)和 Gleason 分级的标注。从五名临床医生的标注中进行了协调,形成了一个黄金标准数据集。隔离了 386 份训练文档。使用训练数据集对 Medtex 算法进行了改编。

结果

改编后的 Medtex 在大多数标注中与人类表现相当或优于人类表现,除了隐性 M 期(F-度量值为 0.69 与 0.84)和 PSA(0.92 与 0.96)。总体而言,Medtex 的 F-度量值为 0.86,而人类标注的 F-度量值为 0.92。在将人类注释与黄金标准进行比较时,存在显著的观察者间变异性。

结论

Medtex 算法在从各种临床文本中提取分期和预后数据方面的表现与人类注释者相似。

相似文献

1
Extracting tumour prognostic factors from a diverse electronic record dataset in genito-urinary oncology.从泌尿生殖肿瘤学的多样化电子记录数据集提取肿瘤预后因素。
Int J Med Inform. 2019 Jan;121:53-57. doi: 10.1016/j.ijmedinf.2018.10.008. Epub 2018 Oct 23.
2
Regular expression-based learning to extract bodyweight values from clinical notes.基于正则表达式的学习方法,用于从临床记录中提取体重值。
J Biomed Inform. 2015 Apr;54:186-90. doi: 10.1016/j.jbi.2015.02.009. Epub 2015 Mar 5.
3
Extracting lung cancer staging descriptors from pathology reports: A generative language model approach.从病理报告中提取肺癌分期描述符:一种生成式语言模型方法。
J Biomed Inform. 2024 Sep;157:104720. doi: 10.1016/j.jbi.2024.104720. Epub 2024 Sep 2.
4
Using natural language processing to extract clinically useful information from Chinese electronic medical records.利用自然语言处理从中文电子病历中提取有临床价值的信息。
Int J Med Inform. 2019 Apr;124:6-12. doi: 10.1016/j.ijmedinf.2019.01.004. Epub 2019 Jan 7.
5
Detecting the presence of an indwelling urinary catheter and urinary symptoms in hospitalized patients using natural language processing.使用自然语言处理技术检测住院患者体内留置导尿管的情况及泌尿系统症状。
J Biomed Inform. 2017 Jul;71S:S39-S45. doi: 10.1016/j.jbi.2016.07.012. Epub 2016 Jul 9.
6
Facilitating clinical research through automation: Combining optical character recognition with natural language processing.通过自动化促进临床研究:结合光学字符识别和自然语言处理。
Clin Trials. 2022 Oct;19(5):504-511. doi: 10.1177/17407745221093621. Epub 2022 May 24.
7
Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator.印度一家医院的重症监护出院小结自由文本的密集标注及临床自然语言处理标注器的相关性能
J Med Syst. 2016 Aug;40(8):187. doi: 10.1007/s10916-016-0541-2. Epub 2016 Jun 24.
8
De-identification of clinical notes in French: towards a protocol for reference corpus development.法语临床记录的去识别化:迈向参考语料库开发协议
J Biomed Inform. 2014 Aug;50:151-61. doi: 10.1016/j.jbi.2013.12.014. Epub 2013 Dec 29.
9
Developing a cardiovascular disease risk factor annotated corpus of Chinese electronic medical records.开发具有心血管疾病风险因素注释的中文电子病历语料库。
BMC Med Inform Decis Mak. 2017 Aug 8;17(1):117. doi: 10.1186/s12911-017-0512-7.
10
[A customized method for information extraction from unstructured text data in the electronic medical records].[一种从电子病历非结构化文本数据中提取信息的定制方法]
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.

引用本文的文献

1
Prediction of blood culture outcome using hybrid neural network model based on electronic health records.基于电子健康记录的混合神经网络模型预测血培养结果。
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):121. doi: 10.1186/s12911-020-1113-4.
2
Generating high-quality data abstractions from scanned clinical records: text-mining-assisted extraction of endometrial carcinoma pathology features as proof of principle.从扫描的临床记录中生成高质量的数据摘要:文本挖掘辅助提取子宫内膜癌病理特征作为原理验证。
BMJ Open. 2020 Jun 11;10(6):e037740. doi: 10.1136/bmjopen-2020-037740.
3
A Collaborative Framework Based for Semantic Patients-Behavior Analysis and Highlight Topics Discovery of Alcoholic Beverages in Online Healthcare Forums.
基于协作框架的语义患者行为分析及在线医疗保健论坛中酒类话题发现
J Med Syst. 2020 Apr 7;44(5):101. doi: 10.1007/s10916-020-01547-0.
4
Machine learning for syndromic surveillance using veterinary necropsy reports.利用兽医剖检报告进行综合征监测的机器学习。
PLoS One. 2020 Feb 5;15(2):e0228105. doi: 10.1371/journal.pone.0228105. eCollection 2020.