癌症分期信息的信息提取的跨医院可移植性。

Cross-hospital portability of information extraction of cancer staging information.

机构信息

Department of Computing and Information Systems, The University of Melbourne, Doug McDonell Building, Parkville, 3010 VIC, Australia.

Barwon Health, Geelong Hospital, 1/75 Bellerine Street, Geelong, 3220 VIC, Australia.

出版信息

Artif Intell Med. 2014 Sep;62(1):11-21. doi: 10.1016/j.artmed.2014.06.002. Epub 2014 Jun 21.

DOI:10.1016/j.artmed.2014.06.002

PMID:25001545

Abstract

OBJECTIVE

We address the task of extracting information from free-text pathology reports, focusing on staging information encoded by the TNM (tumour-node-metastases) and ACPS (Australian clinico-pathological stage) systems. Staging information is critical for diagnosing the extent of cancer in a patient and for planning individualised treatment. Extracting such information into more structured form saves time, improves reporting, and underpins the potential for automated decision support.

METHODS AND MATERIAL

We investigate the portability of a text mining model constructed from records from one health centre, by applying it directly to the extraction task over a set of records from a different health centre, with different reporting narrative characteristics. Other than a simple normalisation step on features associated with target labels, we apply the models from one system directly to the other.

RESULTS

The best F-scores for in-hospital experiments are 81%, 85%, and 94% (for staging T, N, and M respectively), while best cross-hospital F-scores reach 84%, 81%, and 91% for the same respective categories.

CONCLUSIONS

Our performance results compare favourably to the best levels reported in the literature, and--most relevant to our aim here--the cross-corpus results demonstrate the portability of the models we developed.

摘要

目的

从病理报告的自由文本中提取信息，重点关注 TNM（肿瘤-淋巴结-转移）和 ACPS（澳大利亚临床病理分期）系统编码的分期信息。分期信息对于诊断患者癌症的严重程度和制定个体化治疗方案至关重要。将此类信息提取到更结构化的形式中可以节省时间、提高报告质量，并为自动化决策支持提供潜力。

方法和材料

我们通过将模型直接应用于来自另一个健康中心的记录集，研究了从一个健康中心的记录构建的文本挖掘模型的可移植性，这些记录具有不同的报告叙述特征。除了对与目标标签相关的特征进行简单的规范化处理之外，我们直接将一个系统的模型应用于另一个系统。

结果

住院内实验的最佳 F 分数分别为 81%、85% 和 94%（分别用于分期 T、N 和 M），而最佳跨医院 F 分数分别为 84%、81% 和 91%，用于相同的相应类别。

结论

我们的性能结果与文献中报告的最佳水平相当，并且——与我们在这里的目标最相关——跨语料库的结果证明了我们开发的模型的可移植性。

相似文献

Cross-hospital portability of information extraction of cancer staging information.癌症分期信息的信息提取的跨医院可移植性。

Artif Intell Med. 2014 Sep;62(1):11-21. doi: 10.1016/j.artmed.2014.06.002. Epub 2014 Jun 21.

Extracting lung cancer staging descriptors from pathology reports: A generative language model approach.从病理报告中提取肺癌分期描述符：一种生成式语言模型方法。

J Biomed Inform. 2024 Sep;157:104720. doi: 10.1016/j.jbi.2024.104720. Epub 2024 Sep 2.

Collection of cancer stage data by classifying free-text medical reports.通过对自由文本医学报告进行分类来收集癌症分期数据。

J Am Med Inform Assoc. 2007 Nov-Dec;14(6):736-45. doi: 10.1197/jamia.M2130. Epub 2007 Aug 21.

Symbolic rule-based classification of lung cancer stages from free-text pathology reports.基于符号规则的肺癌分期的自由文本病理学报告分类。

J Am Med Inform Assoc. 2010 Jul-Aug;17(4):440-5. doi: 10.1136/jamia.2010.003707.

Text mining electronic hospital records to automatically classify admissions against disease: Measuring the impact of linking data sources.通过文本挖掘电子医院记录自动对疾病入院情况进行分类：衡量链接数据源的影响。

J Biomed Inform. 2016 Dec;64:158-167. doi: 10.1016/j.jbi.2016.10.008. Epub 2016 Oct 11.

University of California, Irvine-Pathology Extraction Pipeline: the pathology extraction pipeline for information extraction from pathology reports.加利福尼亚大学欧文分校病理提取管道：用于从病理报告中提取信息的病理提取管道。

Health Informatics J. 2014 Dec;20(4):288-305. doi: 10.1177/1460458213494032. Epub 2014 Aug 25.

Extraction from Medical Records.从医疗记录中提取信息。

Stud Health Technol Inform. 2019;261:62-67.

Capturing tumour stage in a cancer information database.在癌症信息数据库中记录肿瘤分期。

Cancer Prev Control. 1998 Dec;2(6):304-9.

Machine learning classification of surgical pathology reports and chunk recognition for information extraction noise reduction.用于信息提取降噪的手术病理报告的机器学习分类及语块识别

Artif Intell Med. 2016 Jun;70:77-83. doi: 10.1016/j.artmed.2016.06.001. Epub 2016 Jun 8.

ReCAP: Feasibility and Accuracy of Extracting Cancer Stage Information From Narrative Electronic Health Record Data.ReCAP：从电子病历数据中提取癌症分期信息的可行性和准确性。

J Oncol Pract. 2016 Feb;12(2):157-8; e169-7. doi: 10.1200/JOP.2015.004622. Epub 2015 Aug 25.

引用本文的文献

Enhancing Thoracic Surgery with AI: A Review of Current Practices and Emerging Trends.人工智能在胸外科中的应用：现状与新兴趋势综述。

Curr Oncol. 2024 Oct 17;31(10):6232-6244. doi: 10.3390/curroncol31100464.

Artificial Intelligence in the Screening, Diagnosis, and Management of Aortic Stenosis.人工智能在主动脉瓣狭窄的筛查、诊断及管理中的应用

Rev Cardiovasc Med. 2024 Jan 17;25(1):31. doi: 10.31083/j.rcm2501031. eCollection 2024 Jan.

Natural Language Processing Applied to Clinical Documentation in Post-acute Care Settings: A Scoping Review.自然语言处理在急性后护理环境中临床文档中的应用：一项范围综述

J Am Med Dir Assoc. 2024 Jan;25(1):69-83. doi: 10.1016/j.jamda.2023.09.006. Epub 2023 Oct 11.

Using Natural Language Processing and Machine Learning to Preoperatively Predict Lymph Node Metastasis for Non-Small Cell Lung Cancer With Electronic Medical Records: Development and Validation Study.利用自然语言处理和机器学习，通过电子病历术前预测非小细胞肺癌的淋巴结转移：开发与验证研究

JMIR Med Inform. 2022 Apr 25;10(4):e35475. doi: 10.2196/35475.

Rule-Based Information Extraction from Free-Text Pathology Reports Reveals Trends in South African Female Breast Cancer Molecular Subtypes and Ki67 Expression.基于规则的自由文本病理学报告信息提取揭示了南非女性乳腺癌分子亚型和 Ki67 表达的趋势。

Biomed Res Int. 2022 Jan 20;2022:6157861. doi: 10.1155/2022/6157861. eCollection 2022.

Information extraction for prognostic stage prediction from breast cancer medical records using NLP and ML.基于自然语言处理和机器学习的乳腺癌病历预后分期预测的信息提取。

Med Biol Eng Comput. 2021 Sep;59(9):1751-1772. doi: 10.1007/s11517-021-02399-7. Epub 2021 Jul 23.

Evaluating the Portability of an NLP System for Processing Echocardiograms: A Retrospective, Multi-site Observational Study.评估用于处理超声心动图的自然语言处理系统的可移植性：一项回顾性、多中心观察性研究。

AMIA Annu Symp Proc. 2020 Mar 4;2019:190-199. eCollection 2019.

Applications of Machine Learning Predictive Models in the Chronic Disease Diagnosis.机器学习预测模型在慢性病诊断中的应用。

J Pers Med. 2020 Mar 31;10(2):21. doi: 10.3390/jpm10020021.

From Sour Grapes to Low-Hanging Fruit: A Case Study Demonstrating a Practical Strategy for Natural Language Processing Portability.从酸葡萄到低垂的果实：一个展示自然语言处理可移植性实用策略的案例研究

AMIA Jt Summits Transl Sci Proc. 2018 May 18;2017:104-112. eCollection 2018.

Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.用于捕获和标准化非结构化临床信息的自然语言处理系统：一项系统综述。

J Biomed Inform. 2017 Sep;73:14-29. doi: 10.1016/j.jbi.2017.07.012. Epub 2017 Jul 17.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

癌症分期信息的信息提取的跨医院可移植性。

Cross-hospital portability of information extraction of cancer staging information.

机构信息

出版信息

OBJECTIVE

METHODS AND MATERIAL

RESULTS

CONCLUSIONS

目的

方法和材料

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献