Automatic extraction of cancer characteristics from free-text pathology reports for cancer notifications.

Authors

Nguyen Anthony, Moore Julie, Lawley Michael, Hansen David, Colquist Shoni

Affiliation

The Australian E-Health Research Centre, CSIRO ICT Centre, Brisbane, Australia.

Publication information

Stud Health Technol Inform. 2011;168:117-24.

PMID:21893919

Abstract

OBJECTIVE

To develop a system for the automatic classification of Cancer Registry notifications data from free-text pathology reports.

METHOD

Cancer notification items are extracted using a symbolic rule-based classification methodology, in which formal semantics are used to reason with the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT) concepts identified in the free text. Business rules for cancer notifications used by Cancer Registry coding staff were also incorporated, with the aim of mimicking Cancer Registry processes.
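
As a rough illustration of what reasoning over concepts identified in free text can look like, the sketch below checks whether any concept found in a report is a kind of malignant neoplasm by walking a toy is-a hierarchy. It is not the authors' system: the concept names, the `IS_A` table, and the single notifiability rule are hypothetical placeholders, and real SNOMED CT identifiers and Cancer Registry business rules are not reproduced here.

```python
# Toy is-a hierarchy. All names are hypothetical placeholders,
# not real SNOMED CT identifiers.
IS_A = {
    "adenocarcinoma_of_lung": {"malignant_neoplasm_of_lung"},
    "malignant_neoplasm_of_lung": {"malignant_neoplasm"},
    "malignant_neoplasm": {"neoplasm"},
    "benign_naevus": {"benign_neoplasm"},
    "benign_neoplasm": {"neoplasm"},
}


def ancestors(concept):
    """Return the concept plus every concept it is transitively a kind of."""
    seen = set()
    stack = [concept]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(IS_A.get(current, ()))
    return seen


def is_subsumed_by(concept, parent):
    """True if `concept` is `parent` or one of its descendants."""
    return parent in ancestors(concept)


def is_notifiable(report_concepts):
    # Assumed rule for illustration only: a report is notifiable
    # if any identified concept is a malignant neoplasm.
    return any(is_subsumed_by(c, "malignant_neoplasm") for c in report_concepts)


print(is_notifiable(["adenocarcinoma_of_lung"]))
print(is_notifiable(["benign_naevus"]))
```

The point of the subsumption check is that a rule written against a general concept also covers every more specific concept beneath it, so rules do not have to enumerate each tumour type.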

RESULTS

The system was developed on a corpus of 239 histology and cytology reports (with 60% notifiable reports), and then evaluated on an independent set of 300 reports (with 20% notifiable reports). Results show that the system can reliably classify notifiable reports with 96% and 100% specificity, and achieve an overall accuracy of 82% and 74% for classifying notification items from notifiable reports at a unit record level for the development and evaluation sets, respectively.
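
To make the reported proportions concrete, the sketch below converts the stated percentages into approximate report counts and spells out the usual definition of specificity. The helper names are illustrative, and because the abstract gives rounded percentages, the derived counts are estimates rather than figures reported by the authors.

```python
def notifiable_count(total_reports, notifiable_fraction):
    """Approximate number of notifiable reports from a rounded percentage."""
    return round(total_reports * notifiable_fraction)


def specificity(true_negatives, false_positives):
    """Share of non-notifiable reports correctly classified as non-notifiable."""
    return true_negatives / (true_negatives + false_positives)


dev_notifiable = notifiable_count(239, 0.60)    # development corpus
eval_notifiable = notifiable_count(300, 0.20)   # independent evaluation set
eval_non_notifiable = 300 - eval_notifiable

print(dev_notifiable, eval_notifiable, eval_non_notifiable)

# A specificity of 100% means no non-notifiable report was flagged as notifiable.
print(specificity(true_negatives=eval_non_notifiable, false_positives=0))
```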

CONCLUSION

Cancer Registries collect a multitude of data that requires manual review, slowing down the flow of information. Extracting and providing an automatically coded cancer pathology notification for review can lessen the reliance on expert clinical staff, improving the efficiency and availability of cancer information.

Similar articles

Automatic extraction of cancer characteristics from free-text pathology reports for cancer notifications.

Stud Health Technol Inform. 2011;168:117-24.

Classification of pathology reports for cancer registry notifications.

Stud Health Technol Inform. 2012;178:150-6.

Assessing the Utility of Automatic Cancer Registry Notifications Data Extraction from Free-Text Pathology Reports.

AMIA Annu Symp Proc. 2015 Nov 5;2015:953-62. eCollection 2015.

The impact of OCR accuracy on automated cancer classification of pathology reports.

Stud Health Technol Inform. 2012;178:250-6.

Symbolic rule-based classification of lung cancer stages from free-text pathology reports.

J Am Med Inform Assoc. 2010 Jul-Aug;17(4):440-5. doi: 10.1136/jamia.2010.003707.

Electronic capture and communication of synoptic cancer data elements from pathology reports: results of the Reporting Pathology Protocols 2 (RPP2) project.

J Registry Manag. 2009 Winter;36(4):117-24; quiz 163-5.

The registry case finding engine: an automated tool to identify cancer cases from unstructured, free-text pathology reports and clinical notes.

J Am Coll Surg. 2007 Nov;205(5):690-7. doi: 10.1016/j.jamcollsurg.2007.05.014. Epub 2007 Sep 10.

Automated Cancer Registry Notifications: Validation of a Medical Text Analytics System for Identifying Patients with Cancer from a State-Wide Pathology Repository.

AMIA Annu Symp Proc. 2017 Feb 10;2016:964-973. eCollection 2016.

SNOMED-encoded surgical pathology databases: a tool for epidemiologic investigation.

Mod Pathol. 1996 Sep;9(9):944-50.

Pattern-based information extraction from pathology reports for cancer registration.

Cancer Causes Control. 2010 Nov;21(11):1887-94. doi: 10.1007/s10552-010-9616-4. Epub 2010 Jul 23.

Cited by

Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review.

J Med Internet Res. 2021 Jan 26;23(1):e24594. doi: 10.2196/24594.

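Automated Cancer Registry Notifications: Validation of a Medical Text Analytics System for Identifying Patients with Cancer from a State-Wide Pathology Repository.

AMIA Annu Symp Proc. 2017 Feb 10;2016:964-973. eCollection 2016.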

Assessing the Utility of Automatic Cancer Registry Notifications Data Extraction from Free-Text Pathology Reports.

AMIA Annu Symp Proc. 2015 Nov 5;2015:953-62. eCollection 2015.

Automated Reconciliation of Radiology Reports and Discharge Summaries.

AMIA Annu Symp Proc. 2015 Nov 5;2015:775-84. eCollection 2015.

Active learning: a step towards automating medical concept extraction.

J Am Med Inform Assoc. 2016 Mar;23(2):289-96. doi: 10.1093/jamia/ocv069. Epub 2015 Aug 7.

Automatic classification of diseases from free-text death certificates for real-time surveillance.

BMC Med Inform Decis Mak. 2015 Jul 15;15:53. doi: 10.1186/s12911-015-0174-2.

Automatic detection of tweets reporting cases of influenza like illnesses in Australia.

Health Inf Sci Syst. 2015 Feb 24;3(Suppl 1 HISA Big Data in Biomedicine and Healthcare 2013 Con):S4. doi: 10.1186/2047-2501-3-S1-S4. eCollection 2015.

Automatic Classification of Free-Text Radiology Reports to Identify Limb Fractures using Machine Learning and the SNOMED CT Ontology.

AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:300-4. eCollection 2013.

Classification of cancer-related death certificates using machine learning.

Australas Med J. 2013 May 30;6(5):292-9. doi: 10.4066/AMJ.2013.1654. Print 2013.
