Suppr超能文献

从用于癌症通报的自由文本病理报告中自动提取癌症特征。

Automatic extraction of cancer characteristics from free-text pathology reports for cancer notifications.

作者信息

Nguyen Anthony, Moore Julie, Lawley Michael, Hansen David, Colquist Shoni

机构信息

The Australian E-Health Research Centre, CSIRO ICT Centre, Brisbane, Australia.

出版信息

Stud Health Technol Inform. 2011;168:117-24.

Abstract

OBJECTIVE

To develop a system for the automatic classification of Cancer Registry notifications data from free-text pathology reports.

METHOD

The underlying technology used for the extraction of cancer notification items is based on the symbolic rule-based classification methodology, whereby formal semantics are used to reason with the systematised nomenclature of medicine - clinical terms (SNOMED CT) concepts identified in the free text. Business rules for cancer notifications used by Cancer Registry coding staff were also incorporated with the aim to mimic Cancer Registry processes.

RESULTS

The system was developed on a corpus of 239 histology and cytology reports (with 60% notifiable reports), and then evaluated on an independent set of 300 reports (with 20% notifiable reports). Results show that the system can reliably classify notifiable reports with 96% and 100% specificity, and achieve an overall accuracy of 82% and 74% for classifying notification items from notifiable reports at a unit record level from the development and evaluation set, respectively.

CONCLUSION

Cancer Registries collect a multitude of data that requires manual review, slowing down the flow of information. Extracting and providing an automatically coded cancer pathology notification for review can lessen the reliance on expert clinical staff, improving the efficiency and availability of cancer information.

摘要

目的

开发一个用于对来自自由文本病理报告的癌症登记通知数据进行自动分类的系统。

方法

用于提取癌症通知项目的基础技术基于基于符号规则的分类方法,即使用形式语义对自由文本中识别出的医学系统化命名法——临床术语(SNOMED CT)概念进行推理。癌症登记编码人员使用的癌症通知业务规则也被纳入,旨在模拟癌症登记流程。

结果

该系统基于239份组织学和细胞学报告(其中60%为应报告报告)的语料库开发,然后在一组独立的300份报告(其中20%为应报告报告)上进行评估。结果表明,该系统能够以96%和100%的特异性可靠地分类应报告报告,并且在开发集和评估集的单位记录级别上,对应报告报告中的通知项目进行分类的总体准确率分别为82%和74%。

结论

癌症登记处收集大量需要人工审核的数据,这减缓了信息流。提取并提供自动编码的癌症病理通知以供审核可以减少对专家临床工作人员的依赖,提高癌症信息的效率和可用性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验