• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

监督机器学习和主动学习在放射科报告分类中的应用。

Supervised machine learning and active learning in classification of radiology reports.

机构信息

School of Information Technologies, University of Sydney, Sydney, New South Wales, Australia.

出版信息

J Am Med Inform Assoc. 2014 Sep-Oct;21(5):893-901. doi: 10.1136/amiajnl-2013-002516. Epub 2014 May 22.

DOI:10.1136/amiajnl-2013-002516
PMID:24853067
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4147614/
Abstract

OBJECTIVE

This paper presents an automated system for classifying the results of imaging examinations (CT, MRI, positron emission tomography) into reportable and non-reportable cancer cases. This system is part of an industrial-strength processing pipeline built to extract content from radiology reports for use in the Victorian Cancer Registry.

MATERIALS AND METHODS

In addition to traditional supervised learning methods such as conditional random fields and support vector machines, active learning (AL) approaches were investigated to optimize training production and further improve classification performance. The project involved two pilot sites in Victoria, Australia (Lake Imaging (Ballarat) and Peter MacCallum Cancer Centre (Melbourne)) and, in collaboration with the NSW Central Registry, one pilot site at Westmead Hospital (Sydney).

RESULTS

The reportability classifier performance achieved 98.25% sensitivity and 96.14% specificity on the cancer registry's held-out test set. Up to 92% of training data needed for supervised machine learning can be saved by AL.

DISCUSSION

AL is a promising method for optimizing the supervised training production used in classification of radiology reports. When an AL strategy is applied during the data selection process, the cost of manual classification can be reduced significantly.

CONCLUSIONS

The most important practical application of the reportability classifier is that it can dramatically reduce human effort in identifying relevant reports from the large imaging pool for further investigation of cancer. The classifier is built on a large real-world dataset and can achieve high performance in filtering relevant reports to support cancer registries.

摘要

目的

本文提出了一种自动系统,用于将影像学检查(CT、MRI、正电子发射断层扫描)的结果分类为可报告和不可报告的癌症病例。该系统是一个工业强度处理管道的一部分,该管道用于从放射学报告中提取内容,供维多利亚癌症登记处使用。

材料与方法

除了条件随机场和支持向量机等传统监督学习方法外,还研究了主动学习(AL)方法,以优化训练生产并进一步提高分类性能。该项目涉及澳大利亚维多利亚州的两个试点站点(Lake Imaging(巴拉腊特)和 Peter MacCallum Cancer Centre(墨尔本)),并与新南威尔士州中央登记处合作,在 Westmead 医院(悉尼)进行了一个试点。

结果

可报告性分类器在癌症登记处的保留测试集上实现了 98.25%的敏感性和 96.14%的特异性。主动学习可节省高达 92%的监督机器学习所需的训练数据。

讨论

主动学习是优化分类放射学报告中使用的监督训练生产的一种很有前途的方法。当在数据选择过程中应用 AL 策略时,可以显著减少手动分类的成本。

结论

可报告性分类器的最重要实际应用是,它可以大大减少从大型成像池中识别相关报告以进一步调查癌症的人力。该分类器建立在一个大型真实数据集上,可以实现高性能的相关报告筛选,以支持癌症登记处。

相似文献

1
Supervised machine learning and active learning in classification of radiology reports.监督机器学习和主动学习在放射科报告分类中的应用。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):893-901. doi: 10.1136/amiajnl-2013-002516. Epub 2014 May 22.
2
Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能:一种估计诊断率的工具
AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.
3
Using NLP to identify cancer cases in imaging reports drawn from radiology information systems.利用自然语言处理技术从放射学信息系统提取的影像报告中识别癌症病例。
Stud Health Technol Inform. 2013;188:91-4.
4
Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management.基于拉普拉斯支持向量机的半监督临床文本分类:在癌症病例管理中的应用。
J Biomed Inform. 2013 Oct;46(5):869-75. doi: 10.1016/j.jbi.2013.06.014. Epub 2013 Jul 8.
5
Automated outcome classification of emergency department computed tomography imaging reports.急诊 CT 影像报告的自动化结果分类。
Acad Emerg Med. 2013 Aug;20(8):848-54. doi: 10.1111/acem.12174.
6
Accurate determination of imaging modality using an ensemble of text- and image-based classifiers.使用基于文本和图像的分类器集成来准确确定成像方式。
J Digit Imaging. 2012 Feb;25(1):37-42. doi: 10.1007/s10278-011-9399-5.
7
Supervised and unsupervised methods for prostate cancer segmentation with multispectral MRI.多光谱 MRI 前列腺癌分割的有监督和无监督方法。
Med Phys. 2010 Apr;37(4):1873-83. doi: 10.1118/1.3359459.
8
Classification of Benign and Malignant Breast Masses on Mammograms for Large Datasets using Core Vector Machines.基于核向量机的大样本乳腺钼靶图像良恶性肿块分类
Curr Med Imaging. 2020;16(6):703-710. doi: 10.2174/1573405615666190801121506.
9
Combined unsupervised-supervised classification of multiparametric PET/MRI data: application to prostate cancer.多参数PET/MRI数据的无监督-监督联合分类:在前列腺癌中的应用
NMR Biomed. 2015 Jul;28(7):914-22. doi: 10.1002/nbm.3329. Epub 2015 May 26.
10
Automatic medical protocol classification using machine learning approaches.使用机器学习方法进行自动医疗协议分类。
Comput Methods Programs Biomed. 2021 Mar;200:105939. doi: 10.1016/j.cmpb.2021.105939. Epub 2021 Jan 16.

引用本文的文献

1
Dataset from a human-in-the-loop approach to identify functionally important protein residues from literature.来自人机交互方法的数据集,用于从文献中识别功能重要的蛋白质残基。
Sci Data. 2024 Sep 27;11(1):1032. doi: 10.1038/s41597-024-03841-9.
2
Utilizing active learning strategies in machine-assisted annotation for clinical named entity recognition: a comprehensive analysis considering annotation costs and target effectiveness.利用主动学习策略在机器辅助标注中进行临床命名实体识别:考虑标注成本和目标效果的综合分析。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2632-2640. doi: 10.1093/jamia/ocae197.
3
From data to insights: how natural language processing and structured reporting advance data-driven radiology.从数据到洞察:自然语言处理与结构化报告如何推动数据驱动的放射学发展
Eur Radiol. 2023 Nov;33(11):7494-7495. doi: 10.1007/s00330-023-10242-w. Epub 2023 Oct 2.
4
Multi-center Integrating Radiomics, Structured Reports, and Machine Learning Algorithms for Assisted Classification of COVID-19 in Lung Computed Tomography.多中心整合影像组学、结构化报告和机器学习算法用于肺部计算机断层扫描中新型冠状病毒肺炎的辅助分类
J Med Biol Eng. 2023;43(2):156-162. doi: 10.1007/s40846-023-00781-4. Epub 2023 Mar 7.
5
Reviewing the relationship between machines and radiology: the application of artificial intelligence.审视机器与放射学之间的关系:人工智能的应用
Acta Radiol Open. 2021 Feb 9;10(2):2058460121990296. doi: 10.1177/2058460121990296. eCollection 2021 Feb.
6
Deep learning to convert unstructured CT pulmonary angiography reports into structured reports.深度学习将非结构化CT肺血管造影报告转换为结构化报告。
Eur Radiol Exp. 2019 Sep 23;3(1):37. doi: 10.1186/s41747-019-0118-1.
7
Active deep learning for the identification of concepts and relations in electroencephalography reports.主动深度学习在脑电图报告中概念和关系的识别。
J Biomed Inform. 2019 Oct;98:103265. doi: 10.1016/j.jbi.2019.103265. Epub 2019 Aug 27.
8
Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment.用于评估前列腺癌治疗后以患者为中心的结果的弱监督自然语言处理
JAMIA Open. 2019 Apr;2(1):150-159. doi: 10.1093/jamiaopen/ooy057. Epub 2019 Jan 4.
9
Automatic inference of BI-RADS final assessment categories from narrative mammography report findings.从叙事性乳腺 X 线报告结果自动推断 BI-RADS 最终评估类别。
J Biomed Inform. 2019 Apr;92:103137. doi: 10.1016/j.jbi.2019.103137. Epub 2019 Feb 23.
10
Intelligent Word Embeddings of Free-Text Radiology Reports.自由文本放射学报告的智能词嵌入
AMIA Annu Symp Proc. 2018 Apr 16;2017:411-420. eCollection 2017.

本文引用的文献

1
Applying active learning to high-throughput phenotyping algorithms for electronic health records data.将主动学习应用于电子健康记录数据的高通量表型算法。
J Am Med Inform Assoc. 2013 Dec;20(e2):e253-9. doi: 10.1136/amiajnl-2013-001945. Epub 2013 Jul 13.
2
Applying active learning to supervised word sense disambiguation in MEDLINE.将主动学习应用于 MEDLINE 中的监督词义消歧。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):1001-6. doi: 10.1136/amiajnl-2012-001244. Epub 2013 Jan 30.
3
Active learning for clinical text classification: is it better than random sampling?主动学习在临床文本分类中的应用:它比随机抽样更好吗?
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):809-16. doi: 10.1136/amiajnl-2011-000648. Epub 2012 Jun 15.
4
Systematized nomenclature of medicine clinical terms (SNOMED CT) to represent computed tomography procedures.系统医学术语命名法(SNOMED CT)用于表示计算机断层扫描程序。
Comput Methods Programs Biomed. 2011 Mar;101(3):324-9. doi: 10.1016/j.cmpb.2011.01.002.
5
Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.从非结构化 MRI 报告中辨别肿瘤状态——现有报告中信息的完整性和自动化自然语言处理的实用性。
J Digit Imaging. 2010 Apr;23(2):119-32. doi: 10.1007/s10278-009-9215-7. Epub 2009 May 30.
6
Optimal training sets for Bayesian prediction of MeSH assignment.用于医学主题词(MeSH)分配贝叶斯预测的最优训练集。
J Am Med Inform Assoc. 2008 Jul-Aug;15(4):546-53. doi: 10.1197/jamia.M2431. Epub 2008 Apr 24.
7
Collection of cancer stage data by classifying free-text medical reports.通过对自由文本医学报告进行分类来收集癌症分期数据。
J Am Med Inform Assoc. 2007 Nov-Dec;14(6):736-45. doi: 10.1197/jamia.M2130. Epub 2007 Aug 21.
8
Automated computer-assisted categorization of radiology reports.放射学报告的自动化计算机辅助分类
AJR Am J Roentgenol. 2005 Feb;184(2):687-90. doi: 10.2214/ajr.184.2.01840687.
9
Application of recently developed computer algorithm for automatic classification of unstructured radiology reports: validation study.近期开发的用于非结构化放射学报告自动分类的计算机算法的应用:验证研究
Radiology. 2005 Feb;234(2):323-9. doi: 10.1148/radiol.2341040049. Epub 2004 Dec 10.