Suppr超能文献

神经信息提取算法在不同机构间具有通用性吗?

Do Neural Information Extraction Algorithms Generalize Across Institutions?

作者信息

Santus Enrico, Li Clara, Yala Adam, Peck Donald, Soomro Rufina, Faridi Naveen, Mamshad Isra, Tang Rong, Lanahan Conor R, Barzilay Regina, Hughes Kevin

机构信息

Massachusetts Institute of Technology, Cambridge, MA.

Henry Ford Health System, Detroit, MI.

出版信息

JCO Clin Cancer Inform. 2019 Jul;3:1-8. doi: 10.1200/CCI.18.00160.

Abstract

PURPOSE

Natural language processing (NLP) techniques have been adopted to reduce the curation costs of electronic health records. However, studies have questioned whether such techniques can be applied to data from previously unseen institutions. We investigated the performance of a common neural NLP algorithm on data from both known and heldout (ie, institutions whose data were withheld from the training set and only used for testing) hospitals. We also explored how diversity in the training data affects the system's generalization ability.

METHODS

We collected 24,881 breast pathology reports from seven hospitals and manually annotated them with nine key attributes that describe types of atypia and cancer. We trained a convolutional neural network (CNN) on annotations from either only one (CNN1), only two (CNN2), or only four (CNN4) hospitals. The trained systems were tested on data from five organizations, including both known and heldout ones. For every setting, we provide the accuracy scores as well as the learning curves that show how much data are necessary to achieve good performance and generalizability.

RESULTS

The system achieved a cross-institutional accuracy of 93.87% when trained on reports from only one hospital (CNN1). Performance improved to 95.7% and 96%, respectively, when the system was trained on reports from two (CNN2) and four (CNN4) hospitals. The introduction of diversity during training did not lead to improvements on the known institutions, but it boosted performance on the heldout institutions. When tested on reports from heldout hospitals, CNN4 outperformed CNN1 and CNN2 by 2.13% and 0.3%, respectively.

CONCLUSION

Real-world scenarios require that neural NLP approaches scale to data from previously unseen institutions. We show that a common neural NLP algorithm for information extraction can achieve this goal, especially when diverse data are used during training.

摘要

目的

自然语言处理(NLP)技术已被用于降低电子健康记录的整理成本。然而,研究人员质疑这些技术是否可应用于来自未知机构的数据。我们研究了一种常见的神经NLP算法在已知医院和保留医院(即数据未包含在训练集中,仅用于测试的机构)的数据上的性能。我们还探讨了训练数据的多样性如何影响系统的泛化能力。

方法

我们从七家医院收集了24,881份乳腺病理报告,并手动标注了九个关键属性,这些属性描述了异型性和癌症的类型。我们在仅来自一家医院(CNN1)、仅来自两家医院(CNN2)或仅来自四家医院(CNN4)的标注数据上训练了一个卷积神经网络(CNN)。训练好的系统在包括已知和保留机构在内的五个组织的数据上进行了测试。对于每种设置,我们提供了准确率得分以及学习曲线,这些曲线展示了要实现良好性能和泛化能力需要多少数据。

结果

当仅在一家医院(CNN1)的报告上进行训练时,该系统实现了93.87%的跨机构准确率。当系统在两家医院(CNN2)和四家医院(CNN4)的报告上进行训练时,性能分别提高到了95.7%和96%。在训练过程中引入多样性并没有提高在已知机构上的性能,但提高了在保留机构上的性能。当在保留医院的报告上进行测试时,CNN4分别比CNN1和CNN2的性能高出2.13%和0.3%。

结论

现实世界的场景要求神经NLP方法能够扩展到来自未知机构的数据。我们表明,一种用于信息提取的常见神经NLP算法可以实现这一目标,特别是在训练过程中使用多样化数据时。

相似文献

1
Do Neural Information Extraction Algorithms Generalize Across Institutions?
JCO Clin Cancer Inform. 2019 Jul;3:1-8. doi: 10.1200/CCI.18.00160.
2
A comparison of word embeddings for the biomedical natural language processing.
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
4
Ensemble method-based extraction of medication and related information from clinical texts.
J Am Med Inform Assoc. 2020 Jan 1;27(1):31-38. doi: 10.1093/jamia/ocz100.
6
A method for cohort selection of cardiovascular disease records from an electronic health record system.
Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30.
7
Using natural language processing to extract clinically useful information from Chinese electronic medical records.
Int J Med Inform. 2019 Apr;124:6-12. doi: 10.1016/j.ijmedinf.2019.01.004. Epub 2019 Jan 7.
8
Machine learning to parse breast pathology reports in Chinese.
Breast Cancer Res Treat. 2018 Jun;169(2):243-250. doi: 10.1007/s10549-018-4668-3. Epub 2018 Jan 29.
9
Extracting important information from Chinese Operation Notes with natural language processing methods.
J Biomed Inform. 2014 Apr;48:130-6. doi: 10.1016/j.jbi.2013.12.017. Epub 2014 Jan 31.
10
Extracting lung cancer staging descriptors from pathology reports: A generative language model approach.
J Biomed Inform. 2024 Sep;157:104720. doi: 10.1016/j.jbi.2024.104720. Epub 2024 Sep 2.

引用本文的文献

2
Generalization of finetuned transformer language models to new clinical contexts.
JAMIA Open. 2023 Aug 16;6(3):ooad070. doi: 10.1093/jamiaopen/ooad070. eCollection 2023 Oct.
4
Quantification of BERT Diagnosis Generalizability Across Medical Specialties Using Semantic Dataset Distance.
AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:345-354. eCollection 2021.
5
Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.
JCO Clin Cancer Inform. 2021 May;5:550-560. doi: 10.1200/CCI.20.00139.

本文引用的文献

1
Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study.
PLoS Med. 2018 Nov 6;15(11):e1002683. doi: 10.1371/journal.pmed.1002683. eCollection 2018 Nov.
2
Using machine learning to parse breast pathology reports.
Breast Cancer Res Treat. 2017 Jan;161(2):203-211. doi: 10.1007/s10549-016-4035-1. Epub 2016 Nov 8.
3
Information extraction from multi-institutional radiology reports.
Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.
4
Validation of natural language processing to extract breast cancer pathology procedures and results.
J Pathol Inform. 2015 Jun 23;6:38. doi: 10.4103/2153-3539.159215. eCollection 2015.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验