在临床决策支持中使用自然语言处理和数据挖掘方法关联乳腺钼靶检查和病理检查结果。

Correlating mammographic and pathologic findings in clinical decision support using natural language processing and data mining methods.

作者信息

Patel Tejal A, Puppala Mamta, Ogunti Richard O, Ensor Joe E, He Tiancheng, Shewale Jitesh B, Ankerst Donna P, Kaklamani Virginia G, Rodriguez Angel A, Wong Stephen T C, Chang Jenny C

机构信息

Houston Methodist Cancer Center, Houston, Texas.

Cancer Research Program, Houston Methodist Research Institute, Houston, Texas.

出版信息

Cancer. 2017 Jan 1;123(1):114-121. doi: 10.1002/cncr.30245. Epub 2016 Aug 29.

DOI:10.1002/cncr.30245

PMID:27571243

Abstract

BACKGROUND

A key challenge to mining electronic health records for mammography research is the preponderance of unstructured narrative text, which strikingly limits usable output. The imaging characteristics of breast cancer subtypes have been described previously, but without standardization of parameters for data mining.

METHODS

The authors searched the enterprise-wide data warehouse at the Houston Methodist Hospital, the Methodist Environment for Translational Enhancement and Outcomes Research (METEOR), for patients with Breast Imaging Reporting and Data System (BI-RADS) category 5 mammogram readings performed between January 2006 and May 2015 and an available pathology report. The authors developed natural language processing (NLP) software algorithms to automatically extract mammographic and pathologic findings from free text mammogram and pathology reports. The correlation between mammographic imaging features and breast cancer subtype was analyzed using one-way analysis of variance and the Fisher exact test.

RESULTS

The NLP algorithm was able to obtain key characteristics for 543 patients who met the inclusion criteria. Patients with estrogen receptor-positive tumors were more likely to have spiculated margins (P = .0008), and those with tumors that overexpressed human epidermal growth factor receptor 2 (HER2) were more likely to have heterogeneous and pleomorphic calcifications (P = .0078 and P = .0002, respectively).

CONCLUSIONS

Mammographic imaging characteristics, obtained from an automated text search and the extraction of mammogram reports using NLP techniques, correlated with pathologic breast cancer subtype. The results of the current study validate previously reported trends assessed by manual data collection. Furthermore, NLP provides an automated means with which to scale up data extraction and analysis for clinical decision support. Cancer 2017;114-121. © 2016 American Cancer Society.

摘要

背景

在利用电子健康记录进行乳房X光摄影研究时，一个关键挑战是存在大量非结构化的叙述性文本，这极大地限制了可用输出。先前已描述了乳腺癌亚型的影像学特征，但数据挖掘参数未实现标准化。

方法

作者在休斯顿卫理公会医院的企业级数据仓库——卫理公会转化增强与结果研究环境（METEOR）中，搜索了2006年1月至2015年5月期间进行乳房影像报告和数据系统（BI-RADS）5类乳房X光摄影读数且有可用病理报告的患者。作者开发了自然语言处理（NLP）软件算法，以从乳房X光摄影和病理报告的自由文本中自动提取乳房X光摄影和病理结果。使用单因素方差分析和Fisher精确检验分析乳房X光摄影特征与乳腺癌亚型之间的相关性。

结果

NLP算法能够为543名符合纳入标准的患者获取关键特征。雌激素受体阳性肿瘤患者更有可能出现毛刺状边缘（P = .0008），而人表皮生长因子受体2（HER2）过表达肿瘤患者更有可能出现不均匀及多形性钙化（分别为P = .0078和P = .0002）。

结论

相似文献

Correlating mammographic and pathologic findings in clinical decision support using natural language processing and data mining methods.

Cancer. 2017 Jan 1;123(1):114-121. doi: 10.1002/cncr.30245. Epub 2016 Aug 29.

Automatic abstraction of imaging observations with their characteristics from mammography reports.

J Am Med Inform Assoc. 2015 Apr;22(e1):e81-92. doi: 10.1136/amiajnl-2014-003009. Epub 2014 Oct 28.

Using automatically extracted information from mammography reports for decision-support.

J Biomed Inform. 2016 Aug;62:224-31. doi: 10.1016/j.jbi.2016.07.001. Epub 2016 Jul 4.

Using natural language processing to extract mammographic findings.

J Biomed Inform. 2015 Apr;54:77-84. doi: 10.1016/j.jbi.2015.01.010. Epub 2015 Feb 3.

A Deep Learning-Based Decision Support Tool for Precision Risk Assessment of Breast Cancer.

JCO Clin Cancer Inform. 2019 May;3:1-12. doi: 10.1200/CCI.18.00121.

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.

Chin Med J (Engl). 2019 Jul 20;132(14):1673-1680. doi: 10.1097/CM9.0000000000000301.

The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports.

BMC Med Inform Decis Mak. 2019 Dec 30;19(1):288. doi: 10.1186/s12911-019-0997-3.

HER2-positive breast cancer patients: correlation between mammographic and pathological findings.

Radiat Prot Dosimetry. 2014 Nov;162(1-2):125-8. doi: 10.1093/rpd/ncu243. Epub 2014 Jul 25.

Automated detection of ambiguity in BI-RADS assessment categories in mammography reports.

Stud Health Technol Inform. 2014;197:35-9.

Is there a correlation between breast cancer molecular subtype using receptors as surrogates and mammographic appearance?

Ann Surg Oncol. 2013 Oct;20(10):3247-53. doi: 10.1245/s10434-013-3155-7. Epub 2013 Aug 22.

引用本文的文献

TECRR: a benchmark dataset of radiological reports for BI-RADS classification with machine learning, deep learning, and large language model baselines.

BMC Med Inform Decis Mak. 2024 Oct 24;24(1):310. doi: 10.1186/s12911-024-02717-7.

Artificial intelligence methods available for cancer research.

Front Med. 2024 Oct;18(5):778-797. doi: 10.1007/s11684-024-1085-3. Epub 2024 Aug 8.

Applications of natural language processing tools in the surgical journey.

Front Surg. 2024 May 17;11:1403540. doi: 10.3389/fsurg.2024.1403540. eCollection 2024.

Theory of radiologist interaction with instant messaging decision support tools: A sequential-explanatory study.

PLOS Digit Health. 2024 Feb 26;3(2):e0000297. doi: 10.1371/journal.pdig.0000297. eCollection 2024 Feb.

Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review.

JMIR Med Inform. 2023 Dec 15;11:e42477. doi: 10.2196/42477.

Information extraction from German radiological reports for general clinical text and language understanding.

Sci Rep. 2023 Feb 9;13(1):2353. doi: 10.1038/s41598-023-29323-3.

Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.

JCO Clin Cancer Inform. 2022 Jul;6:e2200006. doi: 10.1200/CCI.22.00006.

Proposal of a novel Artificial Intelligence Distribution Service platform for healthcare.

F1000Res. 2021 Mar 26;10:245. doi: 10.12688/f1000research.36775.1. eCollection 2021.

Clinico-radio-pathological Features and Biological Behavior of Breast Cancer in Young Indian Women: A Prospective Study.

Indian J Radiol Imaging. 2021 Apr;31(2):323-332. doi: 10.1055/s-0041-1734342. Epub 2021 Jul 30.

A systematic review of natural language processing applied to radiology reports.

BMC Med Inform Decis Mak. 2021 Jun 3;21(1):179. doi: 10.1186/s12911-021-01533-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在临床决策支持中使用自然语言处理和数据挖掘方法关联乳腺钼靶检查和病理检查结果。

Correlating mammographic and pathologic findings in clinical decision support using natural language processing and data mining methods.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献