文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

使用分类模型确定结直肠癌患者肝脏放射学报告的价值。

Using a classification model for determining the value of liver radiological reports of patients with colorectal cancer.

作者信息

Liu Wenjuan, Zhang Xi, Lv Han, Li Jia, Liu Yawen, Yang Zhenghan, Weng Xutao, Lin Yucong, Song Hong, Wang Zhenchang

机构信息

Department of Radiology, Beijing Friendship Hospital, Capital Medical University, Beijing, China.

School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China.

出版信息

Front Oncol. 2022 Nov 21;12:913806. doi: 10.3389/fonc.2022.913806. eCollection 2022.


DOI:10.3389/fonc.2022.913806
PMID:36479085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9720132/
Abstract

BACKGROUND: Medical imaging is critical in clinical practice, and high value radiological reports can positively assist clinicians. However, there is a lack of methods for determining the value of reports. OBJECTIVE: The purpose of this study was to establish an ensemble learning classification model using natural language processing (NLP) applied to the Chinese free text of radiological reports to determine their value for liver lesion detection in patients with colorectal cancer (CRC). METHODS: Radiological reports of upper abdominal computed tomography (CT) and magnetic resonance imaging (MRI) were divided into five categories according to the results of liver lesion detection in patients with CRC. The NLP methods including word segmentation, stop word removal, and n-gram language model establishment were applied for each dataset. Then, a word-bag model was built, high-frequency words were selected as features, and an ensemble learning classification model was constructed. Several machine learning methods were applied, including logistic regression (LR), random forest (RF), and so on. We compared the accuracy between priori choosing pertinent word strings and our machine language methodologies. RESULTS: The dataset of 2790 patients included CT without contrast (10.2%), CT with/without contrast (73.3%), MRI without contrast (1.8%), and MRI with/without contrast (14.6%). The ensemble learning classification model determined the value of reports effectively, reaching 95.91% in the CT with/without contrast dataset using XGBoost. The logistic regression, random forest, and support vector machine also achieved good classification accuracy, reaching 95.89%, 95.04%, and 95.00% respectively. The results of XGBoost were visualized using a confusion matrix. The numbers of errors in categories I, II and V were very small. ELI5 was used to select important words for each category. Words such as "no abnormality", "suggest", "fatty liver", and "transfer" showed a relatively large degree of positive correlation with classification accuracy. The accuracy based on string pattern search method model was lower than that of machine learning. CONCLUSIONS: The learning classification model based on NLP was an effective tool for determining the value of radiological reports focused on liver lesions. The study made it possible to analyze the value of medical imaging examinations on a large scale.

摘要

背景:医学影像在临床实践中至关重要,高价值的放射学报告能够为临床医生提供积极帮助。然而,目前缺乏确定报告价值的方法。 目的:本研究旨在建立一种集成学习分类模型,利用自然语言处理(NLP)技术处理放射学报告的中文自由文本,以确定其对结直肠癌(CRC)患者肝脏病变检测的价值。 方法:根据CRC患者肝脏病变检测结果,将上腹部计算机断层扫描(CT)和磁共振成像(MRI)的放射学报告分为五类。对每个数据集应用包括分词、停用词去除和n-gram语言模型建立在内的NLP方法。然后,构建词袋模型,选择高频词作为特征,并构建集成学习分类模型。应用了几种机器学习方法,包括逻辑回归(LR)、随机森林(RF)等。我们比较了预先选择相关词串与我们的机器学习方法之间的准确性。 结果:2790例患者的数据集包括平扫CT(10.2%)、增强/平扫CT(73.3%)、平扫MRI(1.8%)和增强/平扫MRI(14.6%)。集成学习分类模型有效地确定了报告的价值,在增强/平扫CT数据集中使用XGBoost时达到了95.91%。逻辑回归、随机森林和支持向量机也取得了良好的分类准确率,分别达到95.89%、95.04%和95.00%。使用混淆矩阵对XGBoost的结果进行了可视化。I、II和V类中的错误数量非常少。使用ELI5为每个类别选择重要词汇。“无异常”“提示”“脂肪肝”和“转移”等词汇与分类准确率呈现出相对较大程度的正相关。基于字符串模式搜索方法模型的准确率低于机器学习方法。 结论:基于NLP的学习分类模型是确定聚焦肝脏病变的放射学报告价值的有效工具。该研究使得大规模分析医学影像检查的价值成为可能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/9625e5032260/fonc-12-913806-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/dde8a1168f5e/fonc-12-913806-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/fc70449fc1af/fonc-12-913806-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/34aa53d265bc/fonc-12-913806-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/642209c86002/fonc-12-913806-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/422f53138181/fonc-12-913806-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/9625e5032260/fonc-12-913806-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/dde8a1168f5e/fonc-12-913806-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/fc70449fc1af/fonc-12-913806-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/34aa53d265bc/fonc-12-913806-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/642209c86002/fonc-12-913806-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/422f53138181/fonc-12-913806-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1837/9720132/9625e5032260/fonc-12-913806-g006.jpg

相似文献

[1]
Using a classification model for determining the value of liver radiological reports of patients with colorectal cancer.

Front Oncol. 2022-11-21

[2]
Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.

Sovrem Tekhnologii Med. 2024

[3]
Automatic medical protocol classification using machine learning approaches.

Comput Methods Programs Biomed. 2021-3

[4]
Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI.

J Pers Med. 2020-12-16

[5]
Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data.

J Digit Imaging. 2022-10

[6]
Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.

J Digit Imaging. 2018-4

[7]
The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports.

BMC Med Inform Decis Mak. 2019-12-30

[8]
Natural language processing and machine learning approaches for food categorization and nutrition quality prediction compared with traditional methods.

Am J Clin Nutr. 2023-3

[9]
Natural Language Processing for the Identification of Silent Brain Infarcts From Neuroimaging Reports.

JMIR Med Inform. 2019-4-21

[10]
Using Natural Language Processing and Machine Learning to Preoperatively Predict Lymph Node Metastasis for Non-Small Cell Lung Cancer With Electronic Medical Records: Development and Validation Study.

JMIR Med Inform. 2022-4-25

引用本文的文献

[1]
A foundation systematic review of natural language processing applied to gastroenterology & hepatology.

BMC Gastroenterol. 2025-2-6

[2]
Benefits and Risks of AI in Health Care: Narrative Review.

Interact J Med Res. 2024-11-18

本文引用的文献

[1]
Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).

BMC Med Inform Decis Mak. 2022-7-30

[2]
Using of n-grams from morphological tags for fake news classification.

PeerJ Comput Sci. 2021-7-19

[3]
A systematic review of natural language processing applied to radiology reports.

BMC Med Inform Decis Mak. 2021-6-3

[4]
-Gram based language processing using Twitter dataset to identify COVID-19 patients.

Sustain Cities Soc. 2021-9

[5]
Text classification models for the automatic detection of nonmedical prescription medication use from social media.

BMC Med Inform Decis Mak. 2021-1-26

[6]
Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework.

J Med Internet Res. 2021-1-12

[7]
Analysis of Stroke Detection during the COVID-19 Pandemic Using Natural Language Processing of Radiology Reports.

AJNR Am J Neuroradiol. 2021-3

[8]
Domain specific word embeddings for natural language processing in radiology.

J Biomed Inform. 2021-1

[9]
Can natural language processing help differentiate inflammatory intestinal diseases in China? Models applying random forest and convolutional neural network approaches.

BMC Med Inform Decis Mak. 2020-9-29

[10]
National guidelines for diagnosis and treatment of colorectal cancer 2020 in China (English version).

Chin J Cancer Res. 2020-8

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索