ChatGPT as an effective tool for quality evaluation of radiomics research.

Authors

Mese Ismail, Kocak Burak

Affiliations

Department of Radiology, Erenkoy Mental Health and Neurology Training and Research Hospital, University of Health Sciences, Istanbul, Turkey.

Department of Radiology, Basaksehir Cam and Sakura City Hospital, University of Health Sciences, Istanbul, Turkey.

Publication information

Eur Radiol. 2025 Apr;35(4):2030-2042. doi: 10.1007/s00330-024-11122-7. Epub 2024 Oct 15.


DOI: 10.1007/s00330-024-11122-7
PMID: 39406959
Abstract

OBJECTIVES: This study aimed to evaluate the effectiveness of ChatGPT-4o in assessing the methodological quality of radiomics research using the radiomics quality score (RQS) compared to human experts.

METHODS: Published in European Radiology, European Radiology Experimental, and Insights into Imaging between 2023 and 2024, open-access and peer-reviewed radiomics research articles with creative commons attribution license (CC-BY) were included in this study. Pre-prints from MedRxiv were also included to evaluate potential peer-review bias. Using the RQS, each study was independently assessed twice by ChatGPT-4o and by two radiologists with consensus.

RESULTS: In total, 52 open-access and peer-reviewed articles were included in this study. Both ChatGPT-4o evaluation (average of two readings) and human experts had a median RQS of 14.5 (40.3% percentage score) (p > 0.05). Pairwise comparisons revealed no statistically significant difference between the readings of ChatGPT and human experts (corrected p > 0.05). The intraclass correlation coefficient for intra-rater reliability of ChatGPT-4o was 0.905 (95% CI: 0.840-0.944), and those for inter-rater reliability with human experts for each evaluation of ChatGPT-4o were 0.859 (95% CI: 0.756-0.919) and 0.914 (95% CI: 0.855-0.949), corresponding to good to excellent reliability for all. The evaluation by ChatGPT-4o took less time (2.9-3.5 min per article) compared to human experts (13.9 min per article by one reader). Item-wise reliability analysis showed ChatGPT-4o maintained consistently high reliability across almost all RQS items.

CONCLUSION: ChatGPT-4o provides reliable and efficient assessments of radiomics research quality. Its evaluations closely align with those of human experts and reduce evaluation time.

KEY POINTS:
Question: Is ChatGPT effective and reliable in evaluating radiomics research quality based on RQS?
Findings: ChatGPT-4o showed high reliability and efficiency, with evaluations closely matching human experts. It can considerably reduce the time required for radiomics research quality assessment.
Clinical relevance: ChatGPT-4o offers a quick and reliable automated alternative for evaluating the quality of radiomics research, with the potential to assess radiomics research at a large scale in the future.
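
The percentage score and the time saving quoted in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: it assumes the usual RQS convention of a 36-point maximum with negative totals mapped to 0%, and the names `RQS_MAX` and `rqs_percentage` are hypothetical.

```python
# Assumption: the RQS total has a maximum of 36 points, and totals
# below zero are reported as 0% (the usual RQS convention).
RQS_MAX = 36


def rqs_percentage(score: float, max_score: float = RQS_MAX) -> float:
    """Convert a raw RQS total into a percentage score."""
    return max(score, 0.0) / max_score * 100.0


# Median RQS reported in the abstract for both ChatGPT-4o and the human experts.
median_pct = rqs_percentage(14.5)

# Reading times per article reported in the abstract (minutes).
human_minutes = 13.9
chatgpt_minutes = (2.9, 3.5)
speedups = [human_minutes / m for m in chatgpt_minutes]

print(f"Median percentage score: {median_pct:.1f}%")
print("Human-to-ChatGPT time ratio: "
      + ", ".join(f"{s:.1f}x" for s in speedups))
```

Under these assumptions, a median of 14.5 corresponds to the 40.3% given in the abstract, and the single human reader took roughly four to five times as long per article as ChatGPT-4o.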
