文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Interobserver agreement between artificial intelligence models in the thyroid imaging and reporting data system (TIRADS) assessment of thyroid nodules.

作者信息

Leoncini Andrea, Trimboli Pierpaolo

机构信息

Clinic for Radiology, Imaging Institute of Southern Switzerland, Ente Ospedaliero Cantonale (EOC), Bellinzona, Switzerland.

Thyroid Unit, Clinic for Endocrinology and Diabetology, Ente Ospedaliero Cantonale (EOC), Bellinzona, Switzerland.

出版信息

Endocrine. 2025 May 15. doi: 10.1007/s12020-025-04272-1.


DOI:10.1007/s12020-025-04272-1
PMID:40372631
Abstract

BACKGROUND: As ultrasound (US) is the most accurate tool for assessing the thyroid nodule (TN) risk of malignancy (RoM), international societies have published various Thyroid Imaging and Reporting Data Systems (TIRADSs). With the recent advent of artificial intelligence (AI), clinicians and researchers should ask themselves how AI could interpret the terminology of the TIRADSs and whether or not AIs agree in the risk assessment of TNs. The study aim was to analyze the interobserver agreement (IOA) between AIs in assessing the RoM of TNs across various TIRADSs categories using a cases series created combining TIRADSs descriptors. METHODS: ChatGPT, Google Gemini, and Claude were compared. ACR-TIRADS, EU-TIRADS, and K-TIRADS, were employed to evaluate the AI assessment. Multiple written scenarios for the three TIRADS were created, the cases were evaluated by the three AIs, and their assessments were analyzed and compared. The IOA was estimated by comparing the kappa (κ) values. RESULTS: Ninety scenarios were created. With ACR-TIRADS the IOA analysis gave κ = 0.58 between ChatGPT and Gemini, 0.53 between ChatGPT and Claude, and 0.90 between Gemini and Claude. With EU-TIRADS it was observed κ value = 0.73 between ChatGPT and Gemini, 0.62 between ChatGPT and Claude, and 0.72 between Gemini and Claude. With K-TIRADS it was found κ = 0.88 between ChatGPT and Gemini, 0.70 between ChatGPT and Claude, and 0.61 between Gemini and Claude. CONCLUSION: This study found that there were non-negligible variability between the three AIs. Clinicians and patients should be aware of these new findings.

摘要

相似文献

[1]
Interobserver agreement between artificial intelligence models in the thyroid imaging and reporting data system (TIRADS) assessment of thyroid nodules.

Endocrine. 2025-5-15

[2]
Analysis of FNAC indication in thyroid nodules assessed as low risk according to various TIRADSs.

Eur Thyroid J. 2025-7-10

[3]
Evaluation of concordance between the Bethesda System for Reporting Thyroid Cytopathology 2023 (TBSRTC) and ACR-TIRADS at a tertiary care center in Gujarat.

Indian J Pathol Microbiol. 2025-3-13

[4]
New Thyroid Imaging Reporting and Data System (TIRADS) Based on Ultrasonography Features for Follicular Thyroid Neoplasms: A Multicenter Study.

Ultrasound Med Biol. 2025-8

[5]
Comparison of the diagnostic performance of the artificial intelligence-based TIRADS algorithm with established classification systems for thyroid nodules.

Diagn Interv Radiol. 2025-9-1

[6]
Thyroid Nodule Characterization: Which Thyroid Imaging Reporting and Data System (TIRADS) Is More Accurate? A Comparison Between Radiologists with Different Experiences and Artificial Intelligence Software.

Diagnostics (Basel). 2025-8-21

[7]
Performance of computer scientists in the assessment of thyroid nodules using TIRADS lexicons.

J Endocrinol Invest. 2025-4

[8]
Diagnostic Performance of Six Ultrasound Risk Stratification Systems for Thyroid Nodules: A Systematic Review and Network Meta-Analysis.

AJR Am J Roentgenol. 2023-6

[9]
The additive value of real-time elastography to thyroid ultrasound in detecting papillary carcinoma in nodules over 20 mm in diameter.

Endocrine. 2025-4-24

[10]
Interobserver Agreement Among Thyroid Ultrasound Operators in Defining Thyroid Nodules as Subcapsular.

Thyroid. 2025-8-25

本文引用的文献

[1]
Performance of computer scientists in the assessment of thyroid nodules using TIRADS lexicons.

J Endocrinol Invest. 2025-4

[2]
Head-to-head comparison of American, European, and Asian TIRADSs in thyroid nodule assessment: systematic review and meta-analysis.

Eur Thyroid J. 2024-4-1

[3]
International Expert Consensus on US Lexicon for Thyroid Nodules.

Radiology. 2023-10

[4]
Diagnostic Performance of Ultrasound-Based Risk Stratification Systems for Thyroid Nodules: A Systematic Review and Meta-Analysis.

Endocrinol Metab (Seoul). 2023-2

[5]
Considerable interobserver variation calls for unambiguous definitions of thyroid nodule ultrasound characteristics.

Eur Thyroid J. 2023-4-1

[6]
2021 Korean Thyroid Imaging Reporting and Data System and Imaging-Based Management of Thyroid Nodules: Korean Society of Thyroid Radiology Consensus Statement and Recommendations.

Korean J Radiol. 2021-12

[7]
Artificial Intelligence in Thyroid Field-A Comprehensive Review.

Cancers (Basel). 2021-9-22

[8]
The New Era of TIRADSs to Stratify the Risk of Malignancy of Thyroid Nodules: Strengths, Weaknesses and Pitfalls.

Cancers (Basel). 2021-8-26

[9]
Unnecessary thyroid nodule biopsy rates under four ultrasound risk stratification systems: a systematic review and meta-analysis.

Eur Radiol. 2021-5

[10]
Inter- and Intraobserver Agreement in the Assessment of Thyroid Nodule Ultrasound Features and Classification Systems: A Blinded Multicenter Study.

Thyroid. 2020-2

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索