文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Accuracy of Artificial Intelligence Based Chatbots in Analyzing Orthopedic Pathologies: An Experimental Multi-Observer Analysis.

作者信息

Gehlen Tobias, Joost Theresa, Solbrig Philipp, Stahnke Katharina, Zahn Robert, Jahn Markus, Adl Amini Dominik, Back David Alexander

机构信息

Center for Musculoskeletal Surgery, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, 13353 Berlin, Germany.

Move Ahead-Foot Ankle and Sportsclinic, 10117 Berlin, Germany.

出版信息

Diagnostics (Basel). 2025 Jan 19;15(2):221. doi: 10.3390/diagnostics15020221.


DOI:10.3390/diagnostics15020221
PMID:39857105
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11764310/
Abstract

The rapid development of artificial intelligence (AI) is impacting the medical sector by offering new possibilities for faster and more accurate diagnoses. Symptom checker apps show potential for supporting patient decision-making in this regard. Whether the AI-based decision-making of symptom checker apps shows better performance in diagnostic accuracy and urgency assessment compared to physicians remains unclear. Therefore, this study aimed to investigate the performance of existing symptom checker apps in orthopedic and traumatology cases compared to physicians in the field. 30 fictitious case vignettes of common conditions in trauma surgery and orthopedics were retrospectively examined by four orthopedic and traumatology specialists and four different symptom checker apps for diagnostic accuracy and the recommended urgency of measures. Based on the estimation provided by the doctors and the individual symptom checker apps, the percentage of correct diagnoses and appropriate assessments of treatment urgency was calculated in mean and standard deviation [SD] in [%]. Data were analyzed statistically for accuracy and correlation between the apps and physicians using a nonparametric Spearman's correlation test ( < 0.05). The physicians provided the correct diagnosis in 84.4 ± 18.4% of cases (range: 53.3 to 96.7%), and the symptom checker apps in 35.8 ± 1.0% of cases (range: 26.7 to 54.2%). The agreement in the accuracy of the diagnoses varied from low to high (Physicians vs. Physicians: Spearman's ρ: 0.143 to 0.538; Physicians vs. Apps: Spearman's ρ: 0.007 to 0.358) depending on the different physicians and apps. In relation to the whole population, the physicians correctly assessed the urgency level in 70.0 ± 4.7% (range: 66.7 to 73.3%) and the apps in 20.6 ± 5.6% (range: 10.8 to 37.5%) of cases. The agreement on the accuracy of estimating urgency levels was moderate to high between and within physicians and individual apps. AI-based symptom checker apps for diagnosis in orthopedics and traumatology do not yet provide a more accurate analysis regarding diagnosis and urgency evaluation than physicians. However, there is a broad variation in the accuracy between different digital tools. Altogether, this field of AI application shows excellent potential and should be further examined in future studies.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2aa/11764310/85d845ab04ff/diagnostics-15-00221-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2aa/11764310/c127a5c37bd0/diagnostics-15-00221-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2aa/11764310/85d845ab04ff/diagnostics-15-00221-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2aa/11764310/c127a5c37bd0/diagnostics-15-00221-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2aa/11764310/85d845ab04ff/diagnostics-15-00221-g002.jpg

相似文献

[1]
Accuracy of Artificial Intelligence Based Chatbots in Analyzing Orthopedic Pathologies: An Experimental Multi-Observer Analysis.

Diagnostics (Basel). 2025-1-19

[2]
Determinants of Laypersons' Trust in Medical Decision Aids: Randomized Controlled Trial.

JMIR Hum Factors. 2022-5-3

[3]
Evaluation of Diagnostic and Triage Accuracy and Usability of a Symptom Checker in an Emergency Department: Observational Study.

JMIR Mhealth Uhealth. 2022-9-19

[4]
Patient Perspectives on the Usefulness of an Artificial Intelligence-Assisted Symptom Checker: Cross-Sectional Survey Study.

J Med Internet Res. 2020-1-30

[5]
Assessment of a Digital Symptom Checker Tool's Accuracy in Suggesting Reproductive Health Conditions: Clinical Vignettes Study.

JMIR Mhealth Uhealth. 2023-12-5

[6]
Comparison of physician and artificial intelligence-based symptom checker diagnostic accuracy.

Rheumatol Int. 2022-12

[7]
How suitable are clinical vignettes for the evaluation of symptom checker apps? A test theoretical perspective.

Digit Health. 2023-8-21

[8]
Does an App a Day Keep the Doctor Away? AI Symptom Checker Applications, Entrenched Bias, and Professional Responsibility.

J Med Internet Res. 2024-6-5

[9]
Longitudinal Changes in Diagnostic Accuracy of a Differential Diagnosis List Developed by an AI-Based Symptom Checker: Retrospective Observational Study.

JMIR Form Res. 2024-5-17

[10]
Accuracy of a Popular Online Symptom Checker for Ophthalmic Diagnoses.

JAMA Ophthalmol. 2019-6-1

本文引用的文献

[1]
The Quality and Utility of Artificial Intelligence in Patient Care.

Dtsch Arztebl Int. 2023-7-10

[2]
Artificial Intelligence Applications in Hepatology.

Clin Gastroenterol Hepatol. 2023-7

[3]
Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review.

PLOS Digit Health. 2022-3-31

[4]
Clinical application and diagnostic accuracy of artificial intelligence in colonoscopy for inflammatory bowel disease: systematic review.

Endosc Int Open. 2022-7-15

[5]
The Emergence of Artificial Intelligence in Cardiology: Current and Future Applications.

Curr Cardiol Rev. 2022

[6]
Enhancing Diagnosis Through Technology: Decision Support, Artificial Intelligence, and Beyond.

Crit Care Clin. 2022-1

[7]
Novel Methods in the Surveillance of Influenza-Like Illness in Germany Using Data From a Symptom Assessment App (Ada): Observational Case Study.

JMIR Public Health Surveill. 2021-11-4

[8]
Clinical impact and quality of randomized controlled trials involving interventions evaluating artificial intelligence prediction tools: a systematic review.

NPJ Digit Med. 2021-10-28

[9]
Accuracy of online symptom checkers and the potential impact on service utilisation.

PLoS One. 2021

[10]
Artificial intelligence, bias, and patients' perspectives.

Lancet. 2021-5-29

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索