• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估 ChatGPT(人工智能-大型语言模型)在肩稳定手术方面的信息质量。

Evaluation High-Quality of Information from ChatGPT (Artificial Intelligence-Large Language Model) Artificial Intelligence on Shoulder Stabilization Surgery.

机构信息

Duke University, Durham, North Carolina, U.S.A..

Duke University, Durham, North Carolina, U.S.A.

出版信息

Arthroscopy. 2024 Mar;40(3):726-731.e6. doi: 10.1016/j.arthro.2023.07.048. Epub 2023 Aug 9.

DOI:10.1016/j.arthro.2023.07.048
PMID:37567487
Abstract

PURPOSE

To analyze the quality and readability of information regarding shoulder stabilization surgery available using an online AI software (ChatGPT), using standardized scoring systems, as well as to report on the given answers by the AI.

METHODS

An open AI model (ChatGPT) was used to answer 23 commonly asked questions from patients on shoulder stabilization surgery. These answers were evaluated for medical accuracy, quality, and readability using The JAMA Benchmark criteria, DISCERN score, Flesch-Kincaid Reading Ease Score (FRES) & Grade Level (FKGL).

RESULTS

The JAMA Benchmark criteria score was 0, which is the lowest score, indicating no reliable resources cited. The DISCERN score was 60, which is considered a good score. The areas that open AI model did not achieve full marks were also related to the lack of available source material used to compile the answers, and finally some shortcomings with information not fully supported by the literature. The FRES was 26.2, and the FKGL was considered to be that of a college graduate.

CONCLUSIONS

There was generally high quality in the answers given on questions relating to shoulder stabilization surgery, but there was a high reading level required to comprehend the information presented. However, it is unclear where the answers came from with no source material cited. It is important to note that the ChatGPT software repeatedly references the need to discuss these questions with an orthopaedic surgeon and the importance of shared discussion making, as well as compliance with surgeon treatment recommendations.

CLINICAL RELEVANCE

As shoulder instability is an injury that predominantly affects younger individuals who may use the Internet for information, this study shows what information patients may be getting online.

摘要

目的

使用标准化评分系统分析在线人工智能软件(ChatGPT)中有关肩部稳定手术的信息的质量和可读性,并报告人工智能的回答。

方法

使用开放式人工智能模型(ChatGPT)回答了 23 个关于肩部稳定手术的常见患者问题。使用 JAMA 基准标准、DISCERN 评分、Flesch-Kincaid 阅读舒适度得分(FRES)和等级水平(FKGL)评估这些答案的医学准确性、质量和可读性。

结果

JAMA 基准标准评分为 0,这是最低分,表明没有引用可靠的资源。DISCERN 评分为 60,这被认为是一个不错的分数。人工智能模型没有获得满分的领域也与用于编写答案的可用资料不足有关,最后,一些信息没有得到文献的充分支持。FRES 为 26.2,FKGL 被认为是大学毕业生的水平。

结论

与肩部稳定手术相关的问题的回答总体质量较高,但理解所呈现信息的阅读水平要求较高。然而,不清楚答案来自何处,也没有引用任何资料来源。需要注意的是,ChatGPT 软件反复提到需要与骨科医生讨论这些问题,并强调共同讨论的重要性,以及遵守外科医生的治疗建议。

临床意义

由于肩不稳定是一种主要影响年轻人的损伤,他们可能会在网上寻找信息,因此本研究展示了患者可能在网上获取哪些信息。

相似文献

1
Evaluation High-Quality of Information from ChatGPT (Artificial Intelligence-Large Language Model) Artificial Intelligence on Shoulder Stabilization Surgery.评估 ChatGPT(人工智能-大型语言模型)在肩稳定手术方面的信息质量。
Arthroscopy. 2024 Mar;40(3):726-731.e6. doi: 10.1016/j.arthro.2023.07.048. Epub 2023 Aug 9.
2
Evaluation of Online Artificial Intelligence-Generated Information on Common Hand Procedures.常见手部手术的在线人工智能生成信息评估
J Hand Surg Am. 2023 Nov;48(11):1122-1127. doi: 10.1016/j.jhsa.2023.08.003. Epub 2023 Sep 9.
3
Evaluation of information from artificial intelligence on rotator cuff repair surgery.人工智能在肩袖修复手术方面信息的评估。
JSES Int. 2023 Oct 21;8(1):53-57. doi: 10.1016/j.jseint.2023.09.009. eCollection 2024 Jan.
4
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性:公众需谨慎。
Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.
5
Dr. Google to Dr. ChatGPT: assessing the content and quality of artificial intelligence-generated medical information on appendicitis.谷歌博士对 ChatGPT 博士:评估人工智能生成的关于阑尾炎的医学信息的内容和质量。
Surg Endosc. 2024 May;38(5):2887-2893. doi: 10.1007/s00464-024-10739-5. Epub 2024 Mar 5.
6
Evaluation of Online AI-Generated Foot and Ankle Surgery Information.在线人工智能生成的足踝外科信息评估。
J Foot Ankle Surg. 2024 Nov-Dec;63(6):680-683. doi: 10.1053/j.jfas.2024.06.009. Epub 2024 Jul 3.
7
BPPV Information on Google Versus AI (ChatGPT).谷歌与人工智能(ChatGPT)上的良性阵发性位置性眩晕信息
Otolaryngol Head Neck Surg. 2024 Jun;170(6):1504-1511. doi: 10.1002/ohn.506. Epub 2023 Aug 25.
8
The quality and readability of patient information provided by ChatGPT: can AI reliably explain common ENT operations?ChatGPT 提供的患者信息的质量和可读性:人工智能能可靠地解释常见的耳鼻喉科手术吗?
Eur Arch Otorhinolaryngol. 2024 Nov;281(11):6147-6153. doi: 10.1007/s00405-024-08598-w. Epub 2024 Mar 26.
9
Can ChatGPT answer patient questions regarding reverse shoulder arthroplasty?ChatGPT能否回答患者关于反肩关节置换术的问题?
J ISAKOS. 2024 Dec;9(6):100323. doi: 10.1016/j.jisako.2024.100323. Epub 2024 Sep 20.
10
Is ChatGPT a Reliable Source of Patient Information on Asthma?ChatGPT是哮喘患者信息的可靠来源吗?
Cureus. 2024 Jul 8;16(7):e64114. doi: 10.7759/cureus.64114. eCollection 2024 Jul.

引用本文的文献

1
Evaluation of a Popular Large Language Model in Orthopedic Literature Review: Comparison to Previously Published Reviews.评估一种流行的大型语言模型在骨科文献综述中的应用:与先前发表的综述进行比较。
Arch Bone Jt Surg. 2025;13(8):460-469. doi: 10.22038/ABJS.2025.84896.3874.
2
ChatGPT-4 Responses on Ankle Cartilage Surgery Often Diverge from Expert Consensus: A Comparative Analysis.ChatGPT-4对踝关节软骨手术的回答往往与专家共识存在分歧:一项比较分析。
Foot Ankle Orthop. 2025 Aug 13;10(3):24730114251352494. doi: 10.1177/24730114251352494. eCollection 2025 Jul.
3
Evaluating if ChatGPT Can Answer Common Patient Questions Compared to OrthoInfo Regarding Lateral Epicondylitis.
评估与OrthoInfo相比,ChatGPT能否回答有关外侧上髁炎的常见患者问题。
Iowa Orthop J. 2025;45(1):19-32.
4
Assessing information provided via artificial intelligence regarding distal biceps tendon repair surgery.评估通过人工智能提供的有关肱二头肌远端肌腱修复手术的信息。
J Exp Orthop. 2025 May 19;12(2):e70281. doi: 10.1002/jeo2.70281. eCollection 2025 Apr.
5
Microsoft Copilot Provides More Accurate and Reliable Information About Anterior Cruciate Ligament Injury and Repair Than ChatGPT and Google Gemini; However, No Resource Was Overall the Best.与ChatGPT和谷歌Gemini相比,微软Copilot能提供关于前交叉韧带损伤与修复的更准确、更可靠的信息;然而,没有一种资源在各方面都是最佳的。
Arthrosc Sports Med Rehabil. 2024 Nov 19;7(2):101043. doi: 10.1016/j.asmr.2024.101043. eCollection 2025 Apr.
6
Can popular AI large language models provide reliable answers to frequently asked questions about rotator cuff tears?流行的人工智能大语言模型能否为有关肩袖撕裂的常见问题提供可靠答案?
JSES Int. 2024 Nov 29;9(2):390-397. doi: 10.1016/j.jseint.2024.11.012. eCollection 2025 Mar.
7
Evaluating if ChatGPT Can Answer Common Patient Questions Compared With OrthoInfo Regarding Rotator Cuff Tears.评估ChatGPT与OrthoInfo相比能否回答有关肩袖撕裂的常见患者问题。
J Am Acad Orthop Surg Glob Res Rev. 2025 Mar 11;9(3). doi: 10.5435/JAAOSGlobal-D-24-00289. eCollection 2025 Mar 1.
8
Evaluating the Quality and Readability of Generative Artificial Intelligence (AI) Chatbot Responses in the Management of Achilles Tendon Rupture.评估生成式人工智能(AI)聊天机器人在跟腱断裂管理中的回复质量和可读性。
Cureus. 2025 Jan 31;17(1):e78313. doi: 10.7759/cureus.78313. eCollection 2025 Jan.
9
Evaluating the Quality and Readability of Information Provided by Generative Artificial Intelligence Chatbots on Clavicle Fracture Treatment Options.评估生成式人工智能聊天机器人提供的关于锁骨骨折治疗方案信息的质量和可读性。
Cureus. 2025 Jan 9;17(1):e77200. doi: 10.7759/cureus.77200. eCollection 2025 Jan.
10
Large Language Models for Chatbot Health Advice Studies: A Systematic Review.用于聊天机器人健康建议研究的大语言模型:一项系统综述。
JAMA Netw Open. 2025 Feb 3;8(2):e2457879. doi: 10.1001/jamanetworkopen.2024.57879.