• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

定制的ChatGPT可以准确回答来自国际专家截骨术共识声明的问题。

A custom ChatGPT can accurately answer questions from an international expert osteotomy consensus statement.

作者信息

Mabrouk Ahmed, Boutefnouchet Tarek, Malik Shahbaz, Sweed Tamer

机构信息

Basingstoke and North Hampshire Hospital, Basingstoke, United Kingdom.

University Hospitals Birmingham NHS Foundation Trust, Birmingham, United Kingdom.

出版信息

Eur J Orthop Surg Traumatol. 2025 Jun 16;35(1):247. doi: 10.1007/s00590-025-04373-7.

DOI:10.1007/s00590-025-04373-7
PMID:40522338
Abstract

PURPOSE

This study aimed to assess the accuracy of a custom ChatGPT in responding to questions specifically related to high tibial osteotomies (HTO), using an international expert osteotomy consensus statement as the source of information.

METHODS

A custom ChatGPT was developed using European Society of Sports Traumatology, Knee Surgery and Arthroscopy (ESSKA) osteotomy consensus for the painful degenerative varus knee as the primary training material. The custom ChatGPT was then tested for accuracy by generating responses to a series of 10 questions:five directly extracted from the consensus statement (Identical group) and five other common questions related to HTO (Random group). The generated responses were assessed by three knee surgeons using a bespoke scoring system. The scoring system evaluated accuracy, relevance, clarity, completeness, and adherence to the consensus. Each item was scored on a four-point Likert scale from 0 to 3. Inter-rater reliability was calculated with an intra-class correlation coefficient (ICC).

RESULTS

A total of 30 questions were asked to the custom ChatGPT by the three raters. The mean scores for accuracy, relevance, and clarity were 2.5 ± 0.8, 2.9 ± 0.3, and 2.9 ± 0.2, respectively. The inter-rater reliability for these scores was good (ICC 0.7, p = 0.004). Whereas, the mean score for completeness was 2.6 ± 0.5 with moderate inter-rater reliability (ICC 0.5, p = 0.1) and the mean score for adherence to the consensus statement document was 2.5 ± 1.1 with excellent inter-rater reliability (ICC 0.9, p < 0.001). There was no significant intergroup difference in accuracy, relevance, clarity and completeness (All p > 0.05). Only adherence to PDF was significantly lower in the random group 1.9 ± 0.5 versus 3 in the identical group (p = 0.01).

CONCLUSION

A custom ChatGPT can be trained to accurately answer questions from an international expert osteotomy consensus statement, indicating effective training and customization. This can serve as a valuable tool to guide surgeons in their practice by providing evidence-based answers to key questions in a time-efficient manner and is potentially applicable to other consensus statements and published literature.

摘要

目的

本研究旨在以国际专家截骨术共识声明为信息来源,评估定制的ChatGPT回答与高位胫骨截骨术(HTO)相关问题的准确性。

方法

以欧洲运动创伤、膝关节手术和关节镜学会(ESSKA)针对疼痛性退行性膝内翻的截骨术共识为主要训练材料,开发了定制的ChatGPT。然后,通过让定制的ChatGPT回答一系列10个问题来测试其准确性:其中5个问题直接从共识声明中提取(相同组),另外5个是与HTO相关的常见问题(随机组)。由三名膝关节外科医生使用定制的评分系统对生成的回答进行评估。该评分系统评估准确性、相关性、清晰度、完整性以及对共识的遵循情况。每个项目根据从0到3的四点李克特量表进行评分。使用组内相关系数(ICC)计算评分者间信度。

结果

三名评分者共向定制的ChatGPT提出了30个问题。准确性、相关性和清晰度的平均得分分别为2.5±0.8、2.9±0.3和2.9±0.2。这些得分的评分者间信度良好(ICC 0.7,p = 0.004)。而完整性的平均得分为2.6±0.5,评分者间信度中等(ICC 0.5,p = 0.1),对共识声明文件的遵循情况的平均得分为2.5±1.1,评分者间信度极佳(ICC 0.9,p < 0.001)。准确性、相关性、清晰度和完整性方面没有显著的组间差异(所有p > 0.05)。只有随机组对PDF的遵循情况显著低于相同组,分别为1.9±0.5和3(p = 0.01)。

结论

可以训练定制的ChatGPT准确回答来自国际专家截骨术共识声明的问题,表明训练和定制是有效的。这可以作为一种有价值的工具,通过及时有效地为关键问题提供循证答案来指导外科医生的实践,并且可能适用于其他共识声明和已发表的文献。

相似文献

1
A custom ChatGPT can accurately answer questions from an international expert osteotomy consensus statement.定制的ChatGPT可以准确回答来自国际专家截骨术共识声明的问题。
Eur J Orthop Surg Traumatol. 2025 Jun 16;35(1):247. doi: 10.1007/s00590-025-04373-7.
2
Surgical strategy and complication management of osteotomy around the painful degenerative varus knee: ESSKA Formal Consensus Part II.截骨术治疗疼痛性退行性内翻膝的手术策略和并发症管理:ESSKA 正式共识第二部分。
Knee Surg Sports Traumatol Arthrosc. 2024 Aug;32(8):2194-2205. doi: 10.1002/ksa.12273. Epub 2024 May 20.
3
Assessing Ability for ChatGPT to Answer Total Knee Arthroplasty-Related Questions.评估 ChatGPT 回答全膝关节置换术相关问题的能力。
J Arthroplasty. 2024 Aug;39(8):2022-2027. doi: 10.1016/j.arth.2024.02.023. Epub 2024 Feb 14.
4
Radiological outcomes in a randomized trial comparing opening wedge and closing wedge techniques of high tibial osteotomy.一项比较胫骨高位截骨开放楔形和闭合楔形技术的随机试验的放射学结果
Knee Surg Sports Traumatol Arthrosc. 2017 Mar;25(3):910-917. doi: 10.1007/s00167-015-3817-z. Epub 2015 Oct 14.
5
ChatGPT Provides Satisfactory but Occasionally Inaccurate Answers to Common Patient Hip Arthroscopy Questions.ChatGPT对常见的患者髋关节镜检查问题能提供令人满意但偶尔不准确的答案。
Arthroscopy. 2025 May;41(5):1337-1347. doi: 10.1016/j.arthro.2024.06.017. Epub 2024 Jun 22.
6
Evaluation of ChatGPT-4o's answers to questions about hip arthroscopy from the patient perspective.从患者角度评估ChatGPT-4o对髋关节镜检查相关问题的回答。
Jt Dis Relat Surg. 2025 Jan 2;36(1):193-199. doi: 10.52312/jdrs.2025.1961. Epub 2024 Dec 18.
7
Osteotomies for genu varum: Should we always correct at the tibia? A multicenter analysis of practices in France.治疗膝内翻的截骨术:我们是否总是应该在胫骨处进行矫正?法国多中心实践分析
Orthop Traumatol Surg Res. 2025 Feb;111(1):103925. doi: 10.1016/j.otsr.2024.103925. Epub 2024 Jul 2.
8
Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT-4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information.评估DeepResearch和DeepThink在前交叉韧带手术患者教育中的作用:ChatGPT-4o在全面性方面表现出色,DeepSeek R1在骨科信息的清晰度和可读性方面领先。
Knee Surg Sports Traumatol Arthrosc. 2025 Jun 1. doi: 10.1002/ksa.12711.
9
Artificial Intelligence Large Language Models Address Anterior Cruciate Ligament Reconstruction: Superior Clarity and Completeness by Gemini Compared With ChatGPT-4 in Response to American Academy of Orthopaedic Surgeons Clinical Practice Guidelines.人工智能大语言模型助力前交叉韧带重建:与ChatGPT-4相比,Gemini在回应美国矫形外科医师学会临床实践指南时具有更高的清晰度和完整性。
Arthroscopy. 2025 Jun;41(6):2002-2008. doi: 10.1016/j.arthro.2024.09.020. Epub 2024 Sep 21.
10
ChatGPT versus expert arthroplasty surgeons in total knee arthroplasty patient counseling.在全膝关节置换患者咨询方面,ChatGPT与关节置换专家外科医生的比较
Knee. 2025 Aug;55:12-17. doi: 10.1016/j.knee.2025.03.005. Epub 2025 Apr 8.

本文引用的文献

1
Custom GPTs Enhancing Performance and Evidence Compared with GPT-3.5, GPT-4, and GPT-4o? A Study on the Emergency Medicine Specialist Examination.与GPT-3.5、GPT-4和GPT-4o相比,定制生成式预训练变换器(Custom GPTs)在提升性能和证据方面如何?一项关于急诊医学专科考试的研究。
Healthcare (Basel). 2024 Aug 30;12(17):1726. doi: 10.3390/healthcare12171726.
2
Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery.ChatGPT对前交叉韧带手术常见问题回答的准确性评估
Knee. 2024 Dec;51:84-92. doi: 10.1016/j.knee.2024.08.014. Epub 2024 Sep 5.
3
Do ChatGPT and Gemini Provide Appropriate Recommendations for Pediatric Orthopaedic Conditions?
ChatGPT和Gemini是否能为小儿骨科疾病提供恰当的建议?
J Pediatr Orthop. 2025 Jan 1;45(1):e66-e71. doi: 10.1097/BPO.0000000000002797. Epub 2024 Aug 22.
4
Exploring the Potential of Code-Free Custom GPTs in Ophthalmology: An Early Analysis of GPT Store and User-Creator Guidance.探索免代码自定义生成式预训练变换器在眼科领域的潜力:对生成式预训练变换器商店及用户-创作者指南的早期分析
Ophthalmol Ther. 2024 Oct;13(10):2697-2713. doi: 10.1007/s40123-024-01014-w. Epub 2024 Aug 14.
5
Responses From ChatGPT-4 Show Limited Correlation With Expert Consensus Statement on Anterior Shoulder Instability.ChatGPT-4的回答与关于前肩不稳的专家共识声明的相关性有限。
Arthrosc Sports Med Rehabil. 2024 Mar 5;6(3):100923. doi: 10.1016/j.asmr.2024.100923. eCollection 2024 Jun.
6
ChatGPT-4 Generates More Accurate and Complete Responses to Common Patient Questions About Anterior Cruciate Ligament Reconstruction Than Google's Search Engine.与谷歌搜索引擎相比,ChatGPT-4对前交叉韧带重建常见患者问题的回答更准确、更完整。
Arthrosc Sports Med Rehabil. 2024 Apr 9;6(3):100939. doi: 10.1016/j.asmr.2024.100939. eCollection 2024 Jun.
7
Challenges and opportunities of artificial intelligence implementation within sports science and sports medicine teams.体育科学与运动医学团队中人工智能应用的挑战与机遇。
Front Sports Act Living. 2024 May 20;6:1332427. doi: 10.3389/fspor.2024.1332427. eCollection 2024.
8
Surgical strategy and complication management of osteotomy around the painful degenerative varus knee: ESSKA Formal Consensus Part II.截骨术治疗疼痛性退行性内翻膝的手术策略和并发症管理:ESSKA 正式共识第二部分。
Knee Surg Sports Traumatol Arthrosc. 2024 Aug;32(8):2194-2205. doi: 10.1002/ksa.12273. Epub 2024 May 20.
9
Osteotomy around the painful degenerative varus knee has broader indications than conventionally described but must follow a strict planning process: ESSKA Formal Consensus Part I.在疼痛性退行性内翻膝周围进行截骨术的适应证比传统描述的更广泛,但必须遵循严格的规划过程:ESSKA 正式共识第一部分。
Knee Surg Sports Traumatol Arthrosc. 2024 Jul;32(7):1891-1901. doi: 10.1002/ksa.12256. Epub 2024 May 13.
10
Introducing AnatomyGPT: A customized artificial intelligence application for anatomical sciences education.介绍 AnatomyGPT:一个用于解剖科学教育的定制人工智能应用程序。
Clin Anat. 2024 Sep;37(6):661-669. doi: 10.1002/ca.24178. Epub 2024 May 9.