• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

足踝外科的计费与编码:我们能信任人工智能吗?

Billing and Coding in Foot and Ankle Surgery: Can We Trust Artificial Intelligence?

作者信息

Provenzano Gina, McCahon Joseph A S, Nghe Amy, Lencer Adam, Amponsah Nana, Daniel Joseph N, Pedowitz David I, Parekh Selene G

机构信息

Thomas Jefferson University Hospital, Philadelphia, Pennsylvania.

Jefferson Health NJ, Stratford, New Jersey.

出版信息

Foot Ankle Spec. 2025 Aug 20:19386400251363000. doi: 10.1177/19386400251363000.

DOI:10.1177/19386400251363000
PMID:40831290
Abstract

BackgroundBilling and coding for orthopaedic procedures is a complex process with thousands of procedure codes and associated modifiers in existence. Foot and ankle faces an additional challenge as it is among the highest variability regarding procedures performed compared with other orthopaedic subspecialities. This study aimed to investigate the capabilities of the top AI search engines in accurately identifying Current Procedural Terminology (CPT) codes for common foot and ankle procedures.MethodsA comparative analysis of 3 publically available AI search engines (ChatGPT, Bing, and Google Gemini) was performed investigating their accuracy in generating CPT codes for common orthopaedic foot and ankle procedures. The generated CPT codes were recorded and compared with the codes generated by 3 fellowship trained foot and ankle surgeons, serving as the reference standard. Cohen kappa coefficient was used to determine agreement across AI platforms regarding the surgeon coding reference standard. ResultsThe AI search engines were able to correctly generate the appropriate CPT codes 44% of the time, with Bing being the most accurate in generating the correct CPT codes for 8 of the 13 procedures (62%) and partially correct codes 3 of the 13 procedures (23%). ChatGPT demonstrated the worst accuracy, generating the correct CPT codes only 23% of the time (3/13). AI platforms demonstrated an overall Fair Agreement with the reference standard (kappa = 0.201). Individually, Bing demonstrated Moderate Agreement (kappa = 0.405), Google Gemini demonstrated Fair Agreement (kappa = 0.255), and ChatGPT demonstrated Poor Agreement with the reference standard (kappa = 0.171).ConclusionAlthough the capabilities of AI show great promise for many industries, the results of this study bring caution to relying on AI for accurately generating orthopaedic foot and ankle procedure CPT codes.Level of Evidence:.

摘要

背景

骨科手术的计费和编码是一个复杂的过程,现有数千个手术代码及相关修饰符。与其他骨科亚专业相比,足踝手术面临着额外的挑战,因为其手术的变异性是最高的。本研究旨在调查顶级人工智能搜索引擎准确识别常见足踝手术当前手术术语(CPT)代码的能力。

方法

对3个公开可用的人工智能搜索引擎(ChatGPT、必应和谷歌Gemini)进行了比较分析,研究它们生成常见骨科足踝手术CPT代码的准确性。记录生成的CPT代码,并与3名经过专科培训的足踝外科医生生成的代码进行比较,后者作为参考标准。使用科恩kappa系数来确定人工智能平台之间关于外科医生编码参考标准的一致性。

结果

人工智能搜索引擎能够在44%的时间内正确生成适当的CPT代码,必应在13种手术中的8种(62%)生成正确CPT代码方面最为准确,在13种手术中的3种(23%)生成部分正确代码。ChatGPT的准确性最差,仅在23%的时间内(3/13)生成正确的CPT代码。人工智能平台与参考标准总体显示出中等一致性(kappa = 0.201)。单独来看,必应显示出中等一致性(kappa = 0.405),谷歌Gemini显示出中等一致性(kappa = 0.255),ChatGPT与参考标准显示出较差的一致性(kappa = 0.171)。

结论

尽管人工智能的能力在许多行业显示出巨大潜力,但本研究结果提醒人们在依赖人工智能准确生成骨科足踝手术CPT代码时要谨慎。证据水平:

相似文献

1
Billing and Coding in Foot and Ankle Surgery: Can We Trust Artificial Intelligence?足踝外科的计费与编码:我们能信任人工智能吗?
Foot Ankle Spec. 2025 Aug 20:19386400251363000. doi: 10.1177/19386400251363000.
2
Evaluating the Efficacy of Large Language Models in CPT Coding for Craniofacial Surgery: A Comparative Analysis.评估大语言模型在颅面外科手术CPT编码中的有效性:一项比较分析。
J Craniofac Surg. 2025 May 1;36(3):831-835. doi: 10.1097/SCS.0000000000010575. Epub 2024 Sep 2.
3
Evaluating Large Language Model's accuracy in current procedural terminology coding given operative note templates across various plastic surgery sub-specialties.
J Plast Reconstr Aesthet Surg. 2025 Jul;106:50-52. doi: 10.1016/j.bjps.2025.04.025. Epub 2025 Apr 23.
4
Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis.用于计算局部麻醉药最大安全剂量的3种对话式生成人工智能模型的性能:比较分析
JMIR AI. 2025 May 13;4:e66796. doi: 10.2196/66796.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。
JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.
7
Comparison of Responses from ChatGPT-4, Google Gemini, and Google Search to Common Patient Questions About Ankle Sprains: A Readability Analysis.ChatGPT-4、谷歌Gemini和谷歌搜索对关于脚踝扭伤的常见患者问题的回答比较:可读性分析
J Am Acad Orthop Surg. 2025 Jul 3;33(16):924-930. doi: 10.5435/JAAOS-D-25-00260.
8
"Dr. AI Will See You Now": How Do ChatGPT-4 Treatment Recommendations Align With Orthopaedic Clinical Practice Guidelines?“AI 医生为您服务”:ChatGPT-4 的治疗建议与骨科临床实践指南如何契合?
Clin Orthop Relat Res. 2024 Dec 1;482(12):2098-2106. doi: 10.1097/CORR.0000000000003234. Epub 2024 Sep 6.
9
The Reliability Gap: How Traditional Search Engines Outperform Artificial Intelligence (AI) Chatbots in Rosacea Public Health Information Quality.可靠性差距:传统搜索引擎在酒渣鼻公共卫生信息质量方面如何优于人工智能(AI)聊天机器人。
Cureus. 2025 Jun 22;17(6):e86543. doi: 10.7759/cureus.86543. eCollection 2025 Jun.
10
Assessing the Reproducibility of the Structured Abstracts Generated by ChatGPT and Bard Compared to Human-Written Abstracts in the Field of Spine Surgery: Comparative Analysis.评估 ChatGPT 和 Bard 生成的结构化摘要与脊柱外科领域人类撰写的摘要在可重复性方面的比较:对比分析。
J Med Internet Res. 2024 Jun 26;26:e52001. doi: 10.2196/52001.