• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人工智能的大语言模型在疝病学决策支持中的有效性和安全性:专家和普通外科医生的评估

Efficacy and safety of artificial intelligence-based large language models for decision making support in herniology: evaluation by experts and general surgeons.

作者信息

Nechay T V, Sazhin A V, Loban K M, Bogomolova A K, Suglob V V, Beniia T R

机构信息

Pirogov Russian National Research Medical University, Moscow, Russia.

出版信息

Khirurgiia (Mosk). 2024(8):6-14. doi: 10.17116/hirurgia20240816.

DOI:10.17116/hirurgia20240816
PMID:39140937
Abstract

OBJECTIVE

To evaluate the quality of recommendations provided by ChatGPT regarding inguinal hernia repair.

MATERIAL AND METHODS

ChatGPT was asked 5 questions about surgical management of inguinal hernias. The chat-bot was assigned the role of expert in herniology and requested to search only specialized medical databases and provide information about references and evidence. Herniology experts and surgeons (non-experts) rated the quality of recommendations generated by ChatGPT using 4-point scale (from 0 to 3 points). Statistical correlations were explored between participants' ratings and their stance regarding artificial intelligence.

RESULTS

Experts scored the quality of ChatGPT responses lower than non-experts (2 (1-2) vs. 2 (2-3), <0.001). The chat-bot failed to provide valid references and actual evidence, as well as falsified half of references. Respondents were optimistic about the future of neural networks for clinical decision-making support. Most of them were against restricting their use in healthcare.

CONCLUSION

We would not recommend non-specialized large language models as a single or primary source of information for clinical decision making or virtual searching assistant.

摘要

目的

评估ChatGPT提供的关于腹股沟疝修补术建议的质量。

材料与方法

向ChatGPT询问了5个关于腹股沟疝手术治疗的问题。将聊天机器人设定为疝科学专家的角色,并要求其仅在专业医学数据库中进行搜索,并提供参考文献和证据方面的信息。疝科学专家和外科医生(非专家)使用4分制(从0到3分)对ChatGPT生成的建议质量进行评分。探讨了参与者评分与其对人工智能的态度之间的统计相关性。

结果

专家对ChatGPT回复质量的评分低于非专家(2(1 - 2)对2(2 - 3),<0.001)。聊天机器人未能提供有效的参考文献和实际证据,并且伪造了一半的参考文献。受访者对神经网络用于临床决策支持的未来持乐观态度。他们中的大多数人反对在医疗保健中限制其使用。

结论

我们不建议将非专业的大语言模型作为临床决策或虚拟搜索助手的唯一或主要信息来源。

相似文献

1
Efficacy and safety of artificial intelligence-based large language models for decision making support in herniology: evaluation by experts and general surgeons.基于人工智能的大语言模型在疝病学决策支持中的有效性和安全性:专家和普通外科医生的评估
Khirurgiia (Mosk). 2024(8):6-14. doi: 10.17116/hirurgia20240816.
2
International guidelines for groin hernia management.腹股沟疝治疗的国际指南。
Hernia. 2018 Feb;22(1):1-165. doi: 10.1007/s10029-017-1668-x. Epub 2018 Jan 12.
3
AI's deep dive into complex pediatric inguinal hernia issues: a challenge to traditional guidelines?人工智能深入研究复杂的小儿腹股沟疝问题:对传统指南的挑战?
Hernia. 2023 Dec;27(6):1587-1599. doi: 10.1007/s10029-023-02900-1. Epub 2023 Oct 16.
4
A cross-sectional study to evaluate responses generated by two AI software programs for common patient queries about laparoscopic repair of inguinal hernia.一项横断面研究,旨在评估两个人工智能软件程序对患者关于腹股沟疝腹腔镜修补术常见问题的回答。
Updates Surg. 2025 Apr;77(2):583-588. doi: 10.1007/s13304-025-02158-5. Epub 2025 Mar 5.
5
ChatGPT versus NASS clinical guidelines for degenerative spondylolisthesis: a comparative analysis.ChatGPT 与 NASS 退行性脊柱滑脱临床指南比较分析。
Eur Spine J. 2024 Nov;33(11):4182-4203. doi: 10.1007/s00586-024-08198-6. Epub 2024 Mar 15.
6
Appropriateness of Online Chat-Based Artificial Intelligence (ChatGPT) Answers to Common Questions on Inguinal Hernia Repair.基于在线聊天的人工智能(ChatGPT)对腹股沟疝修补常见问题回答的适宜性。
J Laparoendosc Adv Surg Tech A. 2024 Feb;34(2):141-143. doi: 10.1089/lap.2023.0403. Epub 2023 Dec 21.
7
Assessing the accuracy and quality of artificial intelligence (AI) chatbot-generated responses in making patient-specific drug-therapy and healthcare-related decisions.评估人工智能(AI)聊天机器人生成的回复在做出针对特定患者的药物治疗和医疗相关决策时的准确性和质量。
BMC Med Inform Decis Mak. 2024 Dec 24;24(1):404. doi: 10.1186/s12911-024-02824-5.
8
Artificial intelligence in neurovascular decision-making: a comparative analysis of ChatGPT-4 and multidisciplinary expert recommendations for unruptured intracranial aneurysms.人工智能在神经血管决策中的应用:ChatGPT-4与颅内未破裂动脉瘤多学科专家建议的比较分析
Neurosurg Rev. 2025 Feb 21;48(1):261. doi: 10.1007/s10143-025-03341-3.
9
Online artificial intelligence platforms and their applicability to gastrointestinal surgical operations.在线人工智能平台及其在胃肠外科手术中的适用性。
J Gastrointest Surg. 2024 Jan;28(1):64-69. doi: 10.1016/j.gassur.2023.11.019.
10
Do we need changes in the organization of training in the hernia treatment in Poland? A survey on the acknowledgment of the recommendations of international hernia societies by surgeons.我们是否需要改变波兰疝气治疗的培训组织方式?一项关于外科医生对国际疝气学会建议认可情况的调查。
Pol Przegl Chir. 2017 Oct 31;89(5):12-18. doi: 10.5604/01.3001.0010.5406.