Can ChatGPT Accurately Answer a PICOT Question? Assessing AI Response to a Clinical Question.

Authors

Branum Candise, Schiavenato Martin

Affiliations

Health Science Librarian and Assistant Professor (Mx Branum), Foley Center Library, and Assistant Professor (Dr Schiavenato), School of Nursing and Human Physiology, Gonzaga University, Spokane, Washington.

Publication information

Nurse Educ. 2023;48(5):231-233. doi: 10.1097/NNE.0000000000001436. Epub 2023 Apr 28.


DOI: 10.1097/NNE.0000000000001436
PMID: 37130197
Abstract

BACKGROUND: ChatGPT, an artificial intelligence (AI) text generator trained to predict correct words, can provide answers to questions but has shown mixed results in answering medical questions.

PURPOSE: To assess the reliability and accuracy of ChatGPT in providing answers to a complex clinical question.

METHODS: A Population, Intervention, Comparison, Outcome, and Time (PICOT) formatted question was queried, along with a request for references. Full-text articles were reviewed to verify the accuracy of the evidence summary provided by the chatbot.

RESULTS: ChatGPT was unable to provide a certifiable response to a PICOT question. The references cited as evidence included incorrect journal information, and many study details summarized by ChatGPT proved to be patently false, including providing fabricated data.

CONCLUSIONS: ChatGPT provides answers that appear legitimate but may be factually incorrect. The system is not transparent in how it gathers data to answer questions and sometimes fabricates information that looks plausible, making it an unreliable tool for clinical questions.

Similar articles

[1]
Can ChatGPT Accurately Answer a PICOT Question? Assessing AI Response to a Clinical Question.

Nurse Educ. 2023

[2]
Can ChatGPT be trusted as a resource for a scholarly article on treatment planning implant-supported prostheses?

J Prosthet Dent. 2025-4-9

[3]
Comparison of ChatGPT and Internet Research for Clinical Research and Decision-Making in Occupational Medicine: Randomized Controlled Trial.

JMIR Form Res. 2025-5-20

[4]
Evaluating ChatGPT Responses on Thyroid Nodules for Patient Education.

Thyroid. 2024-3

[5]
Artificial Intelligence in Orthopaedics: Performance of ChatGPT on Text and Image Questions on a Complete AAOS Orthopaedic In-Training Examination (OITE).

J Surg Educ. 2024-11

[6]
"Dr. AI Will See You Now": How Do ChatGPT-4 Treatment Recommendations Align With Orthopaedic Clinical Practice Guidelines?

Clin Orthop Relat Res. 2024-12-1

[7]
Can generative artificial intelligence pass the orthopaedic board examination?

J Orthop. 2023-11-5

[8]
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.

Cureus. 2025-6-1

[9]
Is Information About Musculoskeletal Malignancies From Large Language Models or Web Resources at a Suitable Reading Level for Patients?

Clin Orthop Relat Res. 2025-2-1

[10]
Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis.

BMC Med Educ. 2024-9-16

Cited by

[1]
ChatGPT Applications in Nursing: Current Status and Future Perspectives.

Nurs Open. 2025-6

[2]
AI Chatbots as Sources of STD Information: A Study on Reliability and Readability.

J Med Syst. 2025-4-3

[3]
Performance of ChatGPT-4 on Taiwanese Traditional Chinese Medicine Licensing Examinations: Cross-Sectional Study.

JMIR Med Educ. 2025-3-19

[4]
Performance of Artificial Intelligence Chatbots on Ultrasound Examinations: Cross-Sectional Comparative Analysis.

JMIR Med Inform. 2025-1-9

[5]
Qwen-2.5 Outperforms Other Large Language Models in the Chinese National Nursing Licensing Examination: Retrospective Cross-Sectional Comparative Study.

JMIR Med Inform. 2025-1-10

[6]
Exploring ChatGPT in clinical inquiry: a scoping review of characteristics, applications, challenges, and evaluation.

Ann Med Surg (Lond). 2024-11-8

[7]
Comparing the Accuracy of Two Generated Large Language Models in Identifying Health-Related Rumors or Misconceptions and the Applicability in Health Science Popularization: Proof-of-Concept Study.

JMIR Form Res. 2024-12-2

[8]
PICOT questions and search strategies formulation: A novel approach using artificial intelligence automation.

J Nurs Scholarsh. 2025-1

[9]
ChatGPT in medicine: A cross-disciplinary systematic review of ChatGPT's (artificial intelligence) role in research, clinical practice, education, and patient interaction.

Medicine (Baltimore). 2024-8-9

[10]
Potential Roles of Large Language Models in the Production of Systematic Reviews and Meta-Analyses.

J Med Internet Res. 2024-6-25
