• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于ChatGPT及其在医学和牙科研究中的应用的系统评价和荟萃分析。

A systematic review and meta-analysis on ChatGPT and its utilization in medical and dental research.

作者信息

Bagde Hiroj, Dhopte Ashwini, Alam Mohammad Khursheed, Basri Rehana

机构信息

Department of Periodontology, Chhattisgarh Dental College and Research Institute, Rajnandgaon, Chhattisgarh, India.

Department of Oral Medicine and Radiology, Chhattisgarh Dental College and Research Institute, Rajnandgaon, Chhattisgarh, India.

出版信息

Heliyon. 2023 Nov 29;9(12):e23050. doi: 10.1016/j.heliyon.2023.e23050. eCollection 2023 Dec.

DOI:10.1016/j.heliyon.2023.e23050
PMID:38144348
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10746423/
Abstract

UNLABELLED

Since its release, ChatGPT has taken the world by storm with its utilization in various fields of life. This review's main goal was to offer a thorough and fact-based evaluation of ChatGPT's potential as a tool for medical and dental research, which could direct subsequent research and influence clinical practices.

METHODS

Different online databases were scoured for relevant articles that were in accordance with the study objectives. A team of reviewers was assembled to devise a proper methodological framework for inclusion of articles and meta-analysis.

RESULTS

11 descriptive studies were considered for this review that evaluated the accuracy of ChatGPT in answering medical queries related to different domains such as systematic reviews, cancer, liver diseases, diagnostic imaging, education, and COVID-19 vaccination. The studies reported different accuracy ranges, from 18.3 % to 100 %, across various datasets and specialties. The meta-analysis showed an odds ratio (OR) of 2.25 and a relative risk (RR) of 1.47 with a 95 % confidence interval (CI), indicating that the accuracy of ChatGPT in providing correct responses was significantly higher compared to the total responses for queries. However, significant heterogeneity was present among the studies, suggesting considerable variability in the effect sizes across the included studies.

CONCLUSION

The observations indicate that ChatGPT has the ability to provide appropriate solutions to questions in the medical and dentistry areas, but researchers and doctors should cautiously assess its responses because they might not always be dependable. Overall, the importance of this study rests in shedding light on ChatGPT's accuracy in the medical and dentistry fields and emphasizing the need for additional investigation to enhance its performance. © 2017 Elsevier Inc. All rights reserved.

摘要

未标注

自发布以来,ChatGPT在生活的各个领域得到应用,席卷全球。本综述的主要目标是对ChatGPT作为医学和牙科研究工具的潜力进行全面且基于事实的评估,这可为后续研究提供指导并影响临床实践。

方法

在不同在线数据库中搜索符合研究目标的相关文章。组建了一个评审团队来设计纳入文章和进行荟萃分析的适当方法框架。

结果

本综述纳入了11项描述性研究,这些研究评估了ChatGPT回答与不同领域相关医学问题的准确性,如系统评价、癌症、肝脏疾病、诊断成像、教育和新冠疫苗接种。这些研究报告了不同数据集和专业领域的准确率范围,从18.3%到100%不等。荟萃分析显示,优势比(OR)为2.25,相对风险(RR)为1.47,95%置信区间(CI)表明,ChatGPT提供正确回答的准确率显著高于查询的总回答率。然而,研究之间存在显著异质性,表明纳入研究的效应大小存在相当大的变异性。

结论

观察结果表明,ChatGPT有能力为医学和牙科领域的问题提供适当解决方案,但研究人员和医生应谨慎评估其回答,因为它们可能并不总是可靠的。总体而言,本研究的重要性在于揭示ChatGPT在医学和牙科领域的准确性,并强调需要进一步研究以提高其性能。© 2017爱思唯尔公司。保留所有权利。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/7f9bbcd776d1/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/45718ffe48e2/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/5610114328f7/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/9a9eeeba9957/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/7f9bbcd776d1/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/45718ffe48e2/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/5610114328f7/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/9a9eeeba9957/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86ea/10746423/7f9bbcd776d1/gr4.jpg

相似文献

1
A systematic review and meta-analysis on ChatGPT and its utilization in medical and dental research.关于ChatGPT及其在医学和牙科研究中的应用的系统评价和荟萃分析。
Heliyon. 2023 Nov 29;9(12):e23050. doi: 10.1016/j.heliyon.2023.e23050. eCollection 2023 Dec.
2
Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis.ChatGPT-3.5 和 GPT-4 在医学、药学、牙科和护理国家执照考试中的表现:系统评价和荟萃分析。
BMC Med Educ. 2024 Sep 16;24(1):1013. doi: 10.1186/s12909-024-05944-8.
3
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
4
Evaluation of ChatGPT-generated medical responses: A systematic review and meta-analysis.评价 ChatGPT 生成的医学回复:系统评价和荟萃分析。
J Biomed Inform. 2024 Mar;151:104620. doi: 10.1016/j.jbi.2024.104620. Epub 2024 Mar 8.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.ChatGPT 在临床医学研究生入学考试中的表现:调查研究。
JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.
7
ChatGPT's performance in dentistry and allergyimmunology assessments: a comparative study.ChatGPT 在牙科和过敏免疫评估中的表现:一项比较研究。
Swiss Dent J. 2023 Oct 4;134(2):1-17. doi: 10.61872/sdj-2024-06-01.
8
ChatGPT's performance in dentistry and allergy-immunology assessments: a comparative study.ChatGPT在牙科和过敏免疫学评估中的表现:一项比较研究。
Swiss Dent J. 2023 Oct 6;134(5).
9
Assessing question characteristic influences on ChatGPT's performance and response-explanation consistency: Insights from Taiwan's Nursing Licensing Exam.评估问题特征对 ChatGPT 表现和回应解释一致性的影响:来自台湾护理执照考试的见解。
Int J Nurs Stud. 2024 May;153:104717. doi: 10.1016/j.ijnurstu.2024.104717. Epub 2024 Feb 8.
10
Implications of ChatGPT in Public Health Dentistry: A Systematic Review.ChatGPT在公共卫生牙科中的应用:一项系统综述。
Cureus. 2023 Jun 13;15(6):e40367. doi: 10.7759/cureus.40367. eCollection 2023 Jun.

引用本文的文献

1
Evaluation of the accuracy of ChatGPT-4 and Gemini's responses to the World Dental Federation's frequently asked questions on oral health.评估ChatGPT-4和Gemini对世界牙科联盟关于口腔健康常见问题的回答的准确性。
BMC Oral Health. 2025 Aug 2;25(1):1293. doi: 10.1186/s12903-025-06624-9.
2
Exploring the Application Capability of ChatGPT as an Instructor in Skills Education for Dental Medical Students: Randomized Controlled Trial.探索ChatGPT作为牙科医学生技能教育指导者的应用能力:随机对照试验。
J Med Internet Res. 2025 May 27;27:e68538. doi: 10.2196/68538.
3
Impact of large language model (ChatGPT) in healthcare: an umbrella review and evidence synthesis.

本文引用的文献

1
ChatGPT applications in medical, dental, pharmacy, and public health education: A descriptive study highlighting the advantages and limitations.ChatGPT在医学、牙科、药学和公共卫生教育中的应用:一项突出优势与局限的描述性研究。
Narra J. 2023 Apr;3(1):e103. doi: 10.52225/narra.v3i1.103. Epub 2023 Mar 29.
2
ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports.ChatGPT 让医学文献通俗易懂:简化放射学报告的探索性案例研究。
Eur Radiol. 2024 May;34(5):2817-2825. doi: 10.1007/s00330-023-10213-1. Epub 2023 Oct 5.
3
Assessing the Utility of ChatGPT Throughout the Entire Clinical Workflow: Development and Usability Study.
大语言模型(ChatGPT)在医疗保健领域的影响:一项综述与证据综合
J Biomed Sci. 2025 May 7;32(1):45. doi: 10.1186/s12929-025-01131-z.
4
Accuracy of Large Language Models When Answering Clinical Research Questions: Systematic Review and Network Meta-Analysis.大型语言模型回答临床研究问题的准确性:系统评价与网络荟萃分析
J Med Internet Res. 2025 Apr 30;27:e64486. doi: 10.2196/64486.
5
Transforming dental diagnostics with artificial intelligence: advanced integration of ChatGPT and large language models for patient care.利用人工智能变革牙科诊断:ChatGPT与大语言模型在患者护理中的深度整合
Front Dent Med. 2025 Jan 6;5:1456208. doi: 10.3389/fdmed.2024.1456208. eCollection 2024.
6
Development and Comparative Evaluation of a Reinstructed GPT-4o Model Specialized in Periodontology.一种经重新训练的牙周病学专用GPT-4o模型的开发与比较评估
J Clin Periodontol. 2025 May;52(5):707-716. doi: 10.1111/jcpe.14101. Epub 2024 Dec 26.
7
Evaluation of Artificial Intelligence as a Search Tool for Patients: Can ChatGPT-4 Provide Accurate Evidence-Based Orthodontic-Related Information?评估人工智能作为患者搜索工具的效果:ChatGPT-4能否提供准确的循证正畸相关信息?
Cureus. 2024 Jul 31;16(7):e65820. doi: 10.7759/cureus.65820. eCollection 2024 Jul.
8
Toward Clinical Generative AI: Conceptual Framework.迈向临床生成式人工智能:概念框架
JMIR AI. 2024 Jun 7;3:e55957. doi: 10.2196/55957.
9
Evaluating the accuracy of Chat Generative Pre-trained Transformer version 4 (ChatGPT-4) responses to United States Food and Drug Administration (FDA) frequently asked questions about dental amalgam.评估 Chat Generative Pre-trained Transformer 版本 4(ChatGPT-4)对美国食品和药物管理局(FDA)关于牙银合金常见问题的回答的准确性。
BMC Oral Health. 2024 May 24;24(1):605. doi: 10.1186/s12903-024-04358-8.
10
Building Trustworthy Generative Artificial Intelligence for Diabetes Care and Limb Preservation: A Medical Knowledge Extraction Case.为糖尿病护理和肢体保全构建可信的生成式人工智能:一个医学知识提取案例。
J Diabetes Sci Technol. 2024 May 20:19322968241253568. doi: 10.1177/19322968241253568.
评估 ChatGPT 在整个临床工作流程中的效用:开发和可用性研究。
J Med Internet Res. 2023 Aug 22;25:e48659. doi: 10.2196/48659.
4
Exploring the future of nursing: Insights from the ChatGPT model.探索护理的未来:来自ChatGPT模型的见解。
Belitung Nurs J. 2023 Feb 12;9(1):1-5. doi: 10.33546/bnj.2551. eCollection 2023.
5
Chatbot vs Medical Student Performance on Free-Response Clinical Reasoning Examinations.聊天机器人与医学生在自由应答临床推理考试中的表现对比
JAMA Intern Med. 2023 Sep 1;183(9):1028-1030. doi: 10.1001/jamainternmed.2023.2909.
6
A step-by-step researcher's guide to the use of an AI-based transformer in epidemiology: an exploratory analysis of ChatGPT using the STROBE checklist for observational studies.研究人员使用基于人工智能的变换器进行流行病学研究的分步指南:使用观察性研究的STROBE清单对ChatGPT进行探索性分析
Z Gesundh Wiss. 2023 May 26:1-36. doi: 10.1007/s10389-023-01936-y.
7
Evaluating GPT as an Adjunct for Radiologic Decision Making: GPT-4 Versus GPT-3.5 in a Breast Imaging Pilot.评估 GPT 作为放射学决策辅助工具:GPT-4 与 GPT-3.5 在乳腺成像试点中的比较。
J Am Coll Radiol. 2023 Oct;20(10):990-997. doi: 10.1016/j.jacr.2023.05.003. Epub 2023 Jun 21.
8
Analysis of large-language model versus human performance for genetics questions.大语言模型与人类在遗传学问题表现上的分析。
Eur J Hum Genet. 2024 Apr;32(4):466-468. doi: 10.1038/s41431-023-01396-8. Epub 2023 May 29.
9
Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma.评估 ChatGPT 在回答肝硬化和肝细胞癌相关问题方面的表现。
Clin Mol Hepatol. 2023 Jul;29(3):721-732. doi: 10.3350/cmh.2023.0089. Epub 2023 Mar 22.
10
Expanding Cosmetic Plastic Surgery Research With ChatGPT.利用 ChatGPT 拓展美容整形外科学研究。
Aesthet Surg J. 2023 Jul 15;43(8):930-937. doi: 10.1093/asj/sjad069.