• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估各种人工智能应用在应对牙髓外科技术问题方面的表现。

Assessment of various artificial intelligence applications in responding to technical questions in endodontic surgery.

作者信息

Baris Sevda Durust, Baris Kubilay

机构信息

Kırıkkale University, Kırıkkale, Turkey.

出版信息

BMC Oral Health. 2025 May 22;25(1):763. doi: 10.1186/s12903-025-06149-1.

DOI:10.1186/s12903-025-06149-1
PMID:40405212
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12096613/
Abstract

BACKGROUND

The objective of this study was to evaluate the performance of ScholarGPT, ChatGPT-4o and Google Gemini in responding to queries pertaining to endodontic apical surgery, a subject that demands advanced specialist knowledge in endodontics.

METHODS

A total of 30 questions, including 12 binary and 18 open-ended queries, were formulated based on information on endodontic apical surgery taken from a well-known endodontic book called Cohen's pathways of the pulp (12th edition). The questions were posed by two different researchers using different accounts on the ScholarGPT, ChatGPT-4o and Gemini platforms. The responses were then coded by the researchers and categorised as 'correct', 'incorrect', or 'insufficient'. The Pearson chi-square test was used to assess the relationships between the platforms.

RESULTS

A total of 5,400 responses were evaluated. Chi-square analysis revealed statistically significant differences between the accuracy of the responses provided applications (χ² = 22.61; p < 0.05). ScholarGPT demonstrated the highest rate of correct responses (97.7%), followed by ChatGPT-4o with 90.1%. Conversely, Gemini exhibited the lowest correct response rate (59.5%) among the applications examined.

CONCLUSIONS

ScholarGPT performed better overall on questions about endodontic apical surgery than ChatGPT-4o and Gemini. GPT models based on academic databases, such as ScholarGPT, may provide more accurate information about dentistry. However, additional research should be conducted to develop a GPT model that is specifically tailored to the field of endodontics.

摘要

背景

本研究的目的是评估ScholarGPT、ChatGPT - 4o和谷歌Gemini在回答与牙髓病根尖手术相关问题方面的表现,牙髓病根尖手术这一主题需要牙髓病学方面的高级专业知识。

方法

基于从一本著名的牙髓病学书籍《科恩牙髓通路》(第12版)中获取的牙髓病根尖手术信息,总共制定了30个问题,其中包括12个二元问题和18个开放式问题。这些问题由两名不同的研究人员使用ScholarGPT、ChatGPT - 4o和Gemini平台上的不同账号提出。然后,研究人员对回答进行编码,并分类为“正确”、“错误”或“不充分”。使用Pearson卡方检验来评估各平台之间的关系。

结果

总共评估了5400个回答。卡方分析显示,各应用程序提供的回答准确性之间存在统计学上的显著差异(χ² = 22.61;p < 0.05)。ScholarGPT的正确回答率最高(97.7%),其次是ChatGPT - 4o,为90.1%。相反,在所研究的应用程序中,Gemini的正确回答率最低(59.5%)。

结论

在关于牙髓病根尖手术的问题上,ScholarGPT总体表现优于ChatGPT - 4o和Gemini。基于学术数据库的GPT模型,如ScholarGPT,可能会提供有关牙科更准确的信息。然而,应该进行更多的研究来开发专门针对牙髓病学领域的GPT模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba13/12096613/c944109ddc95/12903_2025_6149_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba13/12096613/c944109ddc95/12903_2025_6149_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba13/12096613/c944109ddc95/12903_2025_6149_Fig1_HTML.jpg

相似文献

1
Assessment of various artificial intelligence applications in responding to technical questions in endodontic surgery.评估各种人工智能应用在应对牙髓外科技术问题方面的表现。
BMC Oral Health. 2025 May 22;25(1):763. doi: 10.1186/s12903-025-06149-1.
2
Performance of 4 Artificial Intelligence Chatbots in Answering Endodontic Questions.4款人工智能聊天机器人回答牙髓病学问题的表现
J Endod. 2025 May;51(5):602-608. doi: 10.1016/j.joen.2025.01.002. Epub 2025 Jan 13.
3
Evaluation of different artificial intelligence applications in responding to regenerative endodontic procedures.不同人工智能应用在应对牙髓再生治疗程序中的评估
BMC Oral Health. 2025 Jan 11;25(1):53. doi: 10.1186/s12903-025-05424-5.
4
A Comparative Analysis of Artificial Intelligence Platforms: ChatGPT-4o and Google Gemini in Answering Questions About Birth Control Methods.人工智能平台的比较分析:ChatGPT-4o与谷歌Gemini在回答避孕方法相关问题方面的表现
Cureus. 2025 Jan 1;17(1):e76745. doi: 10.7759/cureus.76745. eCollection 2025 Jan.
5
Performance of AI-Chatbots to Common Temporomandibular Joint Disorders (TMDs) Patient Queries: Accuracy, Completeness, Reliability and Readability.人工智能聊天机器人对常见颞下颌关节紊乱病(TMDs)患者问题的回答:准确性、完整性、可靠性和可读性。
Orthod Craniofac Res. 2025 May 7. doi: 10.1111/ocr.12939.
6
Accuracy and quality of ChatGPT-4o and Google Gemini performance on image-based neurosurgery board questions.ChatGPT-4o和谷歌Gemini在基于图像的神经外科委员会问题上的表现准确性和质量。
Neurosurg Rev. 2025 Mar 25;48(1):320. doi: 10.1007/s10143-025-03472-7.
7
Comparative performance of artificial intelligence models in rheumatology board-level questions: evaluating Google Gemini and ChatGPT-4o.人工智能模型在风湿病委员会级问题中的比较性能:评估 Google Gemini 和 ChatGPT-4o。
Clin Rheumatol. 2024 Nov;43(11):3507-3513. doi: 10.1007/s10067-024-07154-5. Epub 2024 Sep 28.
8
Comparing diagnostic skills in endodontic cases: dental students versus ChatGPT-4o.比较牙髓病病例中的诊断技能:牙科学生与ChatGPT-4o。
BMC Oral Health. 2025 Mar 29;25(1):457. doi: 10.1186/s12903-025-05857-y.
9
Comparative analysis of ChatGPT-4o mini, ChatGPT-4o and Gemini Advanced in the treatment of postmenopausal osteoporosis.ChatGPT-4o mini、ChatGPT-4o与Gemini Advanced在绝经后骨质疏松症治疗中的对比分析。
BMC Musculoskelet Disord. 2025 Apr 16;26(1):369. doi: 10.1186/s12891-025-08601-3.
10
Evaluating ChatGPT and Google Gemini Performance and Implications in Turkish Dental Education.评估ChatGPT和谷歌Gemini在土耳其牙科教育中的性能及影响
Cureus. 2025 Jan 11;17(1):e77292. doi: 10.7759/cureus.77292. eCollection 2025 Jan.

引用本文的文献

1
AI-HOPE-TP53: A Conversational Artificial Intelligence Agent for Pathway-Centric Analysis of TP53-Driven Molecular Alterations in Early-Onset Colorectal Cancer.AI-HOPE-TP53:一种用于以通路为中心分析早发性结直肠癌中TP53驱动的分子改变的对话式人工智能代理。
Cancers (Basel). 2025 Aug 31;17(17):2865. doi: 10.3390/cancers17172865.

本文引用的文献

1
Evaluation of different artificial intelligence applications in responding to regenerative endodontic procedures.不同人工智能应用在应对牙髓再生治疗程序中的评估
BMC Oral Health. 2025 Jan 11;25(1):53. doi: 10.1186/s12903-025-05424-5.
2
Accuracy and Consistency of Gemini Responses Regarding the Management of Traumatized Permanent Teeth.双子座(Gemini)关于外伤恒牙治疗反应的准确性和一致性
Dent Traumatol. 2025 Apr;41(2):171-177. doi: 10.1111/edt.13004. Epub 2024 Oct 26.
3
ScholarGPT's performance in oral and maxillofacial surgery.
ScholarGPT在口腔颌面外科的表现。
J Stomatol Oral Maxillofac Surg. 2024 Oct 9;126(4):102114. doi: 10.1016/j.jormas.2024.102114.
4
Performance of large language artificial intelligence models on solving restorative dentistry and endodontics student assessments.大型语言人工智能模型在解决修复牙科和牙髓学生评估方面的性能。
Clin Oral Investig. 2024 Oct 7;28(11):575. doi: 10.1007/s00784-024-05968-w.
5
Toward a responsible future: recommendations for AI-enabled clinical decision support.迈向负责任的未来:人工智能支持的临床决策支持的建议。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2730-2739. doi: 10.1093/jamia/ocae209.
6
Performance of large language models in oral and maxillofacial surgery examinations.大型语言模型在口腔颌面外科学考试中的表现。
Int J Oral Maxillofac Surg. 2024 Oct;53(10):881-886. doi: 10.1016/j.ijom.2024.06.003. Epub 2024 Jun 25.
7
Assessment of artificial intelligence applications in responding to dental trauma.评估人工智能在应对牙科创伤中的应用。
Dent Traumatol. 2024 Dec;40(6):722-729. doi: 10.1111/edt.12965. Epub 2024 May 14.
8
Assessing the research landscape and clinical utility of large language models: a scoping review.评估大型语言模型的研究现状和临床实用性:范围综述。
BMC Med Inform Decis Mak. 2024 Mar 12;24(1):72. doi: 10.1186/s12911-024-02459-6.
9
Performance of a commercially available Generative Pre-trained Transformer (GPT) in describing radiolucent lesions in panoramic radiographs and establishing differential diagnoses.商用生成式预训练转换器(GPT)在描述全景片上的透光性病变并建立鉴别诊断中的性能。
Clin Oral Investig. 2024 Mar 9;28(3):204. doi: 10.1007/s00784-024-05587-5.
10
Validity and reliability of artificial intelligence chatbots as public sources of information on endodontics.人工智能聊天机器人作为牙髓学公共信息源的有效性和可靠性。
Int Endod J. 2024 Mar;57(3):305-314. doi: 10.1111/iej.14014. Epub 2023 Dec 20.