• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ScholarGPT在口腔颌面外科的表现。

ScholarGPT's performance in oral and maxillofacial surgery.

作者信息

Balel Yunus

机构信息

Department of Oral and Maxillofacial Surgery, Faculty of Dentistry, Sivas Cumhuriyet University, Sivas 58000, Turkiye.

出版信息

J Stomatol Oral Maxillofac Surg. 2024 Oct 9;126(4):102114. doi: 10.1016/j.jormas.2024.102114.

DOI:10.1016/j.jormas.2024.102114
PMID:39389541
Abstract

OBJECTIVE

The purpose of this study is to evaluate the performance of Scholar GPT in answering technical questions in the field of oral and maxillofacial surgery and to conduct a comparative analysis with the results of a previous study that assessed the performance of ChatGPT.

MATERIALS AND METHODS

Scholar GPT was accessed via ChatGPT (www.chatgpt.com) on March 20, 2024. A total of 60 technical questions (15 each on impacted teeth, dental implants, temporomandibular joint disorders, and orthognathic surgery) from our previous study were used. Scholar GPT's responses were evaluated using a modified Global Quality Scale (GQS). The questions were randomized before scoring using an online randomizer (www.randomizer.org). A single researcher performed the evaluations at three different times, three weeks apart, with each evaluation preceded by a new randomization. In cases of score discrepancies, a fourth evaluation was conducted to determine the final score.

RESULTS

Scholar GPT performed well across all technical questions, with an average GQS score of 4.48 (SD=0.93). Comparatively, ChatGPT's average GQS score in previous study was 3.1 (SD=1.492). The Wilcoxon Signed-Rank Test indicated a statistically significant higher average score for Scholar GPT compared to ChatGPT (Mean Difference = 2.00, SE = 0.163, p < 0.001). The Kruskal-Wallis Test showed no statistically significant differences among the topic groups (χ² = 0.799, df = 3, p = 0.850, ε² = 0.0135).

CONCLUSION

Scholar GPT demonstrated a generally high performance in technical questions within oral and maxillofacial surgery and produced more consistent and higher-quality responses compared to ChatGPT. The findings suggest that GPT models based on academic databases can provide more accurate and reliable information. Additionally, developing a specialized GPT model for oral and maxillofacial surgery could ensure higher quality and consistency in artificial intelligence-generated information.

摘要

目的

本研究旨在评估Scholar GPT在回答口腔颌面外科领域技术问题方面的表现,并与之前评估ChatGPT表现的研究结果进行对比分析。

材料与方法

2024年3月20日通过ChatGPT(www.chatgpt.com)访问Scholar GPT。使用了我们之前研究中的60个技术问题(关于阻生牙、牙种植体、颞下颌关节紊乱和正颌外科各15个)。Scholar GPT的回答使用改良的全球质量量表(GQS)进行评估。在评分前使用在线随机工具(www.randomizer.org)对问题进行随机排序。一名研究人员在三个不同时间进行评估,每次间隔三周,每次评估前都进行新的随机排序。在分数出现差异的情况下,进行第四次评估以确定最终分数。

结果

Scholar GPT在所有技术问题上表现良好,平均GQS得分为4.48(标准差=0.93)。相比之下,ChatGPT在之前研究中的平均GQS得分为3.1(标准差=1.492)。Wilcoxon符号秩检验表明,与ChatGPT相比,Scholar GPT的平均得分在统计学上显著更高(平均差异=2.00,标准误=0.163,p<0.001)。Kruskal-Wallis检验显示各主题组之间在统计学上无显著差异(χ²=0.799,自由度=3,p=0.850,ε²=0.0135)。

结论

Scholar GPT在口腔颌面外科的技术问题上总体表现出色,与ChatGPT相比,产生了更一致、质量更高的回答。研究结果表明,基于学术数据库的GPT模型可以提供更准确可靠的信息。此外,开发专门用于口腔颌面外科的GPT模型可以确保人工智能生成信息的更高质量和一致性。

相似文献

1
ScholarGPT's performance in oral and maxillofacial surgery.ScholarGPT在口腔颌面外科的表现。
J Stomatol Oral Maxillofac Surg. 2024 Oct 9;126(4):102114. doi: 10.1016/j.jormas.2024.102114.
2
New generative artificial intelligence model: ScholarGPT's performance on dental avulsion.新型生成式人工智能模型:ScholarGPT在牙脱位方面的表现。
Int J Med Inform. 2025 Dec;204:106080. doi: 10.1016/j.ijmedinf.2025.106080. Epub 2025 Aug 13.
3
Performance of ChatGPT Across Different Versions in Medical Licensing Examinations Worldwide: Systematic Review and Meta-Analysis.ChatGPT 在全球医学执照考试不同版本中的表现:系统评价和荟萃分析。
J Med Internet Res. 2024 Jul 25;26:e60807. doi: 10.2196/60807.
4
Can ChatGPT-4o provide new systematic review ideas to oral and maxillofacial surgeons?ChatGPT-4 能否为口腔颌面外科医生提供新的系统评价思路?
J Stomatol Oral Maxillofac Surg. 2024 Oct;125(5S2):101979. doi: 10.1016/j.jormas.2024.101979. Epub 2024 Jul 26.
5
Potential of ChatGPT in youth mental health emergency triage: Comparative analysis with clinicians.ChatGPT在青少年心理健康紧急分诊中的潜力:与临床医生的比较分析
PCN Rep. 2025 Jul 15;4(3):e70159. doi: 10.1002/pcn5.70159. eCollection 2025 Sep.
6
Evaluating ChatGPT's Utility in Biologic Therapy for Systemic Lupus Erythematosus: Comparative Study of ChatGPT and Google Web Search.评估ChatGPT在系统性红斑狼疮生物治疗中的效用:ChatGPT与谷歌网络搜索的比较研究
JMIR Form Res. 2025 Aug 28;9:e76458. doi: 10.2196/76458.
7
Optimizing patient education for radioactive iodine therapy and the role of ChatGPT incorporating chain-of-thought technique: ChatGPT questionnaire.优化放射性碘治疗的患者教育以及结合思维链技术的ChatGPT的作用:ChatGPT问卷
Digit Health. 2025 Jul 7;11:20552076251357468. doi: 10.1177/20552076251357468. eCollection 2025 Jan-Dec.
8
[Preliminary exploration of the applications of five large language models in the field of oral auxiliary diagnosis, treatment and health consultation].五种大语言模型在口腔辅助诊断、治疗及健康咨询领域的应用初探
Zhonghua Kou Qiang Yi Xue Za Zhi. 2025 Jul 30;60(8):871-878. doi: 10.3760/cma.j.cn112144-20241107-00418.
9
Thyroid Eye Disease and Artificial Intelligence: A Comparative Study of ChatGPT-3.5, ChatGPT-4o, and Gemini in Patient Information Delivery.甲状腺眼病与人工智能:ChatGPT-3.5、ChatGPT-4o和Gemini在患者信息传递方面的比较研究
Ophthalmic Plast Reconstr Surg. 2024 Dec 24. doi: 10.1097/IOP.0000000000002882.
10
Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis.ChatGPT-3.5 和 GPT-4 在医学、药学、牙科和护理国家执照考试中的表现:系统评价和荟萃分析。
BMC Med Educ. 2024 Sep 16;24(1):1013. doi: 10.1186/s12909-024-05944-8.

引用本文的文献

1
Deep learning-based approach to third molar impaction analysis with clinical classifications.基于深度学习的第三磨牙阻生分析及临床分类方法。
Sci Rep. 2025 Jul 3;15(1):23688. doi: 10.1038/s41598-025-93783-y.
2
Assessment of various artificial intelligence applications in responding to technical questions in endodontic surgery.评估各种人工智能应用在应对牙髓外科技术问题方面的表现。
BMC Oral Health. 2025 May 22;25(1):763. doi: 10.1186/s12903-025-06149-1.
3
Chat Generative Pre-Trained Transformer (ChatGPT) in Oral and Maxillofacial Surgery: A Narrative Review on Its Research Applications and Limitations.
口腔颌面外科中的聊天生成预训练变换器(ChatGPT):关于其研究应用和局限性的叙述性综述
J Clin Med. 2025 Feb 18;14(4):1363. doi: 10.3390/jcm14041363.