• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

聊天机器人在光化学基本概念相关查询中的表现。

Performance of chatbots in queries concerning fundamental concepts in photochemistry.

作者信息

Taniguchi Masahiko, Lindsey Jonathan S

机构信息

Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA.

出版信息

Photochem Photobiol. 2024 Nov 4. doi: 10.1111/php.14037.

DOI:10.1111/php.14037
PMID:39496555
Abstract

The advent of chatbots raises the possibility of a paradigm shift across society including the most technical of fields with regard to access to information, generation of knowledge, and dissemination of education and training. Photochemistry is a scientific endeavor with roots in chemistry and physics and branches that encompass diverse disciplines ranging from astronomy to zoology. Here, five chatbots have each been challenged with 13 photochemically relevant queries. The chatbots included ChatGPT 3.5, ChatGPT 4.0, Copilot, Gemini Advanced, and Meta AI. The queries encompassed fundamental concepts (e.g., "Why is the fluorescence spectrum typically the mirror image of the absorption spectrum?"), practical matters (e.g., "What is the inner filter effect and how to avoid it?"), philosophical matters ("Please create the most important photochemistry questions."), and specific molecular features (e.g., "Why are azo dyes non-fluorescent?"). The chatbots were moderately effective in answering queries concerning fundamental concepts in photochemistry but were glaringly deficient in specialized queries for dyes and fluorophores. In some instances, a correct response was embedded in verbose scientific nonsense whereas in others the entire response, while grammatically correct, was utterly meaningless. The unreliable accuracy makes present chatbots poorly suited for unaided educational purposes and highlights the importance of domain experts.

摘要

聊天机器人的出现引发了整个社会范式转变的可能性,包括在获取信息、知识生成以及教育和培训传播等方面的最具技术性的领域。光化学是一项源于化学和物理学的科学事业,其分支涵盖了从天文学到动物学等不同学科。在此,五个聊天机器人分别接受了13个与光化学相关的问题的挑战。这些聊天机器人包括ChatGPT 3.5、ChatGPT 4.0、Copilot、Gemini Advanced和Meta AI。问题涵盖基本概念(例如,“为什么荧光光谱通常是吸收光谱的镜像?”)、实际问题(例如,“什么是内滤光效应以及如何避免它?”)、哲学问题(“请提出最重要的光化学问题。”)以及特定分子特征(例如,“为什么偶氮染料不发荧光?”)。聊天机器人在回答有关光化学基本概念的问题时表现一般,但在关于染料和荧光团的专业问题上明显不足。在某些情况下,正确答案夹杂在冗长的科学废话中,而在其他情况下,整个回答虽然语法正确,但完全没有意义。其准确性不可靠,使得当前的聊天机器人不太适合独立的教育目的,并凸显了领域专家的重要性。

相似文献

1
Performance of chatbots in queries concerning fundamental concepts in photochemistry.聊天机器人在光化学基本概念相关查询中的表现。
Photochem Photobiol. 2024 Nov 4. doi: 10.1111/php.14037.
2
Accuracy and Reliability of Artificial Intelligence Chatbots as Public Information Sources in Implant Dentistry.人工智能聊天机器人作为种植牙科公共信息来源的准确性和可靠性
Int J Oral Maxillofac Implants. 2025 Jun 25;0(0):1-23. doi: 10.11607/jomi.11280.
3
Evaluating the Performance of State-of-the-Art Artificial Intelligence Chatbots Based on the WHO Global Guidelines for the Prevention of Surgical Site Infection: Cross-Sectional Study.基于世界卫生组织预防手术部位感染全球指南评估最先进的人工智能聊天机器人的性能:横断面研究
J Med Internet Res. 2025 Jul 31;27:e75567. doi: 10.2196/75567.
4
Comparative Performance of Chatbots in Endodontic Clinical Decision Support: A 4-Day Accuracy and Consistency Study.聊天机器人在牙髓病临床决策支持中的比较性能:一项为期4天的准确性和一致性研究。
Int Dent J. 2025 Jul 27;75(5):100920. doi: 10.1016/j.identj.2025.100920.
5
Accuracy of ChatGPT-3.5, ChatGPT-4o, Copilot, Gemini, Claude, and Perplexity in advising on lumbosacral radicular pain against clinical practice guidelines: cross-sectional study.ChatGPT-3.5、ChatGPT-4o、Copilot、Gemini、Claude和Perplexity在依据临床实践指南对腰骶神经根性疼痛提供建议方面的准确性:横断面研究
Front Digit Health. 2025 Jun 27;7:1574287. doi: 10.3389/fdgth.2025.1574287. eCollection 2025.
6
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
7
Evaluating the Potential of AI Chatbots in Treatment Decision-making for Acquired Bilateral Vocal Fold Paralysis in Adults.评估人工智能聊天机器人在成人获得性双侧声带麻痹治疗决策中的潜力。
J Voice. 2024 Apr 6. doi: 10.1016/j.jvoice.2024.02.020.
8
Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis.用于计算局部麻醉药最大安全剂量的3种对话式生成人工智能模型的性能:比较分析
JMIR AI. 2025 May 13;4:e66796. doi: 10.2196/66796.
9
Comparative performance of ChatGPT, Gemini, and final-year emergency medicine clerkship students in answering multiple-choice questions: implications for the use of AI in medical education.ChatGPT、Gemini与急诊医学实习最后一年学生在回答多项选择题方面的表现比较:人工智能在医学教育中的应用启示
Int J Emerg Med. 2025 Aug 7;18(1):146. doi: 10.1186/s12245-025-00949-6.
10
Evaluating the readability, quality, and reliability of responses generated by ChatGPT, Gemini, and Perplexity on the most commonly asked questions about Ankylosing spondylitis.评估ChatGPT、Gemini和Perplexity针对强直性脊柱炎最常见问题生成的回答的可读性、质量和可靠性。
PLoS One. 2025 Jun 18;20(6):e0326351. doi: 10.1371/journal.pone.0326351. eCollection 2025.

引用本文的文献

1
Assessing the Accuracy of ChatGPT in Answering Questions About Prolonged Disorders of Consciousness.评估ChatGPT回答关于长期意识障碍问题的准确性。
Brain Sci. 2025 Apr 13;15(4):392. doi: 10.3390/brainsci15040392.

本文引用的文献

1
Performance of ChatGPT Across Different Versions in Medical Licensing Examinations Worldwide: Systematic Review and Meta-Analysis.ChatGPT 在全球医学执照考试不同版本中的表现:系统评价和荟萃分析。
J Med Internet Res. 2024 Jul 25;26:e60807. doi: 10.2196/60807.
2
Evaluation of ChatGPT's responses to information needs and information seeking of dementia patients.评估 ChatGPT 对痴呆症患者信息需求和信息检索的响应。
Sci Rep. 2024 May 4;14(1):10273. doi: 10.1038/s41598-024-61068-5.
3
Assessing ChatGPT 4.0's test performance and clinical diagnostic accuracy on USMLE STEP 2 CK and clinical case reports.
评估ChatGPT 4.0在美国医师执照考试第二步临床知识考试(USMLE STEP 2 CK)及临床病例报告中的测试表现和临床诊断准确性。
Sci Rep. 2024 Apr 23;14(1):9330. doi: 10.1038/s41598-024-58760-x.
4
Response accuracy of ChatGPT 3.5 Copilot and Gemini in interpreting biochemical laboratory data a pilot study.ChatGPT 3.5 Copilot 和 Gemini 解读生化实验室数据的反应准确性:一项初步研究。
Sci Rep. 2024 Apr 8;14(1):8233. doi: 10.1038/s41598-024-58964-1.
5
ChatGPT applications in medical, dental, pharmacy, and public health education: A descriptive study highlighting the advantages and limitations.ChatGPT在医学、牙科、药学和公共卫生教育中的应用:一项突出优势与局限的描述性研究。
Narra J. 2023 Apr;3(1):e103. doi: 10.52225/narra.v3i1.103. Epub 2023 Mar 29.
6
Empirical assessment of ChatGPT's answering capabilities in natural science and engineering.ChatGPT在自然科学与工程领域回答能力的实证评估。
Sci Rep. 2024 Feb 29;14(1):4998. doi: 10.1038/s41598-024-54936-7.
7
Evaluating AI in medicine: a comparative analysis of expert and ChatGPT responses to colorectal cancer questions.评估医学中的人工智能:专家与 ChatGPT 对结直肠癌问题回答的比较分析。
Sci Rep. 2024 Feb 3;14(1):2840. doi: 10.1038/s41598-024-52853-3.
8
ChatGPT in the Material Design: Selected Case Studies to Assess the Potential of ChatGPT.材料设计中的ChatGPT:评估ChatGPT潜力的精选案例研究
J Chem Inf Model. 2024 Feb 12;64(3):799-811. doi: 10.1021/acs.jcim.3c01702. Epub 2024 Jan 18.
9
ChatGPT in Drug Discovery: A Case Study on Anticocaine Addiction Drug Development with Chatbots.ChatGPT 在药物研发中的应用:基于聊天机器人的抗可卡因成瘾药物开发案例研究。
J Chem Inf Model. 2023 Nov 27;63(22):7189-7209. doi: 10.1021/acs.jcim.3c01429. Epub 2023 Nov 13.
10
The Potential of GPT-4 as a Support Tool for Pharmacists: Analytical Study Using the Japanese National Examination for Pharmacists.GPT-4作为药剂师辅助工具的潜力:使用日本药剂师国家考试的分析研究
JMIR Med Educ. 2023 Oct 30;9:e48452. doi: 10.2196/48452.