• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对偏头痛患者教育的最先进大型语言模型基准测试:对常见查询的响应性能比较。

Benchmarking State-of-the-Art Large Language Models for Migraine Patient Education: Performance Comparison of Responses to Common Queries.

机构信息

Department of Neurology, The Affiliated Hospital of Qingdao University, Qingdao, China.

Department of Internal Medicine, The Affiliated Hospital of Qingdao University, Qingdao, China.

出版信息

J Med Internet Res. 2024 Jul 23;26:e55927. doi: 10.2196/55927.

DOI:10.2196/55927
PMID:38828692
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11303883/
Abstract

This study assessed the potential of large language models (OpenAI's ChatGPT 3.5 and 4.0, Google Bard, Meta Llama2, and Anthropic Claude2) in addressing 30 common migraine-related queries, providing a foundation to advance artificial intelligence-assisted patient education and insights for a holistic approach to migraine management.

摘要

本研究评估了大型语言模型(OpenAI 的 ChatGPT 3.5 和 4.0、Google Bard、Meta Llama2 和 Anthropic Claude2)在回答 30 个常见偏头痛相关问题方面的潜力,为推进人工智能辅助患者教育以及整体偏头痛管理方法提供了基础和见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7313/11303883/47d2df65c858/jmir_v26i1e55927_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7313/11303883/47d2df65c858/jmir_v26i1e55927_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7313/11303883/47d2df65c858/jmir_v26i1e55927_fig1.jpg

相似文献

1
Benchmarking State-of-the-Art Large Language Models for Migraine Patient Education: Performance Comparison of Responses to Common Queries.针对偏头痛患者教育的最先进大型语言模型基准测试:对常见查询的响应性能比较。
J Med Internet Res. 2024 Jul 23;26:e55927. doi: 10.2196/55927.
2
Evaluating Large Language Models for the National Premedical Exam in India: Comparative Analysis of GPT-3.5, GPT-4, and Bard.评估印度全国医预考用大型语言模型:GPT-3.5、GPT-4 和 Bard 的比较分析。
JMIR Med Educ. 2024 Feb 21;10:e51523. doi: 10.2196/51523.
3
Benchmarking large language models' performances for myopia care: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and Google Bard.比较分析 ChatGPT-3.5、ChatGPT-4.0 和谷歌巴德在近视防控方面的表现:大型语言模型的基准测试。
EBioMedicine. 2023 Sep;95:104770. doi: 10.1016/j.ebiom.2023.104770. Epub 2023 Aug 23.
4
Capacity of Generative AI to Interpret Human Emotions From Visual and Textual Data: Pilot Evaluation Study.生成式人工智能从视觉和文本数据中解读人类情感的能力:初步评估研究。
JMIR Ment Health. 2024 Feb 6;11:e54369. doi: 10.2196/54369.
5
Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.生成式人工智能聊天机器人可能会为患者关于常见血管外科问题提供恰当的信息性回复。
Vascular. 2025 Feb;33(1):229-237. doi: 10.1177/17085381241240550. Epub 2024 Mar 18.
6
Evaluating capabilities of large language models: Performance of GPT-4 on surgical knowledge assessments.评估大语言模型的能力:GPT-4在外科知识评估中的表现。
Surgery. 2024 Apr;175(4):936-942. doi: 10.1016/j.surg.2023.12.014. Epub 2024 Jan 20.
7
Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions.人工智能在减重手术中的表现:ChatGPT-4、Bing 和 Bard 在《美国代谢与减重外科学会减重手术教科书》减重手术问题中的比较分析。
Surg Obes Relat Dis. 2024 Jul;20(7):609-613. doi: 10.1016/j.soard.2024.04.014. Epub 2024 May 8.
8
The performance of artificial intelligence models in generating responses to general orthodontic questions: ChatGPT vs Google Bard.人工智能模型在生成正畸常见问题回答方面的表现:ChatGPT与谷歌巴德的对比
Am J Orthod Dentofacial Orthop. 2024 Jun;165(6):652-662. doi: 10.1016/j.ajodo.2024.01.012. Epub 2024 Mar 15.
9
Language discrepancies in the performance of generative artificial intelligence models: an examination of infectious disease queries in English and Arabic.生成式人工智能模型在性能方面的语言差异:对英文和阿拉伯文传染病查询的考察。
BMC Infect Dis. 2024 Aug 8;24(1):799. doi: 10.1186/s12879-024-09725-y.
10
Utilizing Artificial Intelligence-Based Tools for Addressing Clinical Queries: ChatGPT Versus Google Gemini.利用基于人工智能的工具解决临床问题:ChatGPT 与 Google Gemini 之比较。
J Nurs Educ. 2024 Aug;63(8):556-559. doi: 10.3928/01484834-20240426-01. Epub 2024 Aug 1.

引用本文的文献

1
Development and Validation of a Large Language Model-Based System for Medical History-Taking Training: Prospective Multicase Study on Evaluation Stability, Human-AI Consistency, and Transparency.基于大语言模型的病史采集训练系统的开发与验证:关于评估稳定性、人机一致性和透明度的前瞻性多案例研究
JMIR Med Educ. 2025 Aug 29;11:e73419. doi: 10.2196/73419.
2
The Emerging Clinical Relevance of Artificial Intelligence, Data Science, and Wearable Devices in Headache: A Narrative Review.人工智能、数据科学和可穿戴设备在头痛领域中新兴的临床相关性:一篇叙述性综述。
Life (Basel). 2025 Jun 4;15(6):909. doi: 10.3390/life15060909.
3
AI in Home Care-Evaluation of Large Language Models for Future Training of Informal Caregivers: Observational Comparative Case Study.

本文引用的文献

1
Benchmarking large language models' performances for myopia care: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and Google Bard.比较分析 ChatGPT-3.5、ChatGPT-4.0 和谷歌巴德在近视防控方面的表现:大型语言模型的基准测试。
EBioMedicine. 2023 Sep;95:104770. doi: 10.1016/j.ebiom.2023.104770. Epub 2023 Aug 23.
2
Large language models encode clinical knowledge.大语言模型编码临床知识。
Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.
3
Diagnostic accuracy of an artificial intelligence online engine in migraine: A multi-center study.
家庭护理中的人工智能——对用于未来非正式护理人员培训的大语言模型的评估:观察性比较案例研究
J Med Internet Res. 2025 Apr 28;27:e70703. doi: 10.2196/70703.
4
Large language models in patient education: a scoping review of applications in medicine.用于患者教育的大语言模型:医学应用的范围综述
Front Med (Lausanne). 2024 Oct 29;11:1477898. doi: 10.3389/fmed.2024.1477898. eCollection 2024.
5
The Role of Artificial Intelligence in the Primary Prevention of Common Musculoskeletal Diseases.人工智能在常见肌肉骨骼疾病一级预防中的作用
Cureus. 2024 Jul 25;16(7):e65372. doi: 10.7759/cureus.65372. eCollection 2024 Jul.
人工智能在线引擎在偏头痛诊断中的准确性:一项多中心研究。
Headache. 2022 Jul;62(7):870-882. doi: 10.1111/head.14324. Epub 2022 Jun 3.
4
Diagnosis and Management of Headache: A Review.头痛的诊断与管理:综述
JAMA. 2021 May 11;325(18):1874-1885. doi: 10.1001/jama.2021.1640.
5
Migraine: epidemiology and systems of care.偏头痛:流行病学和照护体系。
Lancet. 2021 Apr 17;397(10283):1485-1495. doi: 10.1016/S0140-6736(20)32160-7. Epub 2021 Mar 25.
6
Migraine: integrated approaches to clinical management and emerging treatments.偏头痛:临床管理的综合方法和新兴治疗方法。
Lancet. 2021 Apr 17;397(10283):1505-1518. doi: 10.1016/S0140-6736(20)32342-4. Epub 2021 Mar 25.
7
Burden of Neurological Disorders Across the US From 1990-2017: A Global Burden of Disease Study.《1990-2017 年美国神经障碍疾病负担:全球疾病负担研究》
JAMA Neurol. 2021 Feb 1;78(2):165-176. doi: 10.1001/jamaneurol.2020.4152.
8
Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019.204 个国家和地区 1990-2019 年 369 种疾病和伤害导致的全球负担:2019 年全球疾病负担研究的系统分析。
Lancet. 2020 Oct 17;396(10258):1204-1222. doi: 10.1016/S0140-6736(20)30925-9.
9
Headache Classification Committee of the International Headache Society (IHS) The International Classification of Headache Disorders, 3rd edition.国际头痛协会(IHS)头痛分类委员会《国际头痛疾病分类》第三版
Cephalalgia. 2018 Jan;38(1):1-211. doi: 10.1177/0333102417738202.