A large language model in solving primary healthcare issues: A potential implication for remote healthcare and medical education.

Authors

Mondal Himel, De Rajesh, Mondal Shaikat, Juhi Ayesha

Affiliations

Department of Physiology, All India Institute of Medical Sciences, Deoghar, Jharkhand, India.

Department of Community Medicine, Malda Medical College and Hospital, Malda, West Bengal, India.

Publication information

J Educ Health Promot. 2024 Sep 28;13:362. doi: 10.4103/jehp.jehp_688_23. eCollection 2024.

DOI: 10.4103/jehp.jehp_688_23
PMID: 39679030
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11639534/

Abstract

BACKGROUND AND AIM

Access to quality health care is essential, particularly in remote areas where the availability of healthcare professionals may be limited. Advances in artificial intelligence (AI) and natural language processing (NLP) have led to the development of large language models (LLMs) that can understand and generate human-like text. This study aimed to evaluate the performance of an LLM, ChatGPT, in addressing primary healthcare issues.

MATERIALS AND METHODS

This study was conducted in May 2023 using the May 12 version of ChatGPT. A total of 30 multiple-choice questions (MCQs) related to primary health care were selected to test the proficiency of ChatGPT. These MCQs covered various topics commonly encountered in primary healthcare practice. ChatGPT answered each question in two parts: the single best answer to the MCQ, and supporting text justifying that answer. The answers to the MCQs were compared with the predefined answer keys. The justifications were rated by two primary healthcare professionals on a 5-point Likert-type scale. The data were presented as numbers and percentages.

RESULTS

Of the 30 questions, ChatGPT answered 28 correctly, yielding an accuracy of 93.33%. The mean score for the explanations supporting the answers was 4.58 ± 0.85. The inter-item correlation was 0.896, and the average-measure intraclass correlation coefficient (ICC) was 0.94 (95% confidence interval 0.88 to 0.97), indicating a high level of interobserver agreement.

CONCLUSION

LLMs such as ChatGPT show promising potential in addressing primary healthcare issues. The high accuracy achieved by ChatGPT in answering primary healthcare-related MCQs underscores the value of these models as resources for patients and healthcare providers in remote healthcare settings. Such models can also support self-directed learning by medical students.
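
The accuracy reported in the Results can be reproduced with simple arithmetic, and an average-measure ICC can be computed from a table of item-by-rater scores. The sketch below is illustrative only. The abstract does not state which ICC model was used, so the function assumes a two-way, consistency, average-measures form (often labelled ICC(3,k)). The function names are hypothetical, and no rating data from the study are included.

```python
from statistics import mean


def accuracy_percent(correct: int, total: int) -> float:
    """Percentage of correct answers, rounded to two decimals."""
    return round(100 * correct / total, 2)


def icc_average_consistency(ratings: list[list[float]]) -> float:
    """Average-measures consistency ICC, assumed form ICC(3,k).

    ratings[i][j] is the score that rater j gave to item i.
    Computed from two-way ANOVA mean squares: (MS_rows - MS_error) / MS_rows.
    """
    n = len(ratings)        # number of rated items
    k = len(ratings[0])     # number of raters
    grand = mean(x for row in ratings for x in row)
    row_means = [mean(row) for row in ratings]
    col_means = [mean(row[j] for row in ratings) for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / ms_rows


# 28 correct answers out of 30 questions, as reported in the Results.
print(accuracy_percent(28, 30))  # 93.33
```

To apply `icc_average_consistency`, pass one row per question and one column per rater, using the raters' actual Likert scores. A different ICC model (for example, absolute agreement) would give a different value from the same scores.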


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9051/11639534/62fd0f84d020/JEHP-13-362-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9051/11639534/70adc4349715/JEHP-13-362-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9051/11639534/92b33e1072ca/JEHP-13-362-g002.jpg

Cited by

1
Evaluation of Three Large Language Models' Response Performances to Inquiries Regarding Post-Abortion Care in the Context of Chinese Language: A Comparative Analysis.
Risk Manag Healthc Policy. 2025 Aug 18;18:2731-2741. doi: 10.2147/RMHP.S531777. eCollection 2025.
2
Empowering standardized residency training in China through large language models: problem analysis and solutions.
Ann Med. 2025 Dec;57(1):2516695. doi: 10.1080/07853890.2025.2516695. Epub 2025 Jul 15.
3
Unveiling the Potential of Large Language Models in Transforming Chronic Disease Management: Mixed Methods Systematic Review.
J Med Internet Res. 2025 Apr 16;27:e70535. doi: 10.2196/70535.
