
Enhancing perinatal health patient information through ChatGPT - An accuracy study.

Authors

de Vries P L M, Baud D, Baggio S, Ceulemans M, Favre G, Gerbier E, Legardeur H, Maisonneuve E, Pena-Reyes C, Pomar L, Winterfeld U, Panchaud A

Affiliations

Department of Gynecology and Obstetrics, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland.

Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland.

Publication information

PEC Innov. 2025 Feb 10;6:100381. doi: 10.1016/j.pecinn.2025.100381. eCollection 2025 Jun.

DOI:10.1016/j.pecinn.2025.100381
PMID:40028463
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11872132/
Abstract

OBJECTIVES

To evaluate ChatGPT's accuracy as an information source for women and maternity-care workers on "nutrition" and "red flags" in pregnancy.

METHODS

Eight raters assessed the accuracy of ChatGPT-generated recommendations on a 5-point Likert scale for ten indicators per topic in four languages (French, English, German and Dutch). Accuracy and inter-rater agreement were calculated per topic and language.
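As a rough illustration of how Likert ratings of this kind can be summarized, the sketch below computes a median, an interquartile range, and the share of ratings at or above 4. The ratings, the function name `summarize`, and the "4 or higher counts as accurate" threshold are all assumptions made for illustration; the abstract does not state how the study derived its overall accuracy percentages or which agreement statistic it used.

```python
from statistics import median, quantiles

def summarize(scores, threshold=4):
    """Summarize a list of 1-5 Likert ratings.

    `threshold` is an illustrative assumption, not the study's definition
    of an accurate answer.
    """
    # Quartiles with the inclusive method (data treated as the full sample).
    q1, _, q3 = quantiles(scores, n=4, method="inclusive")
    share = sum(s >= threshold for s in scores) / len(scores)
    return {"median": median(scores), "iqr": (q1, q3), "share_accurate": share}

# Hypothetical ratings: eight raters scoring two indicators for one topic/language.
ratings = {
    "indicator_1": [5, 5, 4, 5, 5, 4, 5, 5],
    "indicator_2": [4, 5, 3, 5, 4, 4, 5, 5],
}

# Pool all ratings for the topic, then summarize them together.
pooled = [score for scores in ratings.values() for score in scores]
summary = summarize(pooled)
print(summary)
```

Pooling ratings across indicators is one of several reasonable choices; summarizing per indicator first and then aggregating would give a different view of the same data.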

RESULTS

For both topics, median accuracy scores of ChatGPT-generated recommendations were excellent (5.0; IQR 4-5), independent of language. Median accuracy scores varied by at most 1 point on the 5-point Likert scale depending on how the question was framed. Overall accuracy scores were 83-89% for 'nutrition in pregnancy' versus 96-98% for 'red flags in pregnancy'. Inter-rater agreement was good to excellent for both topics.

CONCLUSION

Although ChatGPT generated accurate recommendations regarding the tested indicators for nutrition and red flags during pregnancy, women should be aware of ChatGPT's limitations such as inconsistencies according to formulation, language and the woman's personal context.

INNOVATION

Despite growing interest in the potential use of artificial intelligence in healthcare, this is, to the best of our knowledge, the first study to assess potential limitations, such as language and question framing, that may affect the accuracy of ChatGPT-generated recommendations in key domains of perinatal health.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/acd8c068c642/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/bea46f9b6eed/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/3b41b9864bb0/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/acd8c068c642/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/bea46f9b6eed/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/3b41b9864bb0/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2be8/11872132/acd8c068c642/gr3.jpg
