Evaluating the validity of ChatGPT responses on common obstetric issues: Potential clinical applications and implications.

Affiliations

Department of Obstetrics and Gynecology, Shaare Zedek Medical Center, Affiliated with the Hebrew University School of Medicine, Jerusalem, Israel.

Division of Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, Hamilton Health Sciences, McMaster University, Hamilton, Ontario, Canada.

Publication information

Int J Gynaecol Obstet. 2024 Sep;166(3):1127-1133. doi: 10.1002/ijgo.15501. Epub 2024 Mar 25.

DOI:10.1002/ijgo.15501
PMID:38523565
Abstract

OBJECTIVE

To evaluate the quality of ChatGPT responses to common issues in obstetrics and assess its ability to provide reliable responses to pregnant individuals. The study aimed to examine the responses based on expert opinions using predetermined criteria, including "accuracy," "completeness," and "safety."

METHODS

We curated 15 common and potentially clinically significant questions that pregnant women ask. Two native English-speaking women were asked to reframe the questions in their own words, and we used the ChatGPT language model to generate responses to the questions. To evaluate the accuracy, completeness, and safety of the ChatGPT-generated responses, we developed a questionnaire with a scale of 1 to 5 and invited obstetrics and gynecology experts from different countries to rate the responses. The ratings were analyzed to evaluate the average level of agreement and the percentage of positive ratings (≥4) for each criterion.
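The rating analysis described above can be illustrated with a minimal Python sketch. It is not the authors' analysis code. It assumes that a question counts as "positive" when the mean of its expert ratings is at least 4, which the abstract does not state explicitly. The function name `summarize` and the sample ratings are hypothetical.

```python
from statistics import mean

POSITIVE_THRESHOLD = 4  # the abstract treats a rating of >= 4 as positive


def summarize(ratings_by_question):
    """Return (overall mean rating, percent of questions rated positive).

    Assumption: a question is "positive" when the mean of its expert
    ratings is >= POSITIVE_THRESHOLD. The abstract does not spell this out.
    """
    question_means = [mean(r) for r in ratings_by_question.values()]
    overall = mean(question_means)
    positive = sum(m >= POSITIVE_THRESHOLD for m in question_means)
    return overall, 100 * positive / len(question_means)


# Hypothetical 1-to-5 ratings from three experts for three questions
# (illustrative only, not data from the study).
ratings = {
    "Q1": [5, 4, 4],
    "Q2": [3, 4, 3],
    "Q3": [4, 4, 5],
}

overall, pct_positive = summarize(ratings)
print(f"mean rating: {overall:.1f}, positive: {pct_positive:.1f}%")
```

The same function could be applied once per criterion (accuracy, completeness, safety) if the ratings were stored separately for each.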

RESULTS

Of the 42 experts invited, 20 responded to the questionnaire. The combined score for all responses yielded a mean rating of 4, with 75% of responses receiving a positive rating (≥4). By criterion, the ChatGPT responses performed best on accuracy, with a mean rating of 4.2 and 80% of the questions receiving a positive rating. The responses scored lower on completeness, with a mean rating of 3.8 and 46.7% of questions receiving a positive rating. For safety, the mean rating was 3.9 and 53.3% of questions received a positive rating. No response had an average rating below 3.
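As a quick arithmetic check, the per-criterion percentages can be converted back into question counts. The sketch below assumes the denominator is the 15 questions named in METHODS, which the abstract implies but does not state. The helper name `implied_count` is hypothetical.

```python
N_QUESTIONS = 15  # number of questions stated in METHODS
N_INVITED, N_RESPONDED = 42, 20  # experts invited and responding, from RESULTS


def implied_count(percent, total=N_QUESTIONS):
    """Nearest whole number of questions matching a reported percentage."""
    return round(percent / 100 * total)


# Percentages of questions with a positive rating, as reported in RESULTS.
reported = {"accuracy": 80.0, "completeness": 46.7, "safety": 53.3}

counts = {name: implied_count(p) for name, p in reported.items()}

# Round-trip: each implied count reproduces the reported percentage
# to one decimal place.
round_trip = {name: round(100 * c / N_QUESTIONS, 1) for name, c in counts.items()}

response_rate = round(100 * N_RESPONDED / N_INVITED, 1)

print(counts, round_trip, response_rate)
```

Under that assumption the reported figures correspond to 12, 7, and 8 of the 15 questions for accuracy, completeness, and safety respectively, and the expert response rate is about 47.6%.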

CONCLUSION

This study demonstrates promising results regarding the potential use of ChatGPT in providing accurate responses to obstetric clinical questions posed by pregnant women. However, it is crucial to exercise caution when addressing inquiries concerning the safety of the fetus or the mother.

Similar articles

1
Evaluating the validity of ChatGPT responses on common obstetric issues: Potential clinical applications and implications.
Int J Gynaecol Obstet. 2024 Sep;166(3):1127-1133. doi: 10.1002/ijgo.15501. Epub 2024 Mar 25.
2
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
3
Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI.
Cureus. 2024 Jan 2;16(1):e51544. doi: 10.7759/cureus.51544. eCollection 2024 Jan.
4
The future of patient education: A study on AI-driven responses to urinary incontinence inquiries.
Int J Gynaecol Obstet. 2024 Dec;167(3):1004-1009. doi: 10.1002/ijgo.15751. Epub 2024 Jun 30.
5
Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.
JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.
6
Evaluating ChatGPT to test its robustness as an interactive information database of radiation oncology and to assess its responses to common queries from radiotherapy patients: A single institution investigation.
Cancer Radiother. 2024 Jun;28(3):258-264. doi: 10.1016/j.canrad.2023.11.005. Epub 2024 Jun 12.
7
Evaluation of ChatGPT's responses to information needs and information seeking of dementia patients.
Sci Rep. 2024 May 4;14(1):10273. doi: 10.1038/s41598-024-61068-5.
8
Assessing question characteristic influences on ChatGPT's performance and response-explanation consistency: Insights from Taiwan's Nursing Licensing Exam.
Int J Nurs Stud. 2024 May;153:104717. doi: 10.1016/j.ijnurstu.2024.104717. Epub 2024 Feb 8.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
Assessing the Quality of ChatGPT Responses to Dementia Caregivers' Questions: Qualitative Analysis.
JMIR Aging. 2024 May 6;7:e53019. doi: 10.2196/53019.

Cited by

1
Can ChatGPT Provide Patient-Friendly and Reliable Information on Cervical Cancer Screening? A Study of ChatGPT-Generated Information in Polish.
Med Sci Monit. 2025 Jul 3;31:e947992. doi: 10.12659/MSM.947992.
2
Assessing the Accuracy, Completeness and Safety of ChatGPT-4o Responses on Pressure Injuries in Infants: Clinical Applications and Future Implications.
Nurs Rep. 2025 Apr 14;15(4):130. doi: 10.3390/nursrep15040130.
3
AI-Driven Information for Relatives of Patients with Malignant Middle Cerebral Artery Infarction: A Preliminary Validation Study Using GPT-4o.
Brain Sci. 2025 Apr 11;15(4):391. doi: 10.3390/brainsci15040391.