• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChatGPT评估在线医学信息质量和可读性的能力:一项横断面研究的证据。

ChatGPT's Ability to Assess Quality and Readability of Online Medical Information: Evidence From a Cross-Sectional Study.

作者信息

Golan Roei, Ripps Sarah J, Reddy Raghuram, Loloi Justin, Bernstein Ari P, Connelly Zachary M, Golan Noa S, Ramasamy Ranjith

机构信息

Department of Clinical Sciences, Florida State University College of Medicine, Tallahassee, USA.

Herbert Wertheim College of Medicine, Florida International University, Miami, USA.

出版信息

Cureus. 2023 Jul 20;15(7):e42214. doi: 10.7759/cureus.42214. eCollection 2023 Jul.

DOI:10.7759/cureus.42214
PMID:37484787
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10362474/
Abstract

Introduction Artificial Intelligence (AI) platforms have gained widespread attention for their distinct ability to generate automated responses to various prompts. However, its role in assessing the quality and readability of a provided text remains unclear. Thus, the purpose of this study is to evaluate the proficiency of the conversational generative pre-trained transformer (ChatGPT) in utilizing the DISCERN tool to evaluate the quality of online content regarding shock wave therapy for erectile dysfunction. Methods Websites were generated using a Google search of "shock wave therapy for erectile dysfunction" with location filters disabled. Readability was analyzed using Readable software (Readable.com, Horsham, United Kingdom). Quality was assessed independently by three reviewers using the DISCERN tool. The same plain text files collected were inputted into ChatGPT to determine whether they produced comparable metrics for readability and quality. Results The study results revealed a notable disparity between ChatGPT's readability assessment and that obtained from a reliable tool, Readable.com (p<0.05). This indicates a lack of alignment between ChatGPT's algorithm and that of established tools, such as Readable.com. Similarly, the DISCERN score generated by ChatGPT differed significantly from the scores generated manually by human evaluators (p<0.05), suggesting that ChatGPT may not be capable of accurately identifying poor-quality information sources regarding shock wave therapy as a treatment for erectile dysfunction. Conclusion ChatGPT's evaluation of the quality and readability of online text regarding shockwave therapy for erectile dysfunction differs from that of human raters and trusted tools. Therefore, ChatGPT's current capabilities were not sufficient for reliably assessing the quality and readability of textual content. Further research is needed to elucidate the role of AI in the objective evaluation of online medical content in other fields. Continued development in AI and incorporation of tools such as DISCERN into AI software may enhance the way patients navigate the web in search of high-quality medical content in the future.

摘要

引言 人工智能(AI)平台因其能够对各种提示生成自动回复的独特能力而受到广泛关注。然而,其在评估所提供文本的质量和可读性方面的作用仍不明确。因此,本研究的目的是评估对话式生成预训练变换器(ChatGPT)在利用DISCERN工具评估关于勃起功能障碍冲击波治疗的在线内容质量方面的熟练程度。方法 通过谷歌搜索“勃起功能障碍冲击波治疗”生成网站,禁用位置过滤器。使用Readable软件(Readable.com,英国霍舍姆)分析可读性。由三名审阅者使用DISCERN工具独立评估质量。将收集到的相同纯文本文件输入ChatGPT,以确定它们是否能产生可比的可读性和质量指标。结果 研究结果显示,ChatGPT的可读性评估与从可靠工具Readable.com获得的评估结果存在显著差异(p<0.05)。这表明ChatGPT的算法与Readable.com等既定工具的算法不一致。同样,ChatGPT生成的DISCERN分数与人类评估者手动生成的分数有显著差异(p<0.05),这表明ChatGPT可能无法准确识别关于冲击波治疗作为勃起功能障碍治疗方法的低质量信息来源。结论 ChatGPT对关于勃起功能障碍冲击波治疗的在线文本的质量和可读性的评估与人类评分者和可靠工具的评估不同。因此,ChatGPT目前的能力不足以可靠地评估文本内容的质量和可读性。需要进一步研究以阐明人工智能在其他领域在线医学内容客观评估中的作用。人工智能的持续发展以及将DISCERN等工具纳入人工智能软件可能会在未来改善患者在网络上搜索高质量医学内容的方式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ab7/10362474/0b0a210d04dc/cureus-0015-00000042214-i01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ab7/10362474/0b0a210d04dc/cureus-0015-00000042214-i01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ab7/10362474/0b0a210d04dc/cureus-0015-00000042214-i01.jpg

相似文献

1
ChatGPT's Ability to Assess Quality and Readability of Online Medical Information: Evidence From a Cross-Sectional Study.ChatGPT评估在线医学信息质量和可读性的能力:一项横断面研究的证据。
Cureus. 2023 Jul 20;15(7):e42214. doi: 10.7759/cureus.42214. eCollection 2023 Jul.
2
Assessing the quality and readability of online content on shock wave therapy for erectile dysfunction.评估关于勃起功能障碍冲击波治疗的在线内容的质量和可读性。
Andrologia. 2022 Dec;54(11):e14607. doi: 10.1111/and.14607. Epub 2022 Oct 14.
3
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
4
How artificial intelligence can provide information about subdural hematoma: Assessment of readability, reliability, and quality of ChatGPT, BARD, and perplexity responses.人工智能如何提供关于硬膜下血肿的信息:对ChatGPT、BARD和Perplexity回答的可读性、可靠性和质量评估。
Medicine (Baltimore). 2024 May 3;103(18):e38009. doi: 10.1097/MD.0000000000038009.
5
Dr. Google to Dr. ChatGPT: assessing the content and quality of artificial intelligence-generated medical information on appendicitis.谷歌博士对 ChatGPT 博士:评估人工智能生成的关于阑尾炎的医学信息的内容和质量。
Surg Endosc. 2024 May;38(5):2887-2893. doi: 10.1007/s00464-024-10739-5. Epub 2024 Mar 5.
6
Can ChatGPT Aid Clinicians in Educating Patients on the Surgical Management of Glaucoma?ChatGPT 能否帮助临床医生向患者讲解青光眼的手术治疗?
J Glaucoma. 2024 Feb 1;33(2):94-100. doi: 10.1097/IJG.0000000000002338. Epub 2023 Nov 24.
7
Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.生成式人工智能聊天机器人可能会为患者关于常见血管外科问题提供恰当的信息性回复。
Vascular. 2025 Feb;33(1):229-237. doi: 10.1177/17085381241240550. Epub 2024 Mar 18.
8
Artificial intelligence insights into osteoporosis: assessing ChatGPT's information quality and readability.人工智能在骨质疏松症中的应用:评估 ChatGPT 的信息质量和可读性。
Arch Osteoporos. 2024 Mar 19;19(1):17. doi: 10.1007/s11657-024-01376-5.
9
A Multidisciplinary Assessment of ChatGPT's Knowledge of Amyloidosis: Observational Study.对ChatGPT关于淀粉样变性知识的多学科评估:观察性研究。
JMIR Cardio. 2024 Apr 19;8:e53421. doi: 10.2196/53421.
10
An assessment of ChatGPT's responses to frequently asked questions about cervical and breast cancer.评估 ChatGPT 对宫颈癌和乳腺癌常见问题的回答。
BMC Womens Health. 2024 Sep 2;24(1):482. doi: 10.1186/s12905-024-03320-8.

引用本文的文献

1
Evaluating the Readability of Pediatric Neurocutaneous Syndromes-Related Patient Education Material Created by a Custom GPT With Retrieval Augmentation.评估由具有检索增强功能的定制GPT创建的儿科神经皮肤综合征相关患者教育材料的可读性。
JMIR Dermatol. 2025 Jul 16;8:e59054. doi: 10.2196/59054.
2
Potential role of large language models and personalized medicine to innovate cardiac rehabilitation.大语言模型和个性化医疗在创新心脏康复中的潜在作用。
World J Clin Cases. 2025 Jul 6;13(19):98095. doi: 10.12998/wjcc.v13.i19.98095.
3
How accurate are ChatGPT-4 responses in chronic urticaria? A critical analysis with information quality metrics.

本文引用的文献

1
Drug discovery companies are customizing ChatGPT: here's how.药物研发公司正在定制ChatGPT:方法如下。
Nat Biotechnol. 2023 May;41(5):585-586. doi: 10.1038/s41587-023-01788-7.
2
The Capability of ChatGPT in Predicting and Explaining Common Drug-Drug Interactions.ChatGPT在预测和解释常见药物相互作用方面的能力。
Cureus. 2023 Mar 17;15(3):e36272. doi: 10.7759/cureus.36272. eCollection 2023 Mar.
3
Peer Review: A Process Primed for Quality Improvement?同行评审:一种旨在提高质量的过程?
ChatGPT-4对慢性荨麻疹的回答有多准确?基于信息质量指标的批判性分析。
World Allergy Organ J. 2025 Jun 14;18(7):101071. doi: 10.1016/j.waojou.2025.101071. eCollection 2025 Jul.
4
The Availability and Readability of Multilingual Online Patient Materials for Legg-Calve-Perthes Disease.关于Legg-Calve-Perthes病的多语言在线患者资料的可获取性与可读性
J Pediatr Soc North Am. 2025 Mar 6;11:100169. doi: 10.1016/j.jposna.2025.100169. eCollection 2025 May.
5
A Cross-Sectional Study Comparing Patient Information Guides for Amyotrophic Lateral Sclerosis, Myasthenia Gravis, and Guillain-Barré Syndrome Produced by ChatGPT-4 and Google Gemini 1.5.一项比较ChatGPT-4和谷歌Gemini 1.5生成的肌萎缩侧索硬化症、重症肌无力和吉兰-巴雷综合征患者信息指南的横断面研究。
Cureus. 2025 Feb 25;17(2):e79646. doi: 10.7759/cureus.79646. eCollection 2025 Feb.
6
Evaluating the Quality and Readability of Generative Artificial Intelligence (AI) Chatbot Responses in the Management of Achilles Tendon Rupture.评估生成式人工智能(AI)聊天机器人在跟腱断裂管理中的回复质量和可读性。
Cureus. 2025 Jan 31;17(1):e78313. doi: 10.7759/cureus.78313. eCollection 2025 Jan.
7
Large Language Models May Help Patients Understand Peer-Reviewed Scientific Articles About Ophthalmology: Development and Usability Study.大语言模型可能有助于患者理解关于眼科的同行评审科学文章:开发与可用性研究。
J Med Internet Res. 2024 Dec 24;26:e59843. doi: 10.2196/59843.
8
Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review.ChatGPT 及其他会话型大型语言模型在医疗保健中的应用及关注:系统评价。
J Med Internet Res. 2024 Nov 7;26:e22769. doi: 10.2196/22769.
9
Can ChatGPT provide quality information about fever in children?ChatGPT能提供有关儿童发烧的高质量信息吗?
J Paediatr Child Health. 2025 Jan;61(1):60-65. doi: 10.1111/jpc.16710. Epub 2024 Oct 29.
10
Assessing the readability, reliability, and quality of artificial intelligence chatbot responses to the 100 most searched queries about cardiopulmonary resuscitation: An observational study.评估人工智能聊天机器人对心肺复苏术 100 个最常见查询的回答的易读性、可靠性和质量:一项观察性研究。
Medicine (Baltimore). 2024 May 31;103(22):e38352. doi: 10.1097/MD.0000000000038352.
J Urol. 2023 Jun;209(6):1069-1070. doi: 10.1097/JU.0000000000003460. Epub 2023 Apr 12.
4
ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns.ChatGPT在医学教育、研究与实践中的应用:对其前景与合理担忧的系统评价
Healthcare (Basel). 2023 Mar 19;11(6):887. doi: 10.3390/healthcare11060887.
5
ChatGPT - Reshaping medical education and clinical management.ChatGPT——重塑医学教育与临床管理。
Pak J Med Sci. 2023 Mar-Apr;39(2):605-607. doi: 10.12669/pjms.39.2.7653.
6
Using ChatGPT to evaluate cancer myths and misconceptions: artificial intelligence and cancer information.利用 ChatGPT 评估癌症谣言和误解:人工智能与癌症信息。
JNCI Cancer Spectr. 2023 Mar 1;7(2). doi: 10.1093/jncics/pkad015.
7
Role of Chat GPT in Public Health.Chat GPT 在公共卫生中的作用。
Ann Biomed Eng. 2023 May;51(5):868-869. doi: 10.1007/s10439-023-03172-7. Epub 2023 Mar 15.
8
Artificial intelligence in academic writing: a paradigm-shifting technological advance.学术写作中的人工智能:一项具有范式转变意义的技术进步。
Nat Rev Urol. 2023 Jun;20(6):327-328. doi: 10.1038/s41585-023-00746-x.
9
The Assessment of TikTok as a Source of Quality Health Information on Varicoceles.精索静脉曲张相关优质健康信息来源评估——TikTok 研究
Urology. 2023 May;175:170-174. doi: 10.1016/j.urology.2022.12.016. Epub 2022 Dec 24.
10
Assessing the quality and readability of online content on shock wave therapy for erectile dysfunction.评估关于勃起功能障碍冲击波治疗的在线内容的质量和可读性。
Andrologia. 2022 Dec;54(11):e14607. doi: 10.1111/and.14607. Epub 2022 Oct 14.