• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估ChatGPT在评估肾病患者非处方药和补充剂安全性方面的功效。

Evaluating ChatGPT's efficacy in assessing the safety of non-prescription medications and supplements in patients with kidney disease.

作者信息

Sheikh Mohammad S, Barreto Erin F, Miao Jing, Thongprayoon Charat, Gregoire James R, Dreesman Benjamin, Erickson Stephen B, Craici Iasmina M, Cheungpasitporn Wisit

机构信息

Department of Nephrology, Mayo Clinic Minnesota, Rochester, MN, USA.

Department of Pharmacy, Mayo Clinic Minnesota, Rochester, MN, USA.

出版信息

Digit Health. 2024 Apr 17;10:20552076241248082. doi: 10.1177/20552076241248082. eCollection 2024 Jan-Dec.

DOI:10.1177/20552076241248082
PMID:38638404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11025428/
Abstract

BACKGROUND

This study investigated the efficacy of ChatGPT-3.5 and ChatGPT-4 in assessing drug safety for patients with kidney diseases, comparing their performance to Micromedex, a well-established drug information source. Despite the perception of non-prescription medications and supplements as safe, risks exist, especially for those with kidney issues. The study's goal was to evaluate ChatGPT's versions for their potential in clinical decision-making regarding kidney disease patients.

METHOD

The research involved analyzing 124 common non-prescription medications and supplements using ChatGPT-3.5 and ChatGPT-4 with queries about their safety for people with kidney disease. The AI responses were categorized as "generally safe," "potentially harmful," or "unknown toxicity." Simultaneously, these medications and supplements were assessed in Micromedex using similar categories, allowing for a comparison of the concordance between the two resources.

RESULTS

Micromedex identified 85 (68.5%) medications as generally safe, 35 (28.2%) as potentially harmful, and 4 (3.2%) of unknown toxicity. ChatGPT-3.5 identified 89 (71.8%) as generally safe, 11 (8.9%) as potentially harmful, and 24 (19.3%) of unknown toxicity. GPT-4 identified 82 (66.1%) as generally safe, 29 (23.4%) as potentially harmful, and 13 (10.5%) of unknown toxicity. The overall agreement between Micromedex and ChatGPT-3.5 was 64.5% and ChatGPT-4 demonstrated a higher agreement at 81.4%. Notably, ChatGPT-3.5's suboptimal performance was primarily influenced by a lower concordance rate among supplements, standing at 60.3%. This discrepancy could be attributed to the limited data on supplements within ChatGPT-3.5, with supplements constituting 80% of medications identified as unknown.

CONCLUSION

ChatGPT's capabilities in evaluating the safety of non-prescription drugs and supplements for kidney disease patients are modest compared to established drug information resources. Neither ChatGPT-3.5 nor ChatGPT-4 can be currently recommended as reliable drug information sources for this demographic. The results highlight the need for further improvements in the model's accuracy and reliability in the medical domain.

摘要

背景

本研究调查了ChatGPT-3.5和ChatGPT-4在评估肾病患者药物安全性方面的功效,并将它们的表现与成熟的药物信息来源Micromedex进行比较。尽管非处方药和补充剂被认为是安全的,但仍存在风险,尤其是对肾病患者而言。该研究的目标是评估ChatGPT的不同版本在针对肾病患者的临床决策中的潜力。

方法

该研究使用ChatGPT-3.5和ChatGPT-4分析了124种常见的非处方药和补充剂,并询问它们对肾病患者的安全性。人工智能的回答被归类为“一般安全”、“潜在有害”或“毒性未知”。同时,使用类似的类别在Micromedex中对这些药物和补充剂进行评估,以便比较这两种资源之间的一致性。

结果

Micromedex将85种(68.5%)药物鉴定为一般安全,35种(28.2%)为潜在有害,4种(3.2%)毒性未知。ChatGPT-3.5将89种(71.8%)鉴定为一般安全,11种(8.9%)为潜在有害,24种(19.3%)毒性未知。ChatGPT-4将82种(66.1%)鉴定为一般安全,29种(23.4%)为潜在有害,13种(10.5%)毒性未知。Micromedex和ChatGPT-3.5之间的总体一致性为64.5%,ChatGPT-4的一致性更高,为81.4%。值得注意的是,ChatGPT-3.5的表现欠佳主要是受补充剂中较低的一致性率影响,为60.3%。这种差异可能归因于ChatGPT-3.5中关于补充剂的数据有限,补充剂占被鉴定为未知毒性药物的80%。

结论

与成熟的药物信息资源相比,ChatGPT在评估肾病患者非处方药和补充剂安全性方面的能力有限。目前,ChatGPT-3.5和ChatGPT-4都不能被推荐为这一人群可靠的药物信息来源。研究结果凸显了在医学领域进一步提高该模型准确性和可靠性的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/c22e8c7ea3b3/10.1177_20552076241248082-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/8cde175f371f/10.1177_20552076241248082-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/023d8b47d3d2/10.1177_20552076241248082-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/00fa46b40ffe/10.1177_20552076241248082-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/df021495f5e5/10.1177_20552076241248082-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/9f8dc35c1d0c/10.1177_20552076241248082-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/c22e8c7ea3b3/10.1177_20552076241248082-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/8cde175f371f/10.1177_20552076241248082-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/023d8b47d3d2/10.1177_20552076241248082-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/00fa46b40ffe/10.1177_20552076241248082-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/df021495f5e5/10.1177_20552076241248082-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/9f8dc35c1d0c/10.1177_20552076241248082-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df5b/11025428/c22e8c7ea3b3/10.1177_20552076241248082-fig6.jpg

相似文献

1
Evaluating ChatGPT's efficacy in assessing the safety of non-prescription medications and supplements in patients with kidney disease.评估ChatGPT在评估肾病患者非处方药和补充剂安全性方面的功效。
Digit Health. 2024 Apr 17;10:20552076241248082. doi: 10.1177/20552076241248082. eCollection 2024 Jan-Dec.
2
Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.ChatGPT 在临床医学研究生入学考试中的表现:调查研究。
JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.
3
Performance and exploration of ChatGPT in medical examination, records and education in Chinese: Pave the way for medical AI.ChatGPT 在中文体检、病历和教育方面的表现和探索:为医疗 AI 铺平道路。
Int J Med Inform. 2023 Sep;177:105173. doi: 10.1016/j.ijmedinf.2023.105173. Epub 2023 Aug 4.
4
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
5
Assessing the Accuracy of Generative Conversational Artificial Intelligence in Debunking Sleep Health Myths: Mixed Methods Comparative Study With Expert Analysis.评估生成式对话人工智能在破除睡眠健康误区方面的准确性:采用专家分析的混合方法比较研究
JMIR Form Res. 2024 Apr 16;8:e55762. doi: 10.2196/55762.
6
ChatGPT versus NASS clinical guidelines for degenerative spondylolisthesis: a comparative analysis.ChatGPT 与 NASS 退行性脊柱滑脱临床指南比较分析。
Eur Spine J. 2024 Nov;33(11):4182-4203. doi: 10.1007/s00586-024-08198-6. Epub 2024 Mar 15.
7
ChatGPT's Performance in Cardiac Arrest and Bradycardia Simulations Using the American Heart Association's Advanced Cardiovascular Life Support Guidelines: Exploratory Study.ChatGPT 在使用美国心脏协会高级心血管生命支持指南进行心脏骤停和心动过缓模拟中的表现:探索性研究。
J Med Internet Res. 2024 Apr 22;26:e55037. doi: 10.2196/55037.
8
Enhanced Artificial Intelligence Strategies in Renal Oncology: Iterative Optimization and Comparative Analysis of GPT 3.5 Versus 4.0.增强型人工智能策略在肾肿瘤学中的应用:GPT 3.5 与 4.0 的迭代优化与比较分析。
Ann Surg Oncol. 2024 Jun;31(6):3887-3893. doi: 10.1245/s10434-024-15107-0. Epub 2024 Mar 12.
9
Assessing ChatGPT's capacity for clinical decision support in pediatrics: A comparative study with pediatricians using KIDMAP of Rasch analysis.评估 ChatGPT 在儿科临床决策支持方面的能力:使用 KIDMAP 的 Rasch 分析与儿科医生进行的比较研究。
Medicine (Baltimore). 2023 Jun 23;102(25):e34068. doi: 10.1097/MD.0000000000034068.
10
Are Different Versions of ChatGPT's Ability Comparable to the Clinical Diagnosis Presented in Case Reports? A Descriptive Study.ChatGPT不同版本的能力与病例报告中的临床诊断具有可比性吗?一项描述性研究。
J Multidiscip Healthc. 2023 Dec 6;16:3825-3831. doi: 10.2147/JMDH.S441790. eCollection 2023.

引用本文的文献

1
ChatGPT Performance Deteriorated in Patients with Comorbidities When Providing Cardiological Therapeutic Consultations.在提供心脏治疗咨询时,合并症患者的ChatGPT性能会下降。
Healthcare (Basel). 2025 Jul 3;13(13):1598. doi: 10.3390/healthcare13131598.
2
Advancing personalized medicine in digital health: The role of artificial intelligence in enhancing clinical interpretation of 24-h ambulatory blood pressure monitoring.推进数字健康中的个性化医疗:人工智能在增强24小时动态血压监测临床解读中的作用。
Digit Health. 2025 Mar 14;11:20552076251326014. doi: 10.1177/20552076251326014. eCollection 2025 Jan-Dec.
3

本文引用的文献

1
Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications.将检索增强生成与大型语言模型在肾脏病学中的整合:推进实际应用。
Medicina (Kaunas). 2024 Mar 8;60(3):445. doi: 10.3390/medicina60030445.
2
Personalized Medicine Transformed: ChatGPT's Contribution to Continuous Renal Replacement Therapy Alarm Management in Intensive Care Units.个性化医疗的变革:ChatGPT对重症监护病房持续肾脏替代治疗警报管理的贡献。
J Pers Med. 2024 Feb 22;14(3):233. doi: 10.3390/jpm14030233.
3
Chain of Thought Utilization in Large Language Models and Application in Nephrology.
Navigating the potential and pitfalls of large language models in patient-centered medication guidance and self-decision support.
探索大语言模型在以患者为中心的用药指导和自我决策支持中的潜力与陷阱。
Front Med (Lausanne). 2025 Jan 23;12:1527864. doi: 10.3389/fmed.2025.1527864. eCollection 2025.
大语言模型中的思维链利用及其在肾脏病学中的应用。
Medicina (Kaunas). 2024 Jan 13;60(1):148. doi: 10.3390/medicina60010148.
4
Innovating Personalized Nephrology Care: Exploring the Potential Utilization of ChatGPT.创新个性化肾脏病护理:探索ChatGPT的潜在应用
J Pers Med. 2023 Dec 4;13(12):1681. doi: 10.3390/jpm13121681.
5
Feasibility and acceptability of ChatGPT generated radiology report summaries for cancer patients.ChatGPT生成的癌症患者放射学报告摘要的可行性和可接受性。
Digit Health. 2023 Dec 19;9:20552076231221620. doi: 10.1177/20552076231221620. eCollection 2023 Jan-Dec.
6
Examining the Potential of ChatGPT on Biomedical Information Retrieval: Fact-Checking Drug-Disease Associations.探讨 ChatGPT 在生物医学信息检索中的潜力:药物-疾病关联的事实核查。
Ann Biomed Eng. 2024 Aug;52(8):1919-1927. doi: 10.1007/s10439-023-03385-w. Epub 2023 Oct 19.
7
Performance of ChatGPT on Nephrology Test Questions.ChatGPT 在肾病学试题上的表现。
Clin J Am Soc Nephrol. 2024 Jan 1;19(1):35-43. doi: 10.2215/CJN.0000000000000330. Epub 2023 Oct 18.
8
ChatGPT fails the test of evidence-based medicine.ChatGPT未通过循证医学的检验。
Eur Heart J Digit Health. 2023 Jul 13;4(5):366-367. doi: 10.1093/ehjdh/ztad043. eCollection 2023 Oct.
9
ChatGPT in pharmacy practice: a cross-sectional exploration of Jordanian pharmacists' perception, practice, and concerns.ChatGPT在药学实践中的应用:对约旦药剂师认知、实践及担忧的横断面研究
J Pharm Policy Pract. 2023 Oct 3;16(1):115. doi: 10.1186/s40545-023-00624-2.
10
Comparison of Diagnostic and Triage Accuracy of Ada Health and WebMD Symptom Checkers, ChatGPT, and Physicians for Patients in an Emergency Department: Clinical Data Analysis Study.Ada 健康和 WebMD 症状检查器、ChatGPT 和医生对急诊科患者的诊断和分诊准确性比较:临床数据分析研究。
JMIR Mhealth Uhealth. 2023 Oct 3;11:e49995. doi: 10.2196/49995.