• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人工智能的大语言模型可促进患者教育。

Artificial Intelligence-Based Large Language Models Can Facilitate Patient Education.

作者信息

Bryson Xochitl, Albarran Marleni, Pham Nicole, Salunga Arianne, Johnson Taylor, Hogue Grant D, Brooks Jaysson T, Tileston Kali R, Louer Craig R, El-Hawary Ron, Imrie Meghan N, Policy James F, Bouton Daniel, Hariharan Arun R, Van Nortwick Sara, Upasani Vidyadhar V, Bauer Jennifer M, Tice Andrew, Vorhies John S

机构信息

Stanford School of Medicine Department of Orthopedic Surgery, Palo Alto, CA, USA.

Boston Children's Hospital, Boston, MA, USA.

出版信息

J Pediatr Soc North Am. 2025 May 24;12:100196. doi: 10.1016/j.jposna.2025.100196. eCollection 2025 Aug.

DOI:10.1016/j.jposna.2025.100196
PMID:40791971
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12337203/
Abstract

BACKGROUND

Artificial intelligence (AI) large language models (LLMs) are becoming increasingly popular, with patients and families more likely to utilize LLM when conducting internet-based research about scoliosis. For this reason, it is vital to understand the abilities and limitations of this technology in disseminating accurate medical information. We used an expert panel to compare LLM-generated and professional society-authored answers to frequently asked questions about pediatric scoliosis.

METHODS

We used three publicly available LLMs to generate answers to 15 frequently asked questions (FAQs) regarding pediatric scoliosis. The FAQs were derived from the Scoliosis Research Society, the American Academy of Orthopaedic Surgeons, and the Pediatric Spine Foundation. We gave minimal training to the LLM other than specifying the response length and requesting answers at a 5th-grade reading level. A 15-question survey was distributed to an expert panel composed of pediatric spine surgeons. To determine readability, responses were inputted into an open-source calculator. The panel members were presented with an AI and a physician-generated response to a FAQ and asked to select which they preferred. They were then asked to individually grade the accuracy of responses on a Likert scale.

RESULTS

The panel members had a mean of 8.9 years of experience post-fellowship (range: 3-23 years). The panel reported nearly equivalent agreement between AI-generated and physician-generated answers. The expert panel favored professional society-written responses for 40% of questions, AI for 40%, ranked responses equally good for 13%, and saw a tie between AI and "equally good" for 7%. For two professional society-generated and three AI-generated responses, the error bars of the expert panel mean score for accuracy and appropriateness fell below neutral, indicating a lack of consensus and mixed opinions with the response.

CONCLUSIONS

Based on the expert panel review, AI delivered accurate and appropriate answers as frequently as professional society-authored FAQ answers from professional society websites. AI and professional society websites were equally likely to generate answers with which the expert panel disagreed.

KEY CONCEPTS

(1)Large language models (LLMs) are increasingly used for generating medical information online, necessitating an evaluation of their accuracy and effectiveness compared with traditional sources.(2)An expert panel of physicians compared artificial intelligence (AI)-generated answers with professional society-authored answers to pediatric scoliosis frequently asked questions, finding that both types of answers were equally favored in terms of accuracy and appropriateness.(3)The panel reported a similar rate of disagreement with AI-generated and professional society-generated answers, indicating that both had areas of controversy.(4)Over half of the expert panel members felt they could distinguish between AI-generated and professional society-generated answers but this did not relate to their preferences.(5)While AI can support medical information dissemination, further research and improvements are needed to address its limitations and ensure high-quality, accessible patient education.

LEVELS OF EVIDENCE

IV.

摘要

背景

人工智能(AI)大语言模型(LLMs)越来越受欢迎,患者和家庭在进行关于脊柱侧弯的网络研究时更有可能使用大语言模型。因此,了解这项技术在传播准确医学信息方面的能力和局限性至关重要。我们使用了一个专家小组来比较大语言模型生成的和专业协会撰写的关于小儿脊柱侧弯常见问题的答案。

方法

我们使用三个公开可用的大语言模型来生成关于小儿脊柱侧弯的15个常见问题(FAQ)的答案。这些常见问题来自脊柱侧弯研究协会、美国骨科医师学会和小儿脊柱基金会。除了指定回答长度并要求以五年级阅读水平给出答案外,我们对大语言模型的训练极少。向一个由小儿脊柱外科医生组成的专家小组发放了一份包含15个问题的调查问卷。为了确定可读性,将回答输入到一个开源计算器中。向专家小组成员展示一个人工智能生成的和一个医生生成的关于常见问题的回答,并要求他们选择更喜欢哪一个。然后要求他们以李克特量表单独对回答的准确性进行评分。

结果

专家小组成员 fellowship 后的平均经验为8.9年(范围:3 - 23年)。该小组报告称,人工智能生成的答案和医生生成的答案之间的一致性几乎相同。专家小组对40%的问题青睐专业协会撰写的回答,40%青睐人工智能生成的回答,13%认为两者同样好,7%认为人工智能生成的回答与“同样好”的回答不分上下。对于两个专业协会生成的回答和三个人工智能生成的回答,专家小组关于准确性和适当性的平均得分的误差线低于中性水平,表明对这些回答缺乏共识且意见不一。

结论

基于专家小组的审查,人工智能提供准确和适当答案的频率与专业协会在其网站上撰写的常见问题解答相同。人工智能生成的答案和专业协会网站生成的答案同样有可能引发专家小组的不同意见。

关键概念

(1)大语言模型(LLMs)越来越多地用于在线生成医学信息,因此有必要将其与传统来源的准确性和有效性进行评估。(2)一个医生专家小组比较了人工智能(AI)生成的答案与专业协会撰写的小儿脊柱侧弯常见问题的答案,发现两种类型的答案在准确性和适当性方面同样受到青睐。(3)该小组报告称,对人工智能生成的答案和专业协会生成的答案的不同意见发生率相似,表明两者都存在有争议的领域。(4)超过一半的专家小组成员认为他们能够区分人工智能生成的答案和专业协会生成的答案,但这与他们的偏好无关。(5)虽然人工智能可以支持医学信息传播,但需要进一步研究和改进以解决其局限性并确保高质量、易于获取的患者教育。

证据级别

IV。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/8b21a13febf7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/480ecb2c6d26/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/4772b6ad34fd/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/bfb2a65d0e65/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/8b21a13febf7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/480ecb2c6d26/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/4772b6ad34fd/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/bfb2a65d0e65/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f738/12337203/8b21a13febf7/gr4.jpg

相似文献

1
Artificial Intelligence-Based Large Language Models Can Facilitate Patient Education.基于人工智能的大语言模型可促进患者教育。
J Pediatr Soc North Am. 2025 May 24;12:100196. doi: 10.1016/j.jposna.2025.100196. eCollection 2025 Aug.
2
Sexual Harassment and Prevention Training性骚扰与预防培训
3
Can Artificial Intelligence Improve the Readability of Patient Education Materials?人工智能能否提高患者教育材料的可读性?
Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.
4
Artificial Intelligence Shows Limited Success in Improving Readability Levels of Spanish-language Orthopaedic Patient Education Materials.人工智能在提高西班牙语骨科患者教育材料的可读性方面成效有限。
Clin Orthop Relat Res. 2025 Feb 11. doi: 10.1097/CORR.0000000000003413.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
7
Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics.用于为患者、护理人员和普通公众提供通俗易懂的医学信息的生成式人工智能/大型语言模型:机遇、风险与伦理
Patient Prefer Adherence. 2025 Jul 31;19:2227-2249. doi: 10.2147/PPA.S527922. eCollection 2025.
8
Is Information About Musculoskeletal Malignancies From Large Language Models or Web Resources at a Suitable Reading Level for Patients?来自大语言模型或网络资源的关于肌肉骨骼恶性肿瘤的信息对患者来说是否处于合适的阅读水平?
Clin Orthop Relat Res. 2025 Feb 1;483(2):306-315. doi: 10.1097/CORR.0000000000003263. Epub 2024 Sep 25.
9
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
10
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

本文引用的文献

1
Artificial Intelligence in Anesthetic Care: A Survey of Physician Anesthesiologists.人工智能在麻醉护理中的应用:对医师麻醉师的调查。
Anesth Analg. 2024 May 1;138(5):938-950. doi: 10.1213/ANE.0000000000006752. Epub 2023 Dec 6.
2
Artificial intelligence in medicine: it is neither new, nor frightening.医学中的人工智能:它既不新鲜,也不可怕。
F S Rep. 2023 Aug 9;4(3):239-240. doi: 10.1016/j.xfre.2023.08.004. eCollection 2023 Sep.
3
AI and Medical Education - A 21st-Century Pandora's Box.人工智能与医学教育——一个21世纪的潘多拉魔盒。
N Engl J Med. 2023 Aug 3;389(5):385-387. doi: 10.1056/NEJMp2304993. Epub 2023 Jul 29.
4
Readability assessment of patient educational materials for pediatric spinal conditions from top academic orthopedic institutions.顶级学术骨科机构针对小儿脊柱疾病的患者教育材料可读性评估
J Child Orthop. 2023 May 16;17(3):284-290. doi: 10.1177/18632521231156435. eCollection 2023 Jun.
5
Can Artificial Intelligence Improve the Readability of Patient Education Materials?人工智能能否提高患者教育材料的可读性?
Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.
6
Artificial intelligence in spine care: current applications and future utility.人工智能在脊柱护理中的应用:当前的应用和未来的效用。
Eur Spine J. 2022 Aug;31(8):2057-2081. doi: 10.1007/s00586-022-07176-0. Epub 2022 Mar 27.
7
General Practitioners' Attitudes Toward Artificial Intelligence-Enabled Systems: Interview Study.全科医生对人工智能系统的态度:访谈研究。
J Med Internet Res. 2022 Jan 27;24(1):e28916. doi: 10.2196/28916.
8
A Survey of Methods and Technologies Used for Diagnosis of Scoliosis.脊柱侧凸诊断方法和技术综述。
Sensors (Basel). 2021 Dec 16;21(24):8410. doi: 10.3390/s21248410.
9
Readability of Patient Educational Materials in Pediatric Orthopaedics.儿科骨科患者教育材料的可读性。
J Bone Joint Surg Am. 2021 Jun 16;103(12):e47. doi: 10.2106/JBJS.20.01347.
10
Cross sectional analysis of scoliosis-specific information on the internet: potential for patient confusion and misinformation.互联网上脊柱侧凸特定信息的横断面分析:可能导致患者困惑和错误信息。
Spine Deform. 2020 Dec;8(6):1159-1167. doi: 10.1007/s43390-020-00156-8. Epub 2020 Jun 23.