Mess Sarah A, Mackey Alison J, Yarowsky David E
From Sarah A. Mess, M.D., LLC, Columbia, MD.
Department of Plastic Surgery, Georgetown University Clinical Faculty, Washington, DC.
Plast Reconstr Surg Glob Open. 2025 Jan 16;13(1):e6450. doi: 10.1097/GOX.0000000000006450. eCollection 2025 Jan.
Artificial intelligence (AI) scribe applications in the healthcare community are in the early adoption phase and offer unprecedented efficiency for medical documentation. They typically use an application programming interface to a large language model (LLM), for example, Generative Pre-trained Transformer 4 (GPT-4). They apply automatic speech recognition to the physician-patient interaction and, within seconds to minutes, generate a full medical note for the encounter, together with a draft follow-up e-mail for the patient and, often, recommendations. This gives physicians greater cognitive freedom during medical encounters because less time is spent interfacing with the electronic medical record. However, careful proofreading of the AI-generated language by the physician signing the note is essential: insidious and potentially significant errors of omission, fabrication, or substitution may occur. The neural network algorithms of LLMs have unpredictable sensitivity to user input and inherent variability in their output, and LLMs are unconstrained by established medical knowledge or rules. As they gain increasing levels of access to large corpora of medical records, the resulting explosion of discovered knowledge comes with large potential risks, including risks to patient privacy and potential bias in algorithms. Medical AI developers should adopt robust regulatory oversight, adhere to ethical guidelines, correct bias in algorithms, and improve detection and correction of deviations from the intended output.
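The two-stage pipeline the abstract describes (automatic speech recognition on the recorded encounter, followed by an API call to an LLM that drafts the note and patient e-mail) can be illustrated with a minimal sketch. This sketch assumes the OpenAI Python client; the model identifiers ("whisper-1", "gpt-4"), the prompt wording, and the draft_encounter_documents helper are illustrative assumptions, not the implementation of any specific commercial scribe product.

```python
# Minimal sketch of an AI-scribe pipeline as described in the abstract:
# automatic speech recognition on the recorded encounter, then a large
# language model prompted through an API to draft the note and e-mail.
# Model names ("whisper-1", "gpt-4") and prompt text are illustrative
# assumptions, not the configuration of any particular product.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def draft_encounter_documents(audio_path: str) -> dict:
    # Step 1: automatic speech recognition on the physician-patient audio.
    with open(audio_path, "rb") as audio_file:
        transcript = client.audio.transcriptions.create(
            model="whisper-1",
            file=audio_file,
        ).text

    # Step 2: prompt the LLM to draft a structured note and a follow-up
    # e-mail from the transcript. The output is a DRAFT only: as the
    # abstract stresses, the signing physician must proofread it for
    # omission, fabrication, and substitution errors before filing.
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system",
             "content": "You are a medical scribe. From the encounter "
                        "transcript, draft (1) a SOAP-format note and "
                        "(2) a brief follow-up e-mail to the patient. "
                        "Do not add facts absent from the transcript."},
            {"role": "user", "content": transcript},
        ],
        temperature=0,  # reduces, but does not eliminate, output variability
    )
    return {
        "draft_note_and_email": response.choices[0].message.content,
        "transcript": transcript,
    }
```

Setting the temperature to 0 reduces, but does not eliminate, the inherent output variability the abstract notes; the returned text remains a draft requiring physician review before signing.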