• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

日语人工智能生成的临床病例能否用于医学和语言方面?

Can AI-Generated Clinical Vignettes in Japanese Be Used Medically and Linguistically?

作者信息

Yanagita Yasutaka, Yokokawa Daiki, Uchida Shun, Li Yu, Uehara Takanori, Ikusaka Masatomi

机构信息

Department of General Medicine, Chiba University Hospital, Chiba, Japan.

Uchida Internal Medicine Clinic, Saitama, Japan.

出版信息

J Gen Intern Med. 2024 Dec;39(16):3282-3289. doi: 10.1007/s11606-024-09031-y. Epub 2024 Sep 23.

DOI:10.1007/s11606-024-09031-y
PMID:39313665
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11618267/
Abstract

BACKGROUND

Creating clinical vignettes requires considerable effort. Recent developments in generative artificial intelligence (AI) for natural language processing have been remarkable and may allow for the easy and immediate creation of diverse clinical vignettes.

OBJECTIVE

In this study, we evaluated the medical accuracy and grammatical correctness of AI-generated clinical vignettes in Japanese and verified their usefulness.

METHODS

Clinical vignettes were created using the generative AI model GPT-4-0613. The input prompts for the clinical vignettes specified the following seven elements: (1) age, (2) sex, (3) chief complaint and time course since onset, (4) physical findings, (5) examination results, (6) diagnosis, and (7) treatment course. The list of diseases integrated into the vignettes was based on 202 cases considered in the management of diseases and symptoms in Japan's Primary Care Physicians Training Program. The clinical vignettes were evaluated for medical and Japanese-language accuracy by three physicians using a five-point scale. A total score of 13 points or above was defined as "sufficiently beneficial and immediately usable with minor revisions," a score between 10 and 12 points was defined as "partly insufficient and in need of modifications," and a score of 9 points or below was defined as "insufficient."

RESULTS

Regarding medical accuracy, of the 202 clinical vignettes, 118 scored 13 points or above, 78 scored between 10 and 12 points, and 6 scored 9 points or below. Regarding Japanese-language accuracy, 142 vignettes scored 13 points or above, 56 scored between 10 and 12 points, and 4 scored 9 points or below. Overall, 97% (196/202) of vignettes were available with some modifications.

CONCLUSION

Overall, 97% of the clinical vignettes proved practically useful, based on confirmation and revision by Japanese medical physicians. Given the significant effort required by physicians to create vignettes without AI, using GPT is expected to greatly optimize this process.

摘要

背景

创建临床病例需要付出相当大的努力。生成式人工智能(AI)在自然语言处理方面的最新进展显著,可能使多样化临床病例的轻松、即时创建成为可能。

目的

在本研究中,我们评估了日语中人工智能生成的临床病例的医学准确性和语法正确性,并验证了它们的实用性。

方法

使用生成式人工智能模型GPT - 4 - 0613创建临床病例。临床病例的输入提示指定了以下七个要素:(1)年龄,(2)性别,(3)主诉及发病后的病程,(4)体格检查结果,(5)检查结果,(6)诊断,以及(7)治疗过程。纳入病例的疾病列表基于日本初级保健医生培训计划中所考虑的202种疾病和症状。由三名医生使用五分制对临床病例的医学和日语准确性进行评估。总分13分及以上被定义为“足够有益且稍作修改即可立即使用”,10至12分被定义为“部分不足且需要修改”,9分及以下被定义为“不足”。

结果

在医学准确性方面,202个临床病例中,118个得分13分及以上,78个得分在10至12分之间,6个得分9分及以下。在日语准确性方面,142个病例得分13分及以上,56个得分在10至12分之间,4个得分9分及以下。总体而言,97%(196/202)的病例稍作修改即可使用。

结论

总体而言,经日本医学医生确认和修订,97%的临床病例被证明具有实际用途。鉴于医生在没有人工智能的情况下创建病例需要付出巨大努力,使用GPT有望极大地优化这一过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/b042c70ab4d6/11606_2024_9031_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/118f392e0fb1/11606_2024_9031_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/11ae6396ec52/11606_2024_9031_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/2d71bc416d7c/11606_2024_9031_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/f5a06bacd9d4/11606_2024_9031_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/b042c70ab4d6/11606_2024_9031_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/118f392e0fb1/11606_2024_9031_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/11ae6396ec52/11606_2024_9031_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/2d71bc416d7c/11606_2024_9031_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/f5a06bacd9d4/11606_2024_9031_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4789/11618267/b042c70ab4d6/11606_2024_9031_Fig5_HTML.jpg

相似文献

1
Can AI-Generated Clinical Vignettes in Japanese Be Used Medically and Linguistically?日语人工智能生成的临床病例能否用于医学和语言方面?
J Gen Intern Med. 2024 Dec;39(16):3282-3289. doi: 10.1007/s11606-024-09031-y. Epub 2024 Sep 23.
2
Educational Utility of Clinical Vignettes Generated in Japanese by ChatGPT-4: Mixed Methods Study.ChatGPT-4 生成的日语临床病例对教育的效用:混合方法研究。
JMIR Med Educ. 2024 Aug 13;10:e59133. doi: 10.2196/59133.
3
Diagnostic accuracy of large language models in psychiatry.精神科大语言模型的诊断准确性。
Asian J Psychiatr. 2024 Oct;100:104168. doi: 10.1016/j.ajp.2024.104168. Epub 2024 Jul 25.
4
The diagnostic and triage accuracy of the GPT-3 artificial intelligence model: an observational study.GPT-3 人工智能模型的诊断和分诊准确性:一项观察性研究。
Lancet Digit Health. 2024 Aug;6(8):e555-e561. doi: 10.1016/S2589-7500(24)00097-9.
5
Optimizing ChatGPT's Interpretation and Reporting of Delirium Assessment Outcomes: Exploratory Study.优化 ChatGPT 对谵妄评估结果的解释和报告:探索性研究。
JMIR Form Res. 2024 Oct 1;8:e51383. doi: 10.2196/51383.
6
Diagnostic Accuracy of Differential-Diagnosis Lists Generated by Generative Pretrained Transformer 3 Chatbot for Clinical Vignettes with Common Chief Complaints: A Pilot Study.基于生成式预训练 Transformer 3 聊天机器人为常见主诉临床病例生成鉴别诊断列表的诊断准确性:一项初步研究。
Int J Environ Res Public Health. 2023 Feb 15;20(4):3378. doi: 10.3390/ijerph20043378.
7
ChatGPT-Generated Differential Diagnosis Lists for Complex Case-Derived Clinical Vignettes: Diagnostic Accuracy Evaluation.基于复杂病例临床案例生成的ChatGPT鉴别诊断列表:诊断准确性评估。
JMIR Med Inform. 2023 Oct 9;11:e48808. doi: 10.2196/48808.
8
Performance Comparison of ChatGPT-4 and Japanese Medical Residents in the General Medicine In-Training Examination: Comparison Study.ChatGPT-4与日本内科住院医师在普通内科培训考试中的表现比较:比较研究
JMIR Med Educ. 2023 Dec 6;9:e52202. doi: 10.2196/52202.
9
Performance of Artificial Intelligence Chatbots in Answering Clinical Questions on Japanese Practical Guidelines for Implant-based Breast Reconstruction.人工智能聊天机器人在回答基于日本乳房植入重建实用指南的临床问题中的表现
Aesthetic Plast Surg. 2025 Apr;49(7):1947-1953. doi: 10.1007/s00266-024-04515-y. Epub 2024 Nov 26.
10
Measuring the Impact of AI in the Diagnosis of Hospitalized Patients: A Randomized Clinical Vignette Survey Study.测量人工智能在住院患者诊断中的影响:一项随机临床病例调查研究。
JAMA. 2023 Dec 19;330(23):2275-2284. doi: 10.1001/jama.2023.22295.

引用本文的文献

1
Large language models improve clinical decision making of medical students through patient simulation and structured feedback: a randomized controlled trial.大型语言模型通过患者模拟和结构化反馈提高医学生的临床决策能力:一项随机对照试验。
BMC Med Educ. 2024 Nov 28;24(1):1391. doi: 10.1186/s12909-024-06399-7.

本文引用的文献

1
Expert assessment of ChatGPT's ability to generate illness scripts: an evaluative study.专家评估 ChatGPT 生成疾病剧本的能力:一项评估研究。
BMC Med Educ. 2024 May 15;24(1):536. doi: 10.1186/s12909-024-05534-8.
2
ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.ChatGPT在德国妇产科考试中的表现——为人工智能强化医学教育和临床实践铺平道路。
Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.
3
Application of artificial intelligence chatbots, including ChatGPT, in education, scholarly work, programming, and content generation and its prospects: a narrative review.
人工智能聊天机器人(包括 ChatGPT)在教育、学术工作、编程、内容生成等领域的应用及其前景:叙述性综述。
J Educ Eval Health Prof. 2023;20:38. doi: 10.3352/jeehp.2023.20.38. Epub 2023 Dec 27.
4
Using Clinical Vignettes to Understand the Complexity of Diagnosing Type 1 Diabetes in Sub-Saharan Africa.利用临床案例了解撒哈拉以南非洲地区1型糖尿病诊断的复杂性。
Res Rep Trop Med. 2023 Nov 13;14:111-120. doi: 10.2147/RRTM.S397127. eCollection 2023.
5
ChatGPT Interactive Medical Simulations for Early Clinical Education: Case Study.用于早期临床教育的ChatGPT交互式医学模拟:案例研究
JMIR Med Educ. 2023 Nov 10;9:e49877. doi: 10.2196/49877.
6
Influence on the accuracy in ChatGPT: Differences in the amount of information per medical field.对 ChatGPT 准确性的影响:各医学领域信息量的差异。
Int J Med Inform. 2023 Dec;180:105283. doi: 10.1016/j.ijmedinf.2023.105283. Epub 2023 Nov 3.
7
The Impact of Multimodal Large Language Models on Health Care's Future.多模态大型语言模型对医疗保健未来的影响。
J Med Internet Res. 2023 Nov 2;25:e52865. doi: 10.2196/52865.
8
An explorative assessment of ChatGPT as an aid in medical education: Use it with caution.探索性评估 ChatGPT 在医学教育中的辅助作用:谨慎使用。
Med Teach. 2024 May;46(5):657-664. doi: 10.1080/0142159X.2023.2271159. Epub 2023 Oct 20.
9
Accuracy of ChatGPT on Medical Questions in the National Medical Licensing Examination in Japan: Evaluation Study.ChatGPT在日本国家医师资格考试医学问题上的准确性:评估研究
JMIR Form Res. 2023 Oct 13;7:e48023. doi: 10.2196/48023.
10
Ethical Considerations of Using ChatGPT in Health Care.使用 ChatGPT 在医疗保健中的伦理考虑。
J Med Internet Res. 2023 Aug 11;25:e48009. doi: 10.2196/48009.