使用大语言模型的研究的TRIPOD-LLM报告指南。

The TRIPOD-LLM reporting guideline for studies using large language models.

作者信息

Gallifant Jack, Afshar Majid, Ameen Saleem, Aphinyanaphongs Yindalon, Chen Shan, Cacciamani Giovanni, Demner-Fushman Dina, Dligach Dmitriy, Daneshjou Roxana, Fernandes Chrystinne, Hansen Lasse Hyldig, Landman Adam, Lehmann Lisa, McCoy Liam G, Miller Timothy, Moreno Amy, Munch Nikolaj, Restrepo David, Savova Guergana, Umeton Renato, Gichoya Judy Wawira, Collins Gary S, Moons Karel G M, Celi Leo A, Bitterman Danielle S

机构信息

Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA.

Department of Critical Care, Guy's and St Thomas' NHS Foundation Trust, London, UK.

出版信息

Nat Med. 2025 Jan;31(1):60-69. doi: 10.1038/s41591-024-03425-5. Epub 2025 Jan 8.

Abstract

Large language models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present transparent reporting of a multivariable model for individual prognosis or diagnosis (TRIPOD)-LLM, an extension of the TRIPOD + artificial intelligence statement, addressing the unique challenges of LLMs in biomedical applications. TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. The guidelines introduce a modular format accommodating various LLM research designs and tasks, with 14 main items and 32 subitems applicable across all categories. Developed through an expedited Delphi process and expert consensus, TRIPOD-LLM emphasizes transparency, human oversight and task-specific performance reporting. We also introduce an interactive website ( https://tripod-llm.vercel.app/ ) facilitating easy guideline completion and PDF generation for submission. As a living document, TRIPOD-LLM will evolve with the field, aiming to enhance the quality, reproducibility and clinical applicability of LLM research in healthcare through comprehensive reporting.

摘要

大语言模型(LLMs)正在医疗保健领域迅速得到应用,这就需要标准化的报告指南。我们展示了个体预后或诊断多变量模型透明报告(TRIPOD)-LLM,它是TRIPOD + 人工智能声明的扩展,解决了大语言模型在生物医学应用中的独特挑战。TRIPOD-LLM提供了一份包含19个主要项目和50个子项目的全面清单,涵盖了从标题到讨论的关键方面。该指南引入了一种模块化格式,适用于各种大语言模型研究设计和任务,其中14个主要项目和32个子项目适用于所有类别。通过快速德尔菲法和专家共识制定,TRIPOD-LLM强调透明度、人工监督和特定任务的性能报告。我们还推出了一个交互式网站(https://tripod-llm.vercel.app/),便于完成指南并生成用于提交的PDF。作为一份动态文件,TRIPOD-LLM将随着该领域的发展而演变,旨在通过全面报告提高医疗保健领域大语言模型研究的质量、可重复性和临床适用性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索