Chen Xiaolan, Zhang Weiyi, Xu Pusheng, Zhao Ziwei, Zheng Yingfeng, Shi Danli, He Mingguang
School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China.
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, China.
NPJ Digit Med. 2024 May 3;7(1):111. doi: 10.1038/s41746-024-01101-z.
Fundus fluorescein angiography (FFA) is a crucial diagnostic tool for chorioretinal diseases, but its interpretation requires significant expertise and time. Prior studies have used artificial intelligence (AI)-based systems to assist FFA interpretation, but these systems lack user interaction and comprehensive evaluation by ophthalmologists. Here, we used large language models (LLMs) to develop an automated interpretation pipeline for both report generation and medical question-answering (QA) on FFA images. The pipeline comprises two parts: an image-text alignment module (Bootstrapping Language-Image Pre-training, BLIP) for report generation and an LLM (Llama 2) for interactive QA. The model was developed using 654,343 FFA images with 9,392 reports. It was evaluated both automatically, using language-based and classification-based metrics, and manually by three experienced ophthalmologists. The automatic evaluation demonstrated that the system can generate coherent and comprehensible free-text reports, achieving a BERTScore of 0.70 and F1 scores ranging from 0.64 to 0.82 for detecting the top-5 retinal conditions. The manual evaluation revealed acceptable accuracy (68.3%, Kappa 0.746) and completeness (62.3%, Kappa 0.739) of the generated reports. The generated free-form answers were evaluated manually, with the majority meeting the ophthalmologists' criteria (error-free: 70.7%; complete: 84.0%; harmless: 93.7%; satisfactory: 65.3%; Kappa: 0.762-0.834). This study introduces an innovative framework that combines multi-modal transformers and LLMs, enhancing ophthalmic image interpretation and facilitating interactive communication during medical consultations.
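The two-stage design described above (a BLIP-style captioning model for report generation, a Llama 2 chat model for interactive QA) can be sketched with off-the-shelf Hugging Face components. This is a minimal illustration only: the public checkpoints, prompt wording, file name, and helper functions below are assumptions for demonstration, not the study's fine-tuned models or actual code.

```python
# Illustrative sketch of a two-stage FFA interpretation pipeline:
# (1) a BLIP image-captioning model drafts a free-text report,
# (2) a Llama 2 chat model answers follow-up questions about it.
import torch
from PIL import Image
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BlipForConditionalGeneration,
    BlipProcessor,
)

device = "cuda" if torch.cuda.is_available() else "cpu"

# Stage 1: image-text alignment module (BLIP) for report generation.
# Public base checkpoint used as a stand-in for the study's fine-tuned weights.
blip_processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
blip = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-base"
).to(device)

def generate_report(image_path: str) -> str:
    """Produce a free-text draft report for a single FFA frame."""
    image = Image.open(image_path).convert("RGB")
    inputs = blip_processor(images=image, return_tensors="pt").to(device)
    output_ids = blip.generate(**inputs, max_new_tokens=128)
    return blip_processor.decode(output_ids[0], skip_special_tokens=True)

# Stage 2: LLM (Llama 2 chat) for interactive QA, conditioned on the
# generated report text rather than on raw pixels.
llm_name = "meta-llama/Llama-2-7b-chat-hf"  # assumed public stand-in checkpoint
tokenizer = AutoTokenizer.from_pretrained(llm_name)
llm = AutoModelForCausalLM.from_pretrained(
    llm_name, torch_dtype=torch.float16
).to(device)

def answer_question(report: str, question: str) -> str:
    """Answer a clinician/patient question grounded in the generated report."""
    prompt = (
        f"[INST] You are an ophthalmology assistant. FFA report: {report}\n"
        f"Question: {question} [/INST]"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    output_ids = llm.generate(**inputs, max_new_tokens=256, do_sample=False)
    # Decode only the newly generated tokens, skipping the prompt.
    return tokenizer.decode(
        output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )

report = generate_report("ffa_frame.png")  # hypothetical input file
print(answer_question(report, "Is there evidence of macular leakage?"))
```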
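The abstract names two families of automatic metrics: language-based (BERTScore over generated versus reference reports) and classification-based (per-condition F1 for the top-5 retinal conditions). A minimal sketch of both, assuming the open-source bert-score and scikit-learn packages, follows; the toy report pair, condition labels, and keyword-matching detection rule are hypothetical stand-ins for the study's actual evaluation protocol.

```python
from bert_score import score as bert_score
from sklearn.metrics import f1_score

# Toy generated/reference report pair (hypothetical text, not study data).
generated = ["Late-phase hyperfluorescent leakage at the macula."]
reference = ["Macular leakage with late-phase hyperfluorescence."]

# Language-based metric: BERTScore F1 between generated and reference
# reports (the study reports an average of 0.70).
precision, recall, f1 = bert_score(generated, reference, lang="en")
print(f"BERTScore F1: {f1.mean().item():.2f}")

# Classification-based metric: per-condition F1, here derived by simple
# keyword matching on the report text (a stand-in detection rule).
conditions = ["leakage", "ischemia", "neovascularization"]  # hypothetical labels
y_true = [[1, 0, 0]]  # ground-truth conditions per report
y_pred = [[int(c in rpt.lower()) for c in conditions] for rpt in generated]
for i, cond in enumerate(conditions):
    score_i = f1_score([row[i] for row in y_true],
                       [row[i] for row in y_pred], zero_division=0)
    print(f"{cond}: F1 = {score_i:.2f}")
```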