ChatFFA: An ophthalmic chat system for unified vision-language understanding and question answering for fundus fluorescein angiography.

Author information

Chen Xiaolan, Xu Pusheng, Li Yao, Zhang Weiyi, Song Fan, He Mingguang, Shi Danli

Affiliations

School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong.

State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.

Publication information

iScience. 2024 May 17;27(7):110021. doi: 10.1016/j.isci.2024.110021. eCollection 2024 Jul 19.

DOI:10.1016/j.isci.2024.110021

PMID:39055931

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11269310/

Abstract

Existing automatic analysis of fundus fluorescein angiography (FFA) images faces limitations, including a predetermined set of possible image classifications and being confined to text-based question-answering (QA) approaches. This study aims to address these limitations by developing an end-to-end unified model that utilizes synthetic data to train a visual question-answering model for FFA images. To achieve this, we employed ChatGPT to generate 4,110,581 QA pairs for a large FFA dataset, which encompassed a total of 654,343 FFA images from 9,392 participants. We then fine-tuned the Bootstrapping Language-Image Pre-training (BLIP) framework to enable simultaneous handling of vision and language. The performance of the fine-tuned model (ChatFFA) was thoroughly evaluated through automated and manual assessments, as well as case studies based on an external validation set, demonstrating satisfactory results. In conclusion, our ChatFFA system paves the way for improved efficiency and feasibility in medical imaging analysis by leveraging generative large language models.
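To give a sense of the dataset scale stated in the abstract, the short sketch below derives a few averages from the three reported counts. These ratios are simple arithmetic on the abstract's figures, not values reported by the authors, and the function and variable names are illustrative only. Averages also say nothing about how evenly QA pairs or images are spread across participants.

```python
# Counts as reported in the abstract.
QA_PAIRS = 4_110_581      # ChatGPT-generated question-answer pairs
IMAGES = 654_343          # FFA images
PARTICIPANTS = 9_392      # participants in the dataset


def dataset_ratios(qa_pairs: int, images: int, participants: int) -> dict:
    """Return average densities derived from the three reported counts.

    Illustrative helper (hypothetical name); the ratios are derived here,
    not taken from the paper.
    """
    return {
        "qa_per_image": qa_pairs / images,
        "images_per_participant": images / participants,
        "qa_per_participant": qa_pairs / participants,
    }


ratios = dataset_ratios(QA_PAIRS, IMAGES, PARTICIPANTS)
for name, value in ratios.items():
    print(f"{name}: {value:.2f}")
```

Under these figures, the dataset averages roughly 6 QA pairs per image and roughly 70 images per participant.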

Graphical abstract: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdb1/11269310/8feff3269732/fx1.jpg
