• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于医疗保健数据共享的眼底荧光血管造影视频生成

Generation of Fundus Fluorescein Angiography Videos for Health Care Data Sharing.

作者信息

Wu Xinyuan, Wang Lili, Chen Ruoyu, Liu Bowen, Zhang Weiyi, Yang Xi, Feng Yifan, He Mingguang, Shi Danli

机构信息

School of Optometry, Hong Kong Polytechnic University, Hong Kong SAR, China.

Department of Computing, Hong Kong Polytechnic University, Hong Kong SAR, China.

出版信息

JAMA Ophthalmol. 2025 Jun 26. doi: 10.1001/jamaophthalmol.2025.1419.

DOI:10.1001/jamaophthalmol.2025.1419
PMID:40569610
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12203415/
Abstract

IMPORTANCE

Medical data sharing faces strict restrictions. Text-to-video generation shows potential for creating realistic medical data while preserving privacy, offering a solution for cross-center data sharing and medical education.

OBJECTIVE

To develop and evaluate a text-to-video generative artificial intelligence (AI)-driven model that converts the text of reports into dynamic fundus fluorescein angiography (FFA) videos, enabling visualization of retinal vascular and structural abnormalities.

DESIGN, SETTING, AND PARTICIPANTS: This study retrospectively collected anonymized FFA data from a tertiary hospital in China. The dataset included both the medical records and FFA examinations of patients assessed between November 2016 and December 2019. A text-to-video model was developed and evaluated. The AI-driven model integrated the wavelet-flow variational autoencoder and the diffusion transformer.

MAIN OUTCOMES AND MEASURES

The AI-driven model's performance was assessed through objective metrics (Fréchet video distance, learned perceptual image patch similarity score, and visual question answering score [VQAScore]). The domain-specific evaluation for the generated FFA videos was measured by the bidirectional encoder representations from transformers score (BERTScore). Image retrieval was evaluated using a Recall@K score. Each video was rated for quality by 3 ophthalmologists on a scale of 1 (excellent) to 5 (very poor).

RESULTS

A total of 3625 FFA videos were included (2851 videos [78.6%] for training, 387 videos [10.7%] for validation, and 387 videos [10.7%] for testing). The AI-generated FFA videos demonstrated retinal abnormalities from the input text (Fréchet video distance of 2273, a mean learned perceptual image patch similarity score of 0.48 [SD, 0.04], and a mean VQAScore of 0.61 [SD, 0.08]). The domain-specific evaluations showed alignment between the generated videos and textual prompts (mean BERTScore, 0.35 [SD, 0.09]). The Recall@K scores were 0.02 for K = 5, 0.04 for K = 10, and 0.16 for K = 50, yielding a mean score of 0.073, reflecting disparities between AI-generated and real clinical videos and demonstrating privacy-preserving effectiveness. For assessment of visual quality of the FFA videos by the 3 ophthalmologists, the mean score was 1.57 (SD, 0.44).

CONCLUSIONS AND RELEVANCE

This study demonstrated that an AI-driven text-to-video model generated FFA videos from textual descriptions, potentially improving visualization for clinical and educational purposes. The privacy-preserving nature of the model may address key challenges in data sharing while trying to ensure compliance with confidentiality standards.

摘要

重要性

医学数据共享面临严格限制。文本到视频生成在保护隐私的同时显示出创建逼真医学数据的潜力,为跨中心数据共享和医学教育提供了一种解决方案。

目的

开发并评估一种由文本到视频生成的人工智能(AI)驱动模型,该模型将报告文本转换为动态眼底荧光血管造影(FFA)视频,从而实现视网膜血管和结构异常的可视化。

设计、设置和参与者:本研究回顾性收集了中国一家三级医院的匿名FFA数据。数据集包括2016年11月至2019年12月期间接受评估的患者的病历和FFA检查。开发并评估了一个文本到视频模型。该AI驱动模型集成了小波流变分自编码器和扩散变换器。

主要结果和测量指标

通过客观指标(弗雷歇视频距离、学习感知图像块相似性分数和视觉问答分数[VQAScore])评估AI驱动模型的性能。通过变换器分数(BERTScore)的双向编码器表示来测量生成的FFA视频的特定领域评估。使用召回率@K分数评估图像检索。3名眼科医生对每个视频的质量进行评分,范围为1(优秀)至5(非常差)。

结果

共纳入3625个FFA视频(2851个视频[78.6%]用于训练,387个视频[10.7%]用于验证,387个视频[10.7%]用于测试)。AI生成的FFA视频从输入文本中展示了视网膜异常(弗雷歇视频距离为2273,平均学习感知图像块相似性分数为0.48[标准差,0.04],平均VQAScore为0.61[标准差,0.08])。特定领域评估显示生成的视频与文本提示之间具有一致性(平均BERTScore,0.35[标准差,0.09])。召回率@K分数在K = 5时为0.02,K = 10时为0.04,K = 50时为0.16,平均分数为0.073,反映了AI生成的视频与真实临床视频之间的差异,并证明了隐私保护的有效性。对于3名眼科医生对FFA视频视觉质量的评估,平均分数为1.57(标准差,0.44)。

结论及相关性

本研究表明,一个由AI驱动的文本到视频模型可从文本描述中生成FFA视频,可能改善临床和教育目的的可视化效果。该模型的隐私保护特性可能在确保符合保密标准的同时解决数据共享中的关键挑战。

相似文献

1
Generation of Fundus Fluorescein Angiography Videos for Health Care Data Sharing.用于医疗保健数据共享的眼底荧光血管造影视频生成
JAMA Ophthalmol. 2025 Jun 26. doi: 10.1001/jamaophthalmol.2025.1419.
2
Quality and Dissemination of Uterine Fibroid Health Information on TikTok and Bilibili: Cross-Sectional Study.抖音和哔哩哔哩上子宫肌瘤健康信息的质量与传播:横断面研究
JMIR Form Res. 2025 Aug 1;9:e75120. doi: 10.2196/75120.
3
Analyzing Surgical Technique in Diverse Open Surgical Videos With Multitask Machine Learning.利用多任务机器学习分析多种开放性手术视频中的手术技术。
JAMA Surg. 2024 Feb 1;159(2):185-192. doi: 10.1001/jamasurg.2023.6262.
4
Using Natural Language Processing to Explore Patient Perspectives on AI Avatars in Support Materials for Patients With Breast Cancer: Survey Study.使用自然语言处理技术探索乳腺癌患者在支持材料中对人工智能化身的看法:调查研究
J Med Internet Res. 2025 Jun 20;27:e70971. doi: 10.2196/70971.
5
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
6
AI in Medical Questionnaires: Innovations, Diagnosis, and Implications.医学问卷中的人工智能:创新、诊断及影响
J Med Internet Res. 2025 Jun 23;27:e72398. doi: 10.2196/72398.
7
Improving reliability of movement assessment in Parkinson's disease using computer vision-based automated severity estimation.利用基于计算机视觉的自动严重程度估计提高帕金森病运动评估的可靠性。
J Parkinsons Dis. 2025 Mar;15(2):349-360. doi: 10.1177/1877718X241312605. Epub 2025 Feb 13.
8
Developing an ICD-10 Coding Assistant: Pilot Study Using RoBERTa and GPT-4 for Term Extraction and Description-Based Code Selection.开发国际疾病分类第十版(ICD - 10)编码助手:使用RoBERTa和GPT - 4进行术语提取和基于描述的代码选择的试点研究
JMIR Form Res. 2025 Feb 11;9:e60095. doi: 10.2196/60095.
9
AI-Enabled Screening for Retinopathy of Prematurity in Low-Resource Settings.在资源匮乏地区利用人工智能进行早产儿视网膜病变筛查。
JAMA Netw Open. 2025 Apr 1;8(4):e257831. doi: 10.1001/jamanetworkopen.2025.7831.
10
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险

本文引用的文献

1
Randomness-Restricted Diffusion Model for Ocular Surface Structure Segmentation.用于眼表结构分割的随机受限扩散模型
IEEE Trans Med Imaging. 2025 Mar;44(3):1359-1372. doi: 10.1109/TMI.2024.3494762. Epub 2025 Mar 17.
2
Privacy preserving technology in ophthalmology.眼科中的隐私保护技术。
Curr Opin Ophthalmol. 2024 Nov 1;35(6):431-437. doi: 10.1097/ICU.0000000000001087. Epub 2024 Aug 26.
3
A vision-language foundation model for the generation of realistic chest X-ray images.一种用于生成逼真胸部X光图像的视觉语言基础模型。
Nat Biomed Eng. 2025 Apr;9(4):494-506. doi: 10.1038/s41551-024-01246-y. Epub 2024 Aug 26.
4
FFA-GPT: an automated pipeline for fundus fluorescein angiography interpretation and question-answer.FFA-GPT:一种用于眼底荧光血管造影解释和问答的自动化流程。
NPJ Digit Med. 2024 May 3;7(1):111. doi: 10.1038/s41746-024-01101-z.
5
Conditional Diffusion Models for Semantic 3D Brain MRI Synthesis.用于语义3D脑磁共振成像合成的条件扩散模型
IEEE J Biomed Health Inform. 2024 Jul;28(7):4084-4093. doi: 10.1109/JBHI.2024.3385504. Epub 2024 Jul 2.
6
Using artificial intelligence to improve human performance: efficient retinal disease detection training with synthetic images.利用人工智能提高人类表现:使用合成图像进行高效的视网膜疾病检测训练。
Br J Ophthalmol. 2024 Sep 20;108(10):1430-1435. doi: 10.1136/bjo-2023-324923.
7
How OpenAI's text-to-video tool Sora could change science - and society.OpenAI的文本转视频工具Sora如何改变科学及社会。
Nature. 2024 Mar;627(8004):475-476. doi: 10.1038/d41586-024-00661-0.
8
Privacy-preserving federated machine learning on FAIR health data: A real-world application.公平健康数据上的隐私保护联邦机器学习:一个实际应用
Comput Struct Biotechnol J. 2024 Feb 17;24:136-145. doi: 10.1016/j.csbj.2024.02.014. eCollection 2024 Dec.
9
Translating color fundus photography to indocyanine green angiography using deep-learning for age-related macular degeneration screening.利用深度学习将彩色眼底摄影转换为吲哚菁绿血管造影以进行年龄相关性黄斑变性筛查。
NPJ Digit Med. 2024 Feb 12;7(1):34. doi: 10.1038/s41746-024-01018-7.
10
Bridging the Camera Domain Gap With Image-to-Image Translation Improves Glaucoma Diagnosis.通过图像到图像的翻译弥合相机域差距可提高青光眼诊断。
Transl Vis Sci Technol. 2023 Dec 1;12(12):20. doi: 10.1167/tvst.12.12.20.