Suppr超能文献

Octascope:一种用于光学相干断层扫描的轻量级预训练模型。

Octascope: A Lightweight Pre-Trained Model for Optical Coherence Tomography.

作者信息

Cui Haoyang, Wang Chen, Calle Paul, Liu Yunlong, Zhang Qinghao, Ly Sinaro, Reynolds Justin, Yan Feng, Zhang K E, Liu Ronghao, Liu Junyuan, Fung Kar-Ming, Yu Zhongxin, Jain Ajay, Tang Qinggong, Pan Chongle

机构信息

School of Computer Science, Gallogly College of Engineering, The University of Oklahoma, Norman, OK 73019, USA.

Stephenson School of Biomedical Engineering, The University of Oklahoma, Norman, OK 73019, USA.

出版信息

IEEE Access. 2025;13:138005-138019. doi: 10.1109/access.2025.3595838. Epub 2025 Aug 5.

Abstract

Optical coherence tomography (OCT) imaging enables high resolution visualization of sub-surface tissue microstructures. However, OCT image analysis using deep learning is hampered by limited diverse training data to meet performance requirements and high inference latency for real-time applications. To address these challenges, we developed Octascope, a lightweight domain-specific convolutional neural network (CNN) - based model designed for OCT image analysis. Octascope was pre-trained using a curriculum learning approach, which involves sequential training, first on natural images (ImageNet), then on OCT images from retinal, abdominal, and renal tissues, to progressively acquire transferable knowledge. This multi-domain pre-training enables Octascope to generalize across varied tissue types. In two downstream tasks, Octascope demonstrated notable improvements in predictive accuracy compared to alternative approaches. In the epidural tissue detection task, our method surpassed single-task learning with fine-tuning by 9.13% and OCT-specific transfer learning by 5.95% in accuracy. Octascope outperformed VGG16 and ResNet50 by 5.36% and 6.66% in a retinal diagnosis task, respectively. In comparison to a Transformer-based OCT foundation model - RETFound, Octascope delivered 2 to 4.4 times faster inference speed with slightly better predictive accuracies in both downstream tasks. Octascope represented a significant advancement for OCT image analysis by providing an effective balance between computational efficiency and diagnostic accuracy for real-time clinical applications.

摘要

光学相干断层扫描(OCT)成像能够对皮下组织微观结构进行高分辨率可视化。然而,使用深度学习进行OCT图像分析受到限制,因为满足性能要求的多样化训练数据有限,且实时应用的推理延迟较高。为应对这些挑战,我们开发了Octascope,这是一种基于轻量级特定领域卷积神经网络(CNN)的模型,专为OCT图像分析而设计。Octascope使用课程学习方法进行预训练,该方法包括顺序训练,首先在自然图像(ImageNet)上训练,然后在来自视网膜、腹部和肾脏组织的OCT图像上训练,以逐步获取可转移的知识。这种多领域预训练使Octascope能够在不同组织类型中进行泛化。在两项下游任务中,与其他方法相比,Octascope在预测准确性方面有显著提高。在硬膜外组织检测任务中,我们的方法在准确率上比微调的单任务学习高出9.13%,比特定于OCT的迁移学习高出5.95%。在视网膜诊断任务中,Octascope分别比VGG16和ResNet50的表现高出5.36%和6.66%。与基于Transformer的OCT基础模型RETFound相比,Octascope在两项下游任务中的推理速度快2至4.4倍,预测准确性略高。Octascope通过在计算效率和实时临床应用的诊断准确性之间实现有效平衡,代表了OCT图像分析的重大进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3710/12378998/e43dbf8e72e0/nihms-2104333-f0017.jpg

相似文献

1
Octascope: A Lightweight Pre-Trained Model for Optical Coherence Tomography.
IEEE Access. 2025;13:138005-138019. doi: 10.1109/access.2025.3595838. Epub 2025 Aug 5.
2
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
5
Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.
Cochrane Database Syst Rev. 2015 Jan 7;1(1):CD008081. doi: 10.1002/14651858.CD008081.pub3.
6
OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection.
Front Big Data. 2025 Jul 29;8:1609124. doi: 10.3389/fdata.2025.1609124. eCollection 2025.
8
Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.
Cochrane Database Syst Rev. 2011 Jul 6(7):CD008081. doi: 10.1002/14651858.CD008081.pub2.
9
Self-Supervised Learning for Improved Optical Coherence Tomography Detection of Macular Telangiectasia Type 2.
JAMA Ophthalmol. 2024 Mar 1;142(3):226-233. doi: 10.1001/jamaophthalmol.2023.6454.

本文引用的文献

1
Foundation Models Defining a New Era in Vision: A Survey and Outlook.
IEEE Trans Pattern Anal Mach Intell. 2025 Apr;47(4):2245-2264. doi: 10.1109/TPAMI.2024.3506283. Epub 2025 Mar 6.
3
A foundation model for enhancing magnetic resonance images and downstream segmentation, registration and diagnostic tasks.
Nat Biomed Eng. 2025 Apr;9(4):521-538. doi: 10.1038/s41551-024-01283-7. Epub 2024 Dec 5.
4
A vision-language foundation model for the generation of realistic chest X-ray images.
Nat Biomed Eng. 2025 Apr;9(4):494-506. doi: 10.1038/s41551-024-01246-y. Epub 2024 Aug 26.
6
USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis.
Med Image Anal. 2024 Aug;96:103202. doi: 10.1016/j.media.2024.103202. Epub 2024 May 15.
7
OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods.
Sci Data. 2024 Apr 11;11(1):365. doi: 10.1038/s41597-024-03182-7.
8
A foundation model for generalizable disease detection from retinal images.
Nature. 2023 Oct;622(7981):156-163. doi: 10.1038/s41586-023-06555-x. Epub 2023 Sep 13.
9
Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis.
PLoS Comput Biol. 2023 Jul 7;19(7):e1011211. doi: 10.1371/journal.pcbi.1011211. eCollection 2023 Jul.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验