Cui Haoyang, Wang Chen, Calle Paul, Liu Yunlong, Zhang Qinghao, Ly Sinaro, Reynolds Justin, Yan Feng, Zhang K E, Liu Ronghao, Liu Junyuan, Fung Kar-Ming, Yu Zhongxin, Jain Ajay, Tang Qinggong, Pan Chongle
School of Computer Science, Gallogly College of Engineering, The University of Oklahoma, Norman, OK 73019, USA.
Stephenson School of Biomedical Engineering, The University of Oklahoma, Norman, OK 73019, USA.
IEEE Access. 2025;13:138005-138019. doi: 10.1109/access.2025.3595838. Epub 2025 Aug 5.
Optical coherence tomography (OCT) imaging enables high resolution visualization of sub-surface tissue microstructures. However, OCT image analysis using deep learning is hampered by limited diverse training data to meet performance requirements and high inference latency for real-time applications. To address these challenges, we developed Octascope, a lightweight domain-specific convolutional neural network (CNN) - based model designed for OCT image analysis. Octascope was pre-trained using a curriculum learning approach, which involves sequential training, first on natural images (ImageNet), then on OCT images from retinal, abdominal, and renal tissues, to progressively acquire transferable knowledge. This multi-domain pre-training enables Octascope to generalize across varied tissue types. In two downstream tasks, Octascope demonstrated notable improvements in predictive accuracy compared to alternative approaches. In the epidural tissue detection task, our method surpassed single-task learning with fine-tuning by 9.13% and OCT-specific transfer learning by 5.95% in accuracy. Octascope outperformed VGG16 and ResNet50 by 5.36% and 6.66% in a retinal diagnosis task, respectively. In comparison to a Transformer-based OCT foundation model - RETFound, Octascope delivered 2 to 4.4 times faster inference speed with slightly better predictive accuracies in both downstream tasks. Octascope represented a significant advancement for OCT image analysis by providing an effective balance between computational efficiency and diagnostic accuracy for real-time clinical applications.
光学相干断层扫描(OCT)成像能够对皮下组织微观结构进行高分辨率可视化。然而,使用深度学习进行OCT图像分析受到限制,因为满足性能要求的多样化训练数据有限,且实时应用的推理延迟较高。为应对这些挑战,我们开发了Octascope,这是一种基于轻量级特定领域卷积神经网络(CNN)的模型,专为OCT图像分析而设计。Octascope使用课程学习方法进行预训练,该方法包括顺序训练,首先在自然图像(ImageNet)上训练,然后在来自视网膜、腹部和肾脏组织的OCT图像上训练,以逐步获取可转移的知识。这种多领域预训练使Octascope能够在不同组织类型中进行泛化。在两项下游任务中,与其他方法相比,Octascope在预测准确性方面有显著提高。在硬膜外组织检测任务中,我们的方法在准确率上比微调的单任务学习高出9.13%,比特定于OCT的迁移学习高出5.95%。在视网膜诊断任务中,Octascope分别比VGG16和ResNet50的表现高出5.36%和6.66%。与基于Transformer的OCT基础模型RETFound相比,Octascope在两项下游任务中的推理速度快2至4.4倍,预测准确性略高。Octascope通过在计算效率和实时临床应用的诊断准确性之间实现有效平衡,代表了OCT图像分析的重大进展。