Octascope：一种用于光学相干断层扫描的轻量级预训练模型。

Octascope: A Lightweight Pre-Trained Model for Optical Coherence Tomography.

作者信息

Cui Haoyang, Wang Chen, Calle Paul, Liu Yunlong, Zhang Qinghao, Ly Sinaro, Reynolds Justin, Yan Feng, Zhang K E, Liu Ronghao, Liu Junyuan, Fung Kar-Ming, Yu Zhongxin, Jain Ajay, Tang Qinggong, Pan Chongle

机构信息

School of Computer Science, Gallogly College of Engineering, The University of Oklahoma, Norman, OK 73019, USA.

Stephenson School of Biomedical Engineering, The University of Oklahoma, Norman, OK 73019, USA.

出版信息

IEEE Access. 2025;13:138005-138019. doi: 10.1109/access.2025.3595838. Epub 2025 Aug 5.

DOI:10.1109/access.2025.3595838

PMID:40874077

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12378998/

Abstract

Optical coherence tomography (OCT) imaging enables high resolution visualization of sub-surface tissue microstructures. However, OCT image analysis using deep learning is hampered by limited diverse training data to meet performance requirements and high inference latency for real-time applications. To address these challenges, we developed Octascope, a lightweight domain-specific convolutional neural network (CNN) - based model designed for OCT image analysis. Octascope was pre-trained using a curriculum learning approach, which involves sequential training, first on natural images (ImageNet), then on OCT images from retinal, abdominal, and renal tissues, to progressively acquire transferable knowledge. This multi-domain pre-training enables Octascope to generalize across varied tissue types. In two downstream tasks, Octascope demonstrated notable improvements in predictive accuracy compared to alternative approaches. In the epidural tissue detection task, our method surpassed single-task learning with fine-tuning by 9.13% and OCT-specific transfer learning by 5.95% in accuracy. Octascope outperformed VGG16 and ResNet50 by 5.36% and 6.66% in a retinal diagnosis task, respectively. In comparison to a Transformer-based OCT foundation model - RETFound, Octascope delivered 2 to 4.4 times faster inference speed with slightly better predictive accuracies in both downstream tasks. Octascope represented a significant advancement for OCT image analysis by providing an effective balance between computational efficiency and diagnostic accuracy for real-time clinical applications.

摘要

光学相干断层扫描（OCT）成像能够对皮下组织微观结构进行高分辨率可视化。然而，使用深度学习进行OCT图像分析受到限制，因为满足性能要求的多样化训练数据有限，且实时应用的推理延迟较高。为应对这些挑战，我们开发了Octascope，这是一种基于轻量级特定领域卷积神经网络（CNN）的模型，专为OCT图像分析而设计。Octascope使用课程学习方法进行预训练，该方法包括顺序训练，首先在自然图像（ImageNet）上训练，然后在来自视网膜、腹部和肾脏组织的OCT图像上训练，以逐步获取可转移的知识。这种多领域预训练使Octascope能够在不同组织类型中进行泛化。在两项下游任务中，与其他方法相比，Octascope在预测准确性方面有显著提高。在硬膜外组织检测任务中，我们的方法在准确率上比微调的单任务学习高出9.13%，比特定于OCT的迁移学习高出5.95%。在视网膜诊断任务中，Octascope分别比VGG16和ResNet50的表现高出5.36%和6.66%。与基于Transformer的OCT基础模型RETFound相比，Octascope在两项下游任务中的推理速度快2至4.4倍，预测准确性略高。Octascope通过在计算效率和实时临床应用的诊断准确性之间实现有效平衡，代表了OCT图像分析的重大进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3710/12378998/e43dbf8e72e0/nihms-2104333-f0017.jpg

相似文献

Octascope: A Lightweight Pre-Trained Model for Optical Coherence Tomography.Octascope：一种用于光学相干断层扫描的轻量级预训练模型。

IEEE Access. 2025;13:138005-138019. doi: 10.1109/access.2025.3595838. Epub 2025 Aug 5.

Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。

Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.光学相干断层扫描（OCT）用于检测糖尿病视网膜病变患者的黄斑水肿。

Cochrane Database Syst Rev. 2015 Jan 7;1(1):CD008081. doi: 10.1002/14651858.CD008081.pub3.

OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection.OCT-SelfNet：一个用于广义视网膜疾病检测的具有多源数据集的自监督框架。

Front Big Data. 2025 Jul 29;8:1609124. doi: 10.3389/fdata.2025.1609124. eCollection 2025.

A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。

Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.光学相干断层扫描（OCT）用于检测糖尿病视网膜病变患者的黄斑水肿。

Cochrane Database Syst Rev. 2011 Jul 6(7):CD008081. doi: 10.1002/14651858.CD008081.pub2.

Self-Supervised Learning for Improved Optical Coherence Tomography Detection of Macular Telangiectasia Type 2.基于自监督学习的黄斑毛细血管扩张症 2 型光学相干断层扫描检测方法的研究

JAMA Ophthalmol. 2024 Mar 1;142(3):226-233. doi: 10.1001/jamaophthalmol.2023.6454.

Deep Learning for the Early Detection of Invasive Ductal Carcinoma in Histopathological Images: Convolutional Neural Network Approach With Transfer Learning.基于深度学习的组织病理学图像中浸润性导管癌早期检测：采用迁移学习的卷积神经网络方法

JMIR Form Res. 2025 Aug 21;9:e62996. doi: 10.2196/62996.

本文引用的文献

Foundation Models Defining a New Era in Vision: A Survey and Outlook.基础模型：定义视觉领域的新时代——一项综述与展望

IEEE Trans Pattern Anal Mach Intell. 2025 Apr;47(4):2245-2264. doi: 10.1109/TPAMI.2024.3506283. Epub 2025 Mar 6.

General lightweight framework for vision foundation model supporting multi-task and multi-center medical image analysis.支持多任务和多中心医学图像分析的视觉基础模型通用轻量级框架。

Nat Commun. 2025 Mar 1;16(1):2097. doi: 10.1038/s41467-025-57427-z.

A foundation model for enhancing magnetic resonance images and downstream segmentation, registration and diagnostic tasks.一种用于增强磁共振图像以及下游分割、配准和诊断任务的基础模型。

Nat Biomed Eng. 2025 Apr;9(4):521-538. doi: 10.1038/s41551-024-01283-7. Epub 2024 Dec 5.

A vision-language foundation model for the generation of realistic chest X-ray images.一种用于生成逼真胸部X光图像的视觉语言基础模型。

Nat Biomed Eng. 2025 Apr;9(4):494-506. doi: 10.1038/s41551-024-01246-y. Epub 2024 Aug 26.

Automatic renal carcinoma biopsy guidance using forward-viewing endoscopic optical coherence tomography and deep learning.使用前视内镜光学相干断层扫描和深度学习的自动肾癌活检引导

Commun Eng. 2024 Aug 2;3(1):107. doi: 10.1038/s44172-024-00254-9.

USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis.USFM：一种通用的超声基础模型，可推广到任务和器官，实现高效的标签图像分析。

Med Image Anal. 2024 Aug;96:103202. doi: 10.1016/j.media.2024.103202. Epub 2024 May 15.

OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods.OCTDL：基于图像的深度学习方法的光学相干层析成像数据集。

Sci Data. 2024 Apr 11;11(1):365. doi: 10.1038/s41597-024-03182-7.

A foundation model for generalizable disease detection from retinal images.基于视网膜图像的通用疾病检测的基础模型。

Nature. 2023 Oct;622(7981):156-163. doi: 10.1038/s41586-023-06555-x. Epub 2023 Sep 13.

Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis.可解释的多任务学习通过共享遗传基础，提高了对许多疾病的多基因风险评分的并行估计。

PLoS Comput Biol. 2023 Jul 7;19(7):e1011211. doi: 10.1371/journal.pcbi.1011211. eCollection 2023 Jul.

Epidural anesthesia needle guidance by forward-view endoscopic optical coherence tomography and deep learning.经皮内镜光学相干断层扫描和深度学习引导的硬膜外麻醉针。

Sci Rep. 2022 May 31;12(1):9057. doi: 10.1038/s41598-022-12950-7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

Octascope：一种用于光学相干断层扫描的轻量级预训练模型。

Octascope: A Lightweight Pre-Trained Model for Optical Coherence Tomography.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献