Leveraging pretrained vision transformers for automated cancer diagnosis in optical coherence tomography images.

Authors

Ray Soumyajit, Lee Cheng-Yu, Park Hyeon-Cheol, Nauen David W, Bettegowda Chetan, Li Xingde, Chellappa Rama

Affiliations

Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.

Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA.

Publication

Biomed Opt Express. 2025 Jul 21;16(8):3283-3294. doi: 10.1364/BOE.563694. eCollection 2025 Aug 1.

Abstract

This study presents an approach to brain cancer detection based on optical coherence tomography (OCT) images and advanced machine learning techniques. The research addresses the critical need for accurate, real-time differentiation between cancerous and noncancerous brain tissue during neurosurgical procedures. The proposed method combines a pretrained large vision transformer (ViT) model, specifically DINOv2, with a convolutional neural network (CNN) operating on grey level co-occurrence matrix (GLCM) texture features. This dual-path architecture leverages both the global contextual feature extraction capabilities of transformers and the local texture analysis strengths of the GLCM + CNN. To mitigate patient-specific bias arising from the limited cohort, we incorporate an adversarial discriminator network that attempts to identify individual patients from the feature representations, creating a competing objective that forces the model to learn generalizable cancer-indicative features rather than patient-specific characteristics. We also explore an alternative state space model approach using MambaVision blocks, which achieves comparable performance. The dataset comprised OCT images from 11 patients: 5,831 B-frame slices from 7 patients were used for training and validation, and 1,610 slices from 4 patients were used for testing. The model achieved high accuracy in distinguishing cancerous from noncancerous tissue: over 99% on the training set, 98.8% on the validation set, and 98.6% on the test set. This approach demonstrates significant potential for improving intraoperative decision-making in brain cancer surgery by offering real-time, high-accuracy tissue classification and surgical guidance.
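The paper's code and exact hyperparameters are not given in the abstract, but the dual-path design with an adversarial patient discriminator can be sketched in PyTorch as follows. The backbone variant (dinov2_vits14), layer sizes, GLCM channel count, and the gradient-reversal formulation of the adversarial objective are all assumptions for illustration, not the authors' reported configuration.

```python
# A minimal PyTorch sketch of the dual-path architecture described above,
# under assumed hyperparameters: a frozen DINOv2 ViT-S/14 backbone for
# global context, a small CNN over per-angle GLCM matrices for local
# texture, and a gradient-reversal patient discriminator as the
# adversarial objective. Layer sizes and the fusion scheme are guesses.
import torch
import torch.nn as nn


class GradReverse(torch.autograd.Function):
    """Identity on the forward pass; negates (and scales) gradients on
    the backward pass, so minimizing the patient-classification loss
    trains the shared features to *confuse* the discriminator."""

    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_out):
        return -ctx.lam * grad_out, None


class DualPathOCTClassifier(nn.Module):
    def __init__(self, n_patients: int, glcm_channels: int = 4, lam: float = 1.0):
        super().__init__()
        self.lam = lam
        # Global-context path: pretrained DINOv2 ViT-S/14, kept frozen.
        # forward() on the hub model returns the 384-d class-token embedding.
        self.vit = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14")
        for p in self.vit.parameters():
            p.requires_grad = False
        # Local-texture path: small CNN over per-angle GLCM matrices
        # stacked as channels (see the GLCM sketch below).
        self.glcm_cnn = nn.Sequential(
            nn.Conv2d(glcm_channels, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        fused_dim = 384 + 32
        self.cancer_head = nn.Linear(fused_dim, 2)             # cancer / noncancer
        self.patient_head = nn.Linear(fused_dim, n_patients)   # adversary

    def forward(self, image: torch.Tensor, glcm: torch.Tensor):
        # image: (B, 3, 224, 224) normalized B-frames (DINOv2 needs sides
        # divisible by 14); glcm: (B, glcm_channels, levels, levels).
        feats = torch.cat([self.vit(image), self.glcm_cnn(glcm)], dim=1)
        cancer_logits = self.cancer_head(feats)
        # Reversed gradients push `feats` toward patient-invariant
        # features while the discriminator still learns to identify patients.
        patient_logits = self.patient_head(GradReverse.apply(feats, self.lam))
        return cancer_logits, patient_logits
```

Under this formulation, one plausible training objective (again an assumption) is `loss = ce(cancer_logits, labels) + ce(patient_logits, patient_ids)`; the gradient reversal inside the model turns the second term into an adversarial penalty on the shared features in a single backward pass, with no alternating updates needed.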

Graphical abstract: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c7a/12339304/000b25fe9cb1/boe-16-8-3283-g001.jpg
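The GLCM input assumed by the model sketch above could be prepared with scikit-image as below. The distance, angles, and grey-level quantization are illustrative choices; the abstract does not report the paper's actual GLCM settings.

```python
# A minimal sketch of GLCM preparation for one OCT B-frame, using
# scikit-image. Produces per-angle normalized co-occurrence matrices
# stacked as channels, matching the assumed CNN input above.
import numpy as np
from skimage.feature import graycomatrix


def glcm_stack(bframe_u8: np.ndarray, levels: int = 64) -> np.ndarray:
    """bframe_u8: 2-D uint8 OCT B-frame. Returns a (4, levels, levels)
    float32 stack of normalized GLCMs, one per angle."""
    # Quantize 256 grey levels down to `levels` to keep the matrices small.
    q = (bframe_u8 // (256 // levels)).astype(np.uint8)
    glcm = graycomatrix(
        q,
        distances=[1],
        angles=[0, np.pi / 4, np.pi / 2, 3 * np.pi / 4],
        levels=levels,
        symmetric=True,
        normed=True,
    )
    # graycomatrix returns (levels, levels, n_distances, n_angles);
    # move the angle axis to the front to act as the channel dimension.
    return np.transpose(glcm[:, :, 0, :], (2, 0, 1)).astype(np.float32)
```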
