• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

知识蒸馏绕过了光学卷积神经网络的非线性。

Knowledge distillation circumvents nonlinearity for optical convolutional neural networks.

出版信息

Appl Opt. 2022 Mar 20;61(9):2173-2183. doi: 10.1364/AO.435738.

DOI:10.1364/AO.435738
PMID:35333231
Abstract

In recent years, convolutional neural networks (CNNs) have enabled ubiquitous image processing applications. As such, CNNs require fast forward propagation runtime to process high-resolution visual streams in real time. This is still a challenging task even with state-of-the-art graphics and tensor processing units. The bottleneck in computational efficiency primarily occurs in the convolutional layers. Performing convolutions in the Fourier domain is a promising way to accelerate forward propagation since it transforms convolutions into elementwise multiplications, which are considerably faster to compute for large kernels. Furthermore, such computation could be implemented using an optical 4 system with orders of magnitude faster operation. However, a major challenge in using this spectral approach, as well as in an optical implementation of CNNs, is the inclusion of a nonlinearity between each convolutional layer, without which CNN performance drops dramatically. Here, we propose a spectral CNN linear counterpart (SCLC) network architecture and its optical implementation. We propose a hybrid platform with an optical front end to perform a large number of linear operations, followed by an electronic back end. The key contribution is to develop a knowledge distillation (KD) approach to circumvent the need for nonlinear layers between the convolutional layers and successfully train such networks. While the KD approach is known in machine learning as an effective process for network pruning, we adapt the approach to transfer the knowledge from a nonlinear network (teacher) to a linear counterpart (student), where we can exploit the inherent parallelism of light. We show that the KD approach can achieve performance that easily surpasses the standard linear version of a CNN and could approach the performance of the nonlinear network. Our simulations show that the possibility of increasing the resolution of the input image allows our proposed 4 optical linear network to perform more efficiently than a nonlinear network with the same accuracy on two fundamental image processing tasks: (i) object classification and (ii) semantic segmentation.

摘要

近年来,卷积神经网络(CNN)已经实现了无处不在的图像处理应用。因此,CNN 需要快速的正向传播运行时,以便实时处理高分辨率的视觉流。即使使用最先进的图形和张量处理单元,这仍然是一个具有挑战性的任务。计算效率的瓶颈主要发生在卷积层。在傅立叶域中进行卷积是一种很有前途的加速正向传播的方法,因为它将卷积转换为元素乘法,对于大核来说,计算速度要快得多。此外,这种计算可以使用具有数量级更快操作的光学 4 系统来实现。然而,使用这种谱方法以及 CNN 的光学实现面临的一个主要挑战是,在每个卷积层之间包含一个非线性,否则 CNN 的性能会急剧下降。在这里,我们提出了一种谱 CNN 线性对应物(SCLC)网络架构及其光学实现。我们提出了一种混合平台,具有光学前端来执行大量线性操作,然后是电子后端。关键贡献是开发一种知识蒸馏(KD)方法来规避在卷积层之间使用非线性层的需求,并成功训练这样的网络。虽然 KD 方法在机器学习中作为网络修剪的有效过程是已知的,但我们将该方法应用于从非线性网络(教师)到线性对应物(学生)转移知识,我们可以利用光的固有并行性。我们表明,KD 方法可以实现轻松超过 CNN 的标准线性版本的性能,并且可以接近非线性网络的性能。我们的模拟表明,增加输入图像分辨率的可能性使得我们提出的 4 光学线性网络在执行两项基本图像处理任务(i)对象分类和(ii)语义分割时比具有相同精度的非线性网络更有效。

相似文献

1
Knowledge distillation circumvents nonlinearity for optical convolutional neural networks.知识蒸馏绕过了光学卷积神经网络的非线性。
Appl Opt. 2022 Mar 20;61(9):2173-2183. doi: 10.1364/AO.435738.
2
Optical Diffractive Convolutional Neural Networks Implemented in an All-Optical Way.基于全光方式实现的光学衍射卷积神经网络。
Sensors (Basel). 2023 Jun 20;23(12):5749. doi: 10.3390/s23125749.
3
Catheter segmentation in X-ray fluoroscopy using synthetic data and transfer learning with light U-nets.基于合成数据和轻量级 U 型网络的迁移学习在 X 射线透视下的导管分割
Comput Methods Programs Biomed. 2020 Aug;192:105420. doi: 10.1016/j.cmpb.2020.105420. Epub 2020 Feb 29.
4
Transfer of Learning from Vision to Touch: A Hybrid Deep Convolutional Neural Network for Visuo-Tactile 3D Object Recognition.从视觉到触觉的迁移学习:用于视触 3D 物体识别的混合深度卷积神经网络。
Sensors (Basel). 2020 Dec 27;21(1):113. doi: 10.3390/s21010113.
5
OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.OBELISK-Net:稀疏可变形卷积解决三维多器官分割问题,所需层数更少。
Med Image Anal. 2019 May;54:1-9. doi: 10.1016/j.media.2019.02.006. Epub 2019 Feb 13.
6
Study of the Application of Deep Convolutional Neural Networks (CNNs) in Processing Sensor Data and Biomedical Images.深度学习卷积神经网络(CNNs)在传感器数据和生物医学图像处理中的应用研究。
Sensors (Basel). 2019 Aug 17;19(16):3584. doi: 10.3390/s19163584.
7
DENSE-INception U-net for medical image segmentation.基于密集卷积 Inception 的 U-Net 网络在医学图像分割中的应用
Comput Methods Programs Biomed. 2020 Aug;192:105395. doi: 10.1016/j.cmpb.2020.105395. Epub 2020 Feb 15.
8
Fully hardware-implemented memristor convolutional neural network.全硬件实现的忆阻器卷积神经网络。
Nature. 2020 Jan;577(7792):641-646. doi: 10.1038/s41586-020-1942-4. Epub 2020 Jan 29.
9
Automatically Designing CNN Architectures Using the Genetic Algorithm for Image Classification.使用遗传算法自动设计用于图像分类的 CNN 架构。
IEEE Trans Cybern. 2020 Sep;50(9):3840-3854. doi: 10.1109/TCYB.2020.2983860. Epub 2020 Apr 21.
10
A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.未能学习物体形状几何:对卷积神经网络作为生物视觉合理模型的影响。
Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.

引用本文的文献

1
Transferable polychromatic optical encoder for neural networks.用于神经网络的可转移多色光学编码器。
Nat Commun. 2025 Jul 1;16(1):5623. doi: 10.1038/s41467-025-61338-4.
2
Minimalist Deployment of Neural Network Equalizers in a Bandwidth-Limited Optical Wireless Communication System with Knowledge Distillation.基于知识蒸馏的带宽受限光无线通信系统中神经网络均衡器的极简部署
Sensors (Basel). 2024 Mar 1;24(5):1612. doi: 10.3390/s24051612.