• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

模型很重要,训练也很重要:用于光流估计的 CNN 的实证研究。

Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 Jun;42(6):1408-1423. doi: 10.1109/TPAMI.2019.2894353. Epub 2019 Jan 22.

DOI:10.1109/TPAMI.2019.2894353
PMID:30676944
Abstract

We investigate two crucial and closely-related aspects of CNNs for optical flow estimation: models and training. First, we design a compact but effective CNN model, called PWC-Net, according to simple and well-established principles: pyramidal processing, warping, and cost volume processing. PWC-Net is 17 times smaller in size, 2 times faster in inference, and 11 percent more accurate on Sintel final than the recent FlowNet2 model. It is the winning entry in the optical flow competition of the robust vision challenge. Next, we experimentally analyze the sources of our performance gains. In particular, we use the same training procedure for PWC-Net to retrain FlowNetC, a sub-network of FlowNet2. The retrained FlowNetC is 56 percent more accurate on Sintel final than the previously trained one and even 5 percent more accurate than the FlowNet2 model. We further improve the training procedure and increase the accuracy of PWC-Net on Sintel by 10 percent and on KITTI 2012 and 2015 by 20 percent. Our newly trained model parameters and training protocols are available on https://github.com/NVlabs/PWC-Net.

摘要

我们研究了用于光流估计的 CNN 的两个关键且密切相关的方面:模型和训练。首先,我们根据简单而成熟的原则设计了一个紧凑但有效的 CNN 模型,称为 PWC-Net:金字塔处理、变形和代价体处理。PWC-Net 的大小缩小了 17 倍,推理速度提高了 2 倍,在 Sintel 最终测试中比最近的 FlowNet2 模型准确 11%。它是鲁棒视觉挑战赛光流竞赛的获胜者。接下来,我们通过实验分析了我们性能提升的来源。特别是,我们使用相同的训练过程对 FlowNet2 的子网 FlowNetC 进行再训练。再训练的 FlowNetC 在 Sintel 最终测试中的准确性比之前训练的提高了 56%,甚至比 FlowNet2 模型还要准确 5%。我们进一步改进了训练过程,使 PWC-Net 在 Sintel 上的准确性提高了 10%,在 KITTI 2012 和 2015 上的准确性提高了 20%。我们新训练的模型参数和训练协议可在 https://github.com/NVlabs/PWC-Net 上获得。

相似文献

1
Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation.模型很重要,训练也很重要:用于光流估计的 CNN 的实证研究。
IEEE Trans Pattern Anal Mach Intell. 2020 Jun;42(6):1408-1423. doi: 10.1109/TPAMI.2019.2894353. Epub 2019 Jan 22.
2
A Lightweight Optical Flow CNN -Revisiting Data Fidelity and Regularization.一种轻量级光流卷积神经网络——重新审视数据保真度和正则化
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2555-2569. doi: 10.1109/TPAMI.2020.2976928. Epub 2021 Jul 1.
3
Regularization for Unsupervised Learning of Optical Flow.无监督光流学习的正则化。
Sensors (Basel). 2023 Apr 18;23(8):4080. doi: 10.3390/s23084080.
4
Displacement Estimation in Ultrasound Elastography Using Pyramidal Convolutional Neural Network.基于金字塔卷积神经网络的超声弹性成像中的位移估计。
IEEE Trans Ultrason Ferroelectr Freq Control. 2020 Dec;67(12):2629-2639. doi: 10.1109/TUFFC.2020.2973047. Epub 2020 Nov 24.
5
One-shot domain adaptation in multiple sclerosis lesion segmentation using convolutional neural networks.基于卷积神经网络的多发性硬化病变分割中单样本域自适应
Neuroimage Clin. 2019;21:101638. doi: 10.1016/j.nicl.2018.101638. Epub 2018 Dec 10.
6
Semi-supervised learning for automatic segmentation of the knee from MRI with convolutional neural networks.基于卷积神经网络的膝关节 MRI 半自动分割的半监督学习。
Comput Methods Programs Biomed. 2020 Jun;189:105328. doi: 10.1016/j.cmpb.2020.105328. Epub 2020 Jan 11.
7
Reducing the U-Net size for practical scenarios: Virus recognition in electron microscopy images.针对实际情况缩小 U-Net 规模:电子显微镜图像中的病毒识别。
Comput Methods Programs Biomed. 2019 Sep;178:31-39. doi: 10.1016/j.cmpb.2019.05.026. Epub 2019 Jun 1.
8
Real-time and High Quality Ultrasound Elastography Using Convolutional Neural Network by Incorporating Analytic Signal.结合解析信号利用卷积神经网络实现实时高质量超声弹性成像
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:2075-2078. doi: 10.1109/EMBC44109.2020.9176025.
9
KIKI-net: cross-domain convolutional neural networks for reconstructing undersampled magnetic resonance images.KIKI-net:用于重建欠采样磁共振图像的跨域卷积神经网络。
Magn Reson Med. 2018 Nov;80(5):2188-2201. doi: 10.1002/mrm.27201. Epub 2018 Apr 6.
10
Fine-Tuning CNN Image Retrieval with No Human Annotation.无人工标注微调卷积神经网络图像检索。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1655-1668. doi: 10.1109/TPAMI.2018.2846566. Epub 2018 Jun 12.

引用本文的文献

1
Hierarchical Motion Field Alignment for Robust Optical Flow Estimation.用于稳健光流估计的分层运动场对齐
Sensors (Basel). 2025 Apr 22;25(9):2653. doi: 10.3390/s25092653.
2
DDCNet: Deep Dilated Convolutional Neural Network for Dense Prediction.DDCNet:用于密集预测的深度扩张卷积神经网络。
Neurocomputing (Amst). 2023 Feb 28;523:116-129. doi: 10.1016/j.neucom.2022.12.024. Epub 2022 Dec 15.
3
Regularization for Unsupervised Learning of Optical Flow.无监督光流学习的正则化。
Sensors (Basel). 2023 Apr 18;23(8):4080. doi: 10.3390/s23084080.
4
Improved Optical Flow Estimation Method for Deepfake Videos.深度伪造视频的改进光流估计方法。
Sensors (Basel). 2022 Mar 24;22(7):2500. doi: 10.3390/s22072500.
5
Enhancement of Speed and Accuracy Trade-Off for Sports Ball Detection in Videos-Finding Fast Moving, Small Objects in Real Time.提高视频中球类运动检测的速度和精度权衡——实时发现快速移动的小物体。
Sensors (Basel). 2021 May 6;21(9):3214. doi: 10.3390/s21093214.
6
Implicit and Explicit Regularization for Optical Flow Estimation.光流估计的隐式和显式正则化。
Sensors (Basel). 2020 Jul 10;20(14):3855. doi: 10.3390/s20143855.