• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于仿生扑翼飞机的多模态融合图像稳定算法

Multimodal Fusion Image Stabilization Algorithm for Bio-Inspired Flapping-Wing Aircraft.

作者信息

Wang Zhikai, Wang Sen, Hu Yiwen, Zhou Yangfan, Li Na, Zhang Xiaofeng

机构信息

College of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China.

Henan Key Laboratory of Robot and Intelligent System, Henan University of Science and Technology, Luoyang 471023, China.

出版信息

Biomimetics (Basel). 2025 Jul 7;10(7):448. doi: 10.3390/biomimetics10070448.

DOI:10.3390/biomimetics10070448
PMID:40710261
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12292680/
Abstract

This paper presents FWStab, a specialized video stabilization dataset tailored for flapping-wing platforms. The dataset encompasses five typical flight scenarios, featuring 48 video clips with intense dynamic jitter. The corresponding Inertial Measurement Unit (IMU) sensor data are synchronously collected, which jointly provide reliable support for multimodal modeling. Based on this, to address the issue of poor image acquisition quality due to severe vibrations in aerial vehicles, this paper proposes a multi-modal signal fusion video stabilization framework. This framework effectively integrates image features and inertial sensor features to predict smooth and stable camera poses. During the video stabilization process, the true camera motion originally estimated based on sensors is warped to the smooth trajectory predicted by the network, thereby optimizing the inter-frame stability. This approach maintains the global rigidity of scene motion, avoids visual artifacts caused by traditional dense optical flow-based spatiotemporal warping, and rectifies rolling shutter-induced distortions. Furthermore, the network is trained in an unsupervised manner by leveraging a joint loss function that integrates camera pose smoothness and optical flow residuals. When coupled with a multi-stage training strategy, this framework demonstrates remarkable stabilization adaptability across a wide range of scenarios. The entire framework employs Long Short-Term Memory (LSTM) to model the temporal characteristics of camera trajectories, enabling high-precision prediction of smooth trajectories.

摘要

本文介绍了FWStab,这是一个专门为扑翼平台量身定制的视频稳定数据集。该数据集包含五种典型飞行场景,有48个具有强烈动态抖动的视频片段。同时收集了相应的惯性测量单元(IMU)传感器数据,为多模态建模提供了可靠支持。基于此,为解决飞行器剧烈振动导致图像采集质量差的问题,本文提出了一种多模态信号融合视频稳定框架。该框架有效地整合了图像特征和惯性传感器特征,以预测平滑稳定的相机姿态。在视频稳定过程中,最初基于传感器估计的真实相机运动被扭曲到网络预测的平滑轨迹上,从而优化帧间稳定性。这种方法保持了场景运动的全局刚性,避免了传统基于密集光流的时空扭曲所引起的视觉伪影,并校正了卷帘快门引起的失真。此外,通过利用整合相机姿态平滑度和光流残差的联合损失函数,以无监督方式对网络进行训练。当与多阶段训练策略相结合时,该框架在广泛的场景中展现出显著的稳定适应性。整个框架采用长短期记忆(LSTM)对相机轨迹的时间特征进行建模,能够高精度地预测平滑轨迹。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/1ac4ce579595/biomimetics-10-00448-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/816f6989f4c1/biomimetics-10-00448-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/934a3fe4e145/biomimetics-10-00448-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/c483bffd93f5/biomimetics-10-00448-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/c85636b1140d/biomimetics-10-00448-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/2b194301e8f2/biomimetics-10-00448-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/6bb931251a5f/biomimetics-10-00448-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/c8bde4dcaa31/biomimetics-10-00448-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/d9713c5f49ba/biomimetics-10-00448-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/6733911725e5/biomimetics-10-00448-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/0cea856caeb3/biomimetics-10-00448-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/425508494b54/biomimetics-10-00448-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/1ac4ce579595/biomimetics-10-00448-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/816f6989f4c1/biomimetics-10-00448-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/934a3fe4e145/biomimetics-10-00448-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/c483bffd93f5/biomimetics-10-00448-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/c85636b1140d/biomimetics-10-00448-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/2b194301e8f2/biomimetics-10-00448-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/6bb931251a5f/biomimetics-10-00448-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/c8bde4dcaa31/biomimetics-10-00448-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/d9713c5f49ba/biomimetics-10-00448-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/6733911725e5/biomimetics-10-00448-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/0cea856caeb3/biomimetics-10-00448-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/425508494b54/biomimetics-10-00448-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8693/12292680/1ac4ce579595/biomimetics-10-00448-g012.jpg

相似文献

1
Multimodal Fusion Image Stabilization Algorithm for Bio-Inspired Flapping-Wing Aircraft.用于仿生扑翼飞机的多模态融合图像稳定算法
Biomimetics (Basel). 2025 Jul 7;10(7):448. doi: 10.3390/biomimetics10070448.
2
Short-Term Memory Impairment短期记忆障碍
3
A hybrid model for detecting motion artifacts in ballistocardiogram signals.一种用于检测心冲击图信号中运动伪影的混合模型。
Biomed Eng Online. 2025 Jul 23;24(1):92. doi: 10.1186/s12938-025-01426-0.
4
A fake news detection model using the integration of multimodal attention mechanism and residual convolutional network.一种融合多模态注意力机制和残差卷积网络的假新闻检测模型。
Sci Rep. 2025 Jul 1;15(1):20544. doi: 10.1038/s41598-025-05702-w.
5
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
6
Multimodal cross-system virtual reality (VR) ball throwing dataset for VR biometrics.用于虚拟现实生物识别技术的多模态跨系统虚拟现实(VR)抛球数据集。
Data Brief. 2025 Jun 26;61:111827. doi: 10.1016/j.dib.2025.111827. eCollection 2025 Aug.
7
Sparse-view spectral CT reconstruction via a coupled subspace representation and score-based generative model.基于耦合子空间表示和基于分数的生成模型的稀疏视图光谱CT重建
Quant Imaging Med Surg. 2025 Jun 6;15(6):5474-5495. doi: 10.21037/qims-24-2226. Epub 2025 May 28.
8
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
9
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
10
EDT-MCFEF: a multi-channel feature fusion model for emergency department triage of medical texts.EDT-MCFEF:一种用于医学文本急诊科分诊的多通道特征融合模型。
Front Public Health. 2025 Jun 18;13:1591491. doi: 10.3389/fpubh.2025.1591491. eCollection 2025.

本文引用的文献

1
Unsupervised Global and Local Homography Estimation With Motion Basis Learning.无监督全局和局部运动基学习的单应性估计。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7885-7899. doi: 10.1109/TPAMI.2022.3223789. Epub 2023 May 5.
2
DUT: Learning Video Stabilization by Simply Watching Unstable Videos.DUT:通过简单观看不稳定视频来学习视频稳定化
IEEE Trans Image Process. 2022;31:4306-4320. doi: 10.1109/TIP.2022.3182887. Epub 2022 Jun 29.
3
Content-Aware Unsupervised Deep Homography Estimation and its Extensions.内容感知无监督深度单应估计及其扩展。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2849-2863. doi: 10.1109/TPAMI.2022.3174130. Epub 2023 Feb 3.
4
PWStableNet: Learning Pixel-wise Warping Maps for Video Stabilization.PWStableNet:用于视频稳定的逐像素扭曲映射学习
IEEE Trans Image Process. 2020 Jan 7. doi: 10.1109/TIP.2019.2963380.
5
Deep Online Video Stabilization with Multi-Grid Warping Transformation Learning.基于多网格扭曲变换学习的深度在线视频稳定化
IEEE Trans Image Process. 2018 Nov 30. doi: 10.1109/TIP.2018.2884280.
6
CodingFlow: Enable Video Coding for Video Stabilization.编码流:用于视频稳定的视频编码。
IEEE Trans Image Process. 2017 Jul;26(7):3291-3302. doi: 10.1109/TIP.2017.2697759. Epub 2017 Apr 24.
7
Spatially and temporally optimized video stabilization.时空优化视频稳定化。
IEEE Trans Vis Comput Graph. 2013 Aug;19(8):1354-61. doi: 10.1109/TVCG.2013.11.
8
Full-frame video stabilization with motion inpainting.采用运动修复技术的全帧视频稳定
IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1150-63. doi: 10.1109/TPAMI.2006.141.