• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于层次偏移补偿和时间记忆更新的无人机航空影像视频实例分割

Video Instance Segmentation Through Hierarchical Offset Compensation and Temporal Memory Update for UAV Aerial Images.

作者信息

Huang Ying, Zhang Yinhui, He Zifen, Deng Yunnan

机构信息

Faculty of Mechanical and Electrical Engineering, Kunming University of Science and Technology, Kunming 650500, China.

出版信息

Sensors (Basel). 2025 Jul 9;25(14):4274. doi: 10.3390/s25144274.

DOI:10.3390/s25144274
PMID:40732402
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12299631/
Abstract

Despite the pivotal role of unmanned aerial vehicles (UAVs) in intelligent inspection tasks, existing video instance segmentation methods struggle with irregular deforming targets, leading to inconsistent segmentation results due to ineffective feature offset capture and temporal correlation modeling. To address this issue, we propose a hierarchical offset compensation and temporal memory update method for video instance segmentation (HT-VIS) with a high generalization ability. Firstly, a hierarchical offset compensation (HOC) module in the form of a sequential and parallel connection is designed to perform deformable offset for the same flexible target across frames, which benefits from compensating for spatial motion features at the time sequence. Next, the temporal memory update (TMU) module is developed by employing convolutional long-short-term memory (ConvLSTM) between the current and adjacent frames to establish the temporal dynamic context correlation and update the current frame feature effectively. Finally, extensive experimental results demonstrate the superiority of the proposed HDNet method when applied to the public YouTubeVIS-2019 dataset and a self-built UAV-Seg segmentation dataset. On four typical datasets (i.e., Zoo, Street, Vehicle, and Sport) extracted from YoutubeVIS-2019 according to category characteristics, the proposed HT-VIS outperforms the state-of-the-art CNN-based VIS methods CrossVIS by 3.9%, 2.0%, 0.3%, and 3.8% in average segmentation accuracy, respectively. On the self-built UAV-VIS dataset, our HT-VIS with PHOC surpasses the baseline SipMask by 2.1% and achieves the highest average segmentation accuracy of 37.4% in the CNN-based methods, demonstrating the effectiveness and robustness of our proposed framework.

摘要

尽管无人机在智能检测任务中发挥着关键作用,但现有的视频实例分割方法在处理不规则变形目标时存在困难,由于特征偏移捕获和时间相关性建模无效,导致分割结果不一致。为了解决这个问题,我们提出了一种具有高泛化能力的视频实例分割分层偏移补偿和时间记忆更新方法(HT-VIS)。首先,设计了一种顺序和并行连接形式的分层偏移补偿(HOC)模块,用于对同一灵活目标跨帧执行可变形偏移,这得益于在时间序列上补偿空间运动特征。接下来,通过在当前帧和相邻帧之间采用卷积长短期记忆(ConvLSTM)来开发时间记忆更新(TMU)模块,以建立时间动态上下文相关性并有效地更新当前帧特征。最后,大量实验结果表明,所提出的HDNet方法应用于公共YouTubeVIS-2019数据集和自建的UAV-Seg分割数据集时具有优越性。在根据类别特征从YoutubeVIS-2019中提取的四个典型数据集(即动物园、街道、车辆和体育)上,所提出的HT-VIS在平均分割精度上分别比基于CNN的最先进VIS方法CrossVIS高出3.9%、2.0%、0.3%和3.8%。在自建的UAV-VIS数据集上,我们的带有PHOC的HT-VIS比基线SipMask高出2.1%,并在基于CNN的方法中实现了37.4%的最高平均分割精度,证明了我们提出的框架的有效性和鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/afcfad6bbf89/sensors-25-04274-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/b74bb51b113b/sensors-25-04274-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/d2284a9c2805/sensors-25-04274-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/b6fb9762b1a2/sensors-25-04274-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/992e908b7ea7/sensors-25-04274-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/f55545456f7d/sensors-25-04274-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/c4640871b995/sensors-25-04274-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/99e5ca342167/sensors-25-04274-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/6211a1e9d6c2/sensors-25-04274-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/2dd3c50b47e7/sensors-25-04274-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/8c0783352127/sensors-25-04274-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/1d9d46fcc985/sensors-25-04274-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/86b334d23608/sensors-25-04274-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/afcfad6bbf89/sensors-25-04274-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/b74bb51b113b/sensors-25-04274-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/d2284a9c2805/sensors-25-04274-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/b6fb9762b1a2/sensors-25-04274-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/992e908b7ea7/sensors-25-04274-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/f55545456f7d/sensors-25-04274-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/c4640871b995/sensors-25-04274-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/99e5ca342167/sensors-25-04274-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/6211a1e9d6c2/sensors-25-04274-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/2dd3c50b47e7/sensors-25-04274-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/8c0783352127/sensors-25-04274-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/1d9d46fcc985/sensors-25-04274-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/86b334d23608/sensors-25-04274-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02ef/12299631/afcfad6bbf89/sensors-25-04274-g013.jpg

相似文献

1
Video Instance Segmentation Through Hierarchical Offset Compensation and Temporal Memory Update for UAV Aerial Images.基于层次偏移补偿和时间记忆更新的无人机航空影像视频实例分割
Sensors (Basel). 2025 Jul 9;25(14):4274. doi: 10.3390/s25144274.
2
Contextual information based anomaly detection for multi-scene aerial videos.基于上下文信息的多场景航空视频异常检测
Sci Rep. 2025 Jul 16;15(1):25805. doi: 10.1038/s41598-025-07486-5.
3
Point-cloud segmentation with in-silico data augmentation for prostate cancer treatment.用于前列腺癌治疗的基于计算机模拟数据增强的点云分割
Med Phys. 2025 Apr 3. doi: 10.1002/mp.17815.
4
Semi-Supervised Echocardiography Video Segmentation via Adaptive Spatio-Temporal Tensor Semantic Awareness and Memory Flow.基于自适应时空张量语义感知与记忆流的半监督超声心动图视频分割
IEEE Trans Med Imaging. 2025 May;44(5):2182-2193. doi: 10.1109/TMI.2025.3526955. Epub 2025 May 2.
5
Diffusion semantic segmentation model: A generative model for medical image segmentation based on joint distribution.扩散语义分割模型:一种基于联合分布的医学图像分割生成模型。
Med Phys. 2025 Jul;52(7):e17928. doi: 10.1002/mp.17928. Epub 2025 Jun 8.
6
DASNet a dual branch multi level attention sheep counting network.DASNet是一种双分支多级注意力羊只计数网络。
Sci Rep. 2025 Jul 2;15(1):23228. doi: 10.1038/s41598-025-97929-w.
7
Influence of early through late fusion on pancreas segmentation from imperfectly registered multimodal magnetic resonance imaging.早期至晚期融合对来自配准不完善的多模态磁共振成像的胰腺分割的影响。
J Med Imaging (Bellingham). 2025 Mar;12(2):024008. doi: 10.1117/1.JMI.12.2.024008. Epub 2025 Apr 26.
8
DCMC-UNet: A Novel Segmentation Model for Carbon Traces in Oil-Immersed Transformers Improved with Dynamic Feature Fusion and Adaptive Illumination Enhancement.DCMC-UNet:一种通过动态特征融合和自适应光照增强改进的油浸式变压器碳痕分割新模型。
Sensors (Basel). 2025 Jun 23;25(13):3904. doi: 10.3390/s25133904.
9
Short-Term Memory Impairment短期记忆障碍
10
Measurement Along the Path of Unmanned Aerial Vehicles for Best Horizontal Dilution of Precision and Geometric Dilution of Precision.沿无人机路径进行测量以实现最佳水平精度稀释和几何精度稀释。
Sensors (Basel). 2025 Jun 23;25(13):3901. doi: 10.3390/s25133901.

本文引用的文献

1
Mask R-CNN.Mask R-CNN。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):386-397. doi: 10.1109/TPAMI.2018.2844175. Epub 2018 Jun 5.