• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CrowdGAN:无身份交互人群视频生成及其他应用

CrowdGAN: Identity-Free Interactive Crowd Video Generation and Beyond.

作者信息

Chai Liangyu, Liu Yongtuo, Liu Wenxi, Han Guoqiang, He Shengfeng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2856-2871. doi: 10.1109/TPAMI.2020.3043372. Epub 2022 May 5.

DOI:10.1109/TPAMI.2020.3043372
PMID:33290212
Abstract

In this paper, we introduce a novel yet challenging research problem, interactive crowd video generation, committed to producing diverse and continuous crowd video, and relieving the difficulty of insufficient annotated real-world datasets in crowd analysis. Our goal is to recursively generate realistic future crowd video frames given few context frames, under the user-specified guidance, namely individual positions of the crowd. To this end, we propose a deep network architecture specifically designed for crowd video generation that is composed of two complementary modules, each of which combats the problems of crowd dynamic synthesis and appearance preservation respectively. Particularly, a spatio-temporal transfer module is proposed to infer the crowd position and structure from guidance and temporal information, and a point-aware flow prediction module is presented to preserve appearance consistency by flow-based warping. Then, the outputs of the two modules are integrated by a self-selective fusion unit to produce an identity-preserved and continuous video. Unlike previous works, we generate continuous crowd behaviors beyond identity annotations or matching. Extensive experiments show that our method is effective for crowd video generation. More importantly, we demonstrate the generated video can produce diverse crowd behaviors and be used for augmenting different crowd analysis tasks, i.e., crowd counting, anomaly detection, crowd video prediction. Code is available at https://github.com/Icep2020/CrowdGAN.

摘要

在本文中,我们引入了一个新颖但具有挑战性的研究问题——交互式人群视频生成,致力于生成多样化且连续的人群视频,并缓解人群分析中真实世界标注数据集不足的难题。我们的目标是在用户指定的指导下,即人群的个体位置,根据少量上下文帧递归地生成逼真的未来人群视频帧。为此,我们提出了一种专门为人群视频生成设计的深度网络架构,它由两个互补模块组成,每个模块分别解决人群动态合成和外观保留的问题。具体而言,提出了一个时空转移模块,用于从指导信息和时间信息中推断人群位置和结构,并提出了一个点感知流预测模块,通过基于流的扭曲来保持外观一致性。然后,两个模块的输出由一个自选择融合单元整合,以生成一个保持身份并连续的视频。与以往的工作不同,我们生成的连续人群行为超越了身份标注或匹配。大量实验表明,我们的方法在人群视频生成方面是有效的。更重要的是,我们证明了生成的视频可以产生多样化的人群行为,并可用于增强不同的人群分析任务,即人群计数、异常检测、人群视频预测。代码可在https://github.com/Icep2020/CrowdGAN获取。

相似文献

1
CrowdGAN: Identity-Free Interactive Crowd Video Generation and Beyond.CrowdGAN:无身份交互人群视频生成及其他应用
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2856-2871. doi: 10.1109/TPAMI.2020.3043372. Epub 2022 May 5.
2
Video Crowd Localization With Multifocus Gaussian Neighborhood Attention and a Large-Scale Benchmark.基于多焦点高斯邻域注意力和大规模基准的视频人群定位
IEEE Trans Image Process. 2022;31:6032-6047. doi: 10.1109/TIP.2022.3205210. Epub 2022 Sep 19.
3
CLRNet: A Cross Locality Relation Network for Crowd Counting in Videos.
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6408-6422. doi: 10.1109/TNNLS.2022.3209918. Epub 2024 May 2.
4
Revisiting crowd behaviour analysis through deep learning: Taxonomy, anomaly detection, crowd emotions, datasets, opportunities and prospects.通过深度学习重新审视人群行为分析:分类法、异常检测、人群情绪、数据集、机遇与前景。
Inf Fusion. 2020 Dec;64:318-335. doi: 10.1016/j.inffus.2020.07.008. Epub 2020 Jul 29.
5
Image Comes Dancing With Collaborative Parsing-Flow Video Synthesis.图像与协作解析流视频合成共舞。
IEEE Trans Image Process. 2021;30:9259-9269. doi: 10.1109/TIP.2021.3123549. Epub 2021 Nov 12.
6
Multidimensional Measure Matching for Crowd Counting.用于人群计数的多维测量匹配
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9112-9126. doi: 10.1109/TNNLS.2024.3435854. Epub 2025 May 2.
7
HRANet: Hierarchical region-aware network for crowd counting.HRANet:用于人群计数的分层区域感知网络。
Appl Intell (Dordr). 2022;52(11):12191-12205. doi: 10.1007/s10489-021-03030-w. Epub 2022 Feb 2.
8
DMPNet: densely connected multi-scale pyramid networks for crowd counting.DMPNet:用于人群计数的密集连接多尺度金字塔网络
PeerJ Comput Sci. 2022 Mar 18;8:e902. doi: 10.7717/peerj-cs.902. eCollection 2022.
9
Consistency-Aware Anchor Pyramid Network for Crowd Localization.用于人群定位的一致性感知锚点金字塔网络。
IEEE Trans Pattern Anal Mach Intell. 2024 Apr 29;PP. doi: 10.1109/TPAMI.2024.3392013.
10
Redesigning Multi-Scale Neural Network for Crowd Counting.重新设计用于人群计数的多尺度神经网络。
IEEE Trans Image Process. 2023;32:3664-3678. doi: 10.1109/TIP.2023.3289290. Epub 2023 Jul 4.

引用本文的文献

1
Design of an integrated model with temporal graph attention and transformer-augmented RNNs for enhanced anomaly detection.用于增强异常检测的具有时间图注意力和Transformer增强循环神经网络的集成模型设计
Sci Rep. 2025 Jan 21;15(1):2692. doi: 10.1038/s41598-025-85822-5.