• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于机器人操作的扩散模型:一项综述。

Diffusion models for robotic manipulation: a survey.

作者信息

Wolf Rosa, Shi Yitian, Liu Sheng, Rayyes Rania

机构信息

AI and Robotics (AIR), Institute of Material Handling and Logistics (IFL), Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany.

出版信息

Front Robot AI. 2025 Sep 9;12:1606247. doi: 10.3389/frobt.2025.1606247. eCollection 2025.

DOI:10.3389/frobt.2025.1606247
PMID:40995149
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12454101/
Abstract

Diffusion generative models have demonstrated remarkable success in visual domains such as image and video generation. They have also recently emerged as a promising approach in robotics, especially in robot manipulations. Diffusion models leverage a probabilistic framework, and they stand out with their ability to model multi-modal distributions and their robustness to high-dimensional input and output spaces. This survey provides a comprehensive review of state-of-the-art diffusion models in robotic manipulation, including grasp learning, trajectory planning, and data augmentation. Diffusion models for scene and image augmentation lie at the intersection of robotics and computer vision for vision-based tasks to enhance generalizability and data scarcity. This paper also presents the two main frameworks of diffusion models and their integration with imitation learning and reinforcement learning. In addition, it discusses the common architectures and benchmarks and points out the challenges and advantages of current state-of-the-art diffusion-based methods.

摘要

扩散生成模型在图像和视频生成等视觉领域已取得显著成功。它们最近在机器人技术中也成为一种很有前景的方法,特别是在机器人操作方面。扩散模型利用概率框架,以其对多模态分布进行建模的能力以及对高维输入和输出空间的鲁棒性脱颖而出。本综述全面回顾了机器人操作中最先进的扩散模型,包括抓取学习、轨迹规划和数据增强。用于场景和图像增强的扩散模型处于机器人技术和计算机视觉的交叉点,用于基于视觉的任务,以提高通用性和解决数据稀缺问题。本文还介绍了扩散模型的两个主要框架及其与模仿学习和强化学习的集成。此外,它讨论了常见的架构和基准,并指出了当前最先进的基于扩散的方法的挑战和优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4035/12454101/ee291c4e169a/frobt-12-1606247-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4035/12454101/122e2b3d36e4/frobt-12-1606247-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4035/12454101/ee291c4e169a/frobt-12-1606247-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4035/12454101/122e2b3d36e4/frobt-12-1606247-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4035/12454101/ee291c4e169a/frobt-12-1606247-g002.jpg

相似文献

1
Diffusion models for robotic manipulation: a survey.用于机器人操作的扩散模型:一项综述。
Front Robot AI. 2025 Sep 9;12:1606247. doi: 10.3389/frobt.2025.1606247. eCollection 2025.
2
Vesicoureteral Reflux膀胱输尿管反流
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Mid Forehead Brow Lift额中眉提升术
5
Short-Term Memory Impairment短期记忆障碍
6
Shoulder Arthrogram肩关节造影
7
A comprehensive comparative study of generative adversarial network architectures for synthetic computed tomography generation in the abdomen.用于腹部合成计算机断层扫描生成的生成对抗网络架构的全面比较研究。
Med Phys. 2025 Aug;52(8):e18038. doi: 10.1002/mp.18038.
8
General 3D Vision-Language Model With Fast Rendering and Pre-Training Vision-Language Alignment.具有快速渲染和预训练视觉语言对齐的通用3D视觉语言模型。
IEEE Trans Pattern Anal Mach Intell. 2025 Sep;47(9):7352-7368. doi: 10.1109/TPAMI.2025.3566593.
9
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
10
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.

本文引用的文献

1
ManiDext: Hand-Object Manipulation Synthesis via Continuous Correspondence Embeddings and Residual-Guided Diffusion.ManiDext:通过连续对应嵌入和残差引导扩散进行手部物体操作合成
IEEE Trans Pattern Anal Mach Intell. 2025 Jul 11;PP. doi: 10.1109/TPAMI.2025.3588302.
2
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges.模仿学习综述:算法、最新进展与挑战
IEEE Trans Cybern. 2024 Dec;54(12):7173-7186. doi: 10.1109/TCYB.2024.3395626. Epub 2024 Nov 27.