• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器人姿态估计技术的应用前景:基于YOLOv8-ApexNet探索新方向。

The application prospects of robot pose estimation technology: exploring new directions based on YOLOv8-ApexNet.

作者信息

Tang XianFeng, Zhao Shuwei

机构信息

Physical Education Department, Zhejiang Wanli University, Ningbo, China.

Physical Education Department, Hebei University of Technology, Tianjin, China.

出版信息

Front Neurorobot. 2024 Apr 5;18:1374385. doi: 10.3389/fnbot.2024.1374385. eCollection 2024.

DOI:10.3389/fnbot.2024.1374385
PMID:38644904
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11026676/
Abstract

INTRODUCTION

Service robot technology is increasingly gaining prominence in the field of artificial intelligence. However, persistent limitations continue to impede its widespread implementation. In this regard, human motion pose estimation emerges as a crucial challenge necessary for enhancing the perceptual and decision-making capacities of service robots.

METHOD

This paper introduces a groundbreaking model, YOLOv8-ApexNet, which integrates advanced technologies, including Bidirectional Routing Attention (BRA) and Generalized Feature Pyramid Network (GFPN). BRA facilitates the capture of inter-keypoint correlations within dynamic environments by introducing a bidirectional information propagation mechanism. Furthermore, GFPN adeptly extracts and integrates feature information across different scales, enabling the model to make more precise predictions for targets of various sizes and shapes.

RESULTS

Empirical research findings reveal significant performance enhancements of the YOLOv8-ApexNet model across the COCO and MPII datasets. Compared to existing methodologies, the model demonstrates pronounced advantages in keypoint localization accuracy and robustness.

DISCUSSION

The significance of this research lies in providing an efficient and accurate solution tailored for the realm of service robotics, effectively mitigating the deficiencies inherent in current approaches. By bolstering the accuracy of perception and decision-making, our endeavors unequivocally endorse the widespread integration of service robots within practical applications.

摘要

引言

服务机器人技术在人工智能领域正日益凸显其重要性。然而,持续存在的局限性仍阻碍着其广泛应用。在这方面,人体运动姿态估计成为提升服务机器人感知和决策能力所需面对的一项关键挑战。

方法

本文介绍了一种开创性的模型YOLOv8-ApexNet,它集成了包括双向路由注意力(BRA)和通用特征金字塔网络(GFPN)在内的先进技术。BRA通过引入双向信息传播机制,有助于在动态环境中捕捉关键点之间的相关性。此外,GFPN能够巧妙地跨不同尺度提取和整合特征信息,使模型能够对各种大小和形状的目标做出更精确的预测。

结果

实证研究结果表明,YOLOv8-ApexNet模型在COCO和MPII数据集上有显著的性能提升。与现有方法相比,该模型在关键点定位准确性和鲁棒性方面展现出明显优势。

讨论

本研究的意义在于为服务机器人领域提供了一种高效且准确的解决方案,有效缓解了当前方法中固有的缺陷。通过提高感知和决策的准确性,我们的努力明确支持服务机器人在实际应用中的广泛集成。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/870c0c9e27bd/fnbot-18-1374385-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/f65f0eb68f5b/fnbot-18-1374385-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/a612ed921c1b/fnbot-18-1374385-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/439da5fbae3a/fnbot-18-1374385-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/3454faaa6d4c/fnbot-18-1374385-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/870c0c9e27bd/fnbot-18-1374385-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/f65f0eb68f5b/fnbot-18-1374385-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/a612ed921c1b/fnbot-18-1374385-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/439da5fbae3a/fnbot-18-1374385-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/3454faaa6d4c/fnbot-18-1374385-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bcd/11026676/870c0c9e27bd/fnbot-18-1374385-g0005.jpg

相似文献

1
The application prospects of robot pose estimation technology: exploring new directions based on YOLOv8-ApexNet.机器人姿态估计技术的应用前景:基于YOLOv8-ApexNet探索新方向。
Front Neurorobot. 2024 Apr 5;18:1374385. doi: 10.3389/fnbot.2024.1374385. eCollection 2024.
2
YOLOv8-ACU: improved YOLOv8-pose for facial acupoint detection.YOLOv8-ACU:用于面部穴位检测的改进版YOLOv8姿态估计模型
Front Neurorobot. 2024 Feb 1;18:1355857. doi: 10.3389/fnbot.2024.1355857. eCollection 2024.
3
An enhanced real-time human pose estimation method based on modified YOLOv8 framework.基于改进 YOLOv8 框架的增强实时人体姿态估计方法。
Sci Rep. 2024 Apr 5;14(1):8012. doi: 10.1038/s41598-024-58146-z.
4
Smoking behavior detection algorithm based on YOLOv8-MNC.基于YOLOv8-MNC的吸烟行为检测算法
Front Comput Neurosci. 2023 Aug 24;17:1243779. doi: 10.3389/fncom.2023.1243779. eCollection 2023.
5
Cofopose: Conditional 2D Pose Estimation with Transformers.Cofopose:基于 Transformer 的条件 2D 姿态估计。
Sensors (Basel). 2022 Sep 9;22(18):6821. doi: 10.3390/s22186821.
6
A Glove-Wearing Detection Algorithm Based on Improved YOLOv8.基于改进 YOLOv8 的戴手套检测算法。
Sensors (Basel). 2023 Dec 18;23(24):9906. doi: 10.3390/s23249906.
7
Small object detection algorithm incorporating swin transformer for tea buds.用于茶芽的融合 Swin 变换小目标检测算法。
PLoS One. 2024 Mar 21;19(3):e0299902. doi: 10.1371/journal.pone.0299902. eCollection 2024.
8
Research on deep reinforcement learning basketball robot shooting skills improvement based on end to end architecture and multi-modal perception.基于端到端架构和多模态感知的深度强化学习篮球机器人投篮技术改进研究
Front Neurorobot. 2023 Oct 13;17:1274543. doi: 10.3389/fnbot.2023.1274543. eCollection 2023.
9
Cross-modal self-attention mechanism for controlling robot volleyball motion.用于控制机器人排球运动的跨模态自注意力机制。
Front Neurorobot. 2023 Nov 10;17:1288463. doi: 10.3389/fnbot.2023.1288463. eCollection 2023.
10
YOLOv8-C2f-Faster-EMA: An Improved Underwater Trash Detection Model Based on YOLOv8.YOLOv8-C2f-Faster-EMA:一种基于YOLOv8的改进型水下垃圾检测模型。
Sensors (Basel). 2024 Apr 12;24(8):2483. doi: 10.3390/s24082483.

引用本文的文献

1
A human pose estimation network based on YOLOv8 framework with efficient multi-scale receptive field and expanded feature pyramid network.一种基于YOLOv8框架的人体姿态估计网络,具有高效的多尺度感受野和扩展的特征金字塔网络。
Sci Rep. 2025 May 1;15(1):15284. doi: 10.1038/s41598-025-00259-0.
2
DEKR-SPrior: An Efficient Bottom-Up Keypoint Detection Model for Accurate Pod Phenotyping in Soybean.DEKR-SPrior:一种用于大豆精确荚果表型分析的高效自底向上关键点检测模型。
Plant Phenomics. 2024 Jun 27;6:0198. doi: 10.34133/plantphenomics.0198. eCollection 2024.

本文引用的文献

1
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time.AlphaPose:实时的全身区域多人姿态估计和跟踪。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7157-7173. doi: 10.1109/TPAMI.2022.3222784.
2
Stable tissue-mimicking phantoms for longitudinal multimodality imaging studies that incorporate optical, CT, and MRI contrast.用于纵向多模态成像研究的稳定组织模拟体,该研究纳入了光学、CT 和 MRI 对比剂。
J Biomed Opt. 2023 Apr;28(4):046006. doi: 10.1117/1.JBO.28.4.046006. Epub 2023 Apr 20.