School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China.
College of Mechanical and Electrical Engineering, Shihezi University, Shihezi 832003, China.
Sensors (Basel). 2022 Jun 7;22(12):4316. doi: 10.3390/s22124316.
Agricultural robots are an important means of promoting agricultural modernization and improving agricultural efficiency. With the development of artificial intelligence and the maturation of Internet of Things (IoT) technology, higher demands are being placed on robot intelligence. Agricultural robots must provide intelligent control in agricultural scenarios and be able to plan paths autonomously to complete agricultural tasks. To meet this requirement, this paper proposes a Residual-like Soft Actor Critic (R-SAC) algorithm for agricultural scenarios that achieves safe obstacle avoidance and intelligent path planning for robots. In addition, to alleviate the time-consuming exploration process of reinforcement learning, this paper proposes an offline expert-experience pre-training method that improves training efficiency. Moreover, this paper optimizes the reward mechanism of the algorithm using a multi-step TD error, which resolves dilemmas that can arise during training. Experiments verify that the proposed method performs stably in both static and dynamic obstacle environments and outperforms other reinforcement learning algorithms. It is a stable and efficient path planning method with clear application potential for agricultural robots.
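The multi-step TD error mentioned in the abstract is commonly built on an n-step bootstrapped return combined with SAC's entropy-regularized soft value. The sketch below is a minimal illustration of that idea, not the paper's actual implementation; the actor/critic interfaces (actor.sample, q1_target, q2_target) and the hyperparameters are assumptions made for the example.

```python
# Minimal sketch of an n-step TD target for an SAC-style critic update.
# All interfaces here (actor.sample, q1_target, q2_target) are assumed
# for illustration and are not taken from the paper.
import torch

def n_step_td_target(rewards, next_state, done, actor, q1_target, q2_target,
                     gamma=0.99, alpha=0.2):
    """rewards: tensor [n] of rewards over n consecutive steps;
    next_state: the state reached after the n-th step, batched as [1, obs_dim];
    done: 1.0 if the episode ended within the n steps, else 0.0."""
    n = rewards.shape[0]
    # Discounted sum of the n intermediate rewards.
    discounts = gamma ** torch.arange(n, dtype=rewards.dtype)
    n_step_return = (discounts * rewards).sum()

    with torch.no_grad():
        # Sample the bootstrap action from the current policy and use the
        # clipped double-Q value with SAC's entropy bonus (soft value).
        next_action, log_prob = actor.sample(next_state)
        q_min = torch.min(q1_target(next_state, next_action),
                          q2_target(next_state, next_action))
        soft_value = q_min - alpha * log_prob

    # Bootstrap only if the episode did not terminate within the n steps.
    return n_step_return + (1.0 - done) * (gamma ** n) * soft_value.squeeze()
```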