• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Intelligent decision for joint operations based on improved proximal policy optimization.
Sci Rep. 2025 Mar 19;15(1):9418. doi: 10.1038/s41598-025-86229-y.
2
Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm.基于改进型 PPO 算法的智能型海洋自主水面船舶决策系统。
Sensors (Basel). 2022 Jul 31;22(15):5732. doi: 10.3390/s22155732.
3
An LEO Constellation Early Warning System Decision-Making Method Based on Hierarchical Reinforcement Learning.基于分层强化学习的低地球轨道星座预警系统决策方法。
Sensors (Basel). 2023 Feb 16;23(4):2225. doi: 10.3390/s23042225.
4
Hierarchical Optimization Design for Autonomous Flight of Vision-Based Quadrotor Using Reinforcement Learning.基于强化学习的视觉四旋翼无人机自主飞行分层优化设计
IEEE Trans Cybern. 2025 Jul;55(7):3119-3130. doi: 10.1109/TCYB.2025.3562800.
5
A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles.一种基于多任务融合策略的自动驾驶车辆决策与规划方法
Sensors (Basel). 2023 Aug 8;23(16):7021. doi: 10.3390/s23167021.
6
Multi-robot hierarchical safe reinforcement learning autonomous decision-making strategy based on uniformly ultimate boundedness constraints.基于一致最终有界约束的多机器人分层安全强化学习自主决策策略
Sci Rep. 2025 Feb 18;15(1):5990. doi: 10.1038/s41598-025-89285-6.
7
Optimization of news dissemination push mode by intelligent edge computing technology for deep learning.基于深度学习的智能边缘计算技术对新闻传播推送模式的优化
Sci Rep. 2024 Mar 20;14(1):6671. doi: 10.1038/s41598-024-53859-7.
8
Federated deep reinforcement learning-based urban traffic signal optimal control.基于联邦深度强化学习的城市交通信号最优控制
Sci Rep. 2025 Apr 5;15(1):11724. doi: 10.1038/s41598-025-91966-1.
9
An Intelligent Maneuver Decision-Making Approach for Air Combat Based on Deep Reinforcement Learning and Transformer Networks.一种基于深度强化学习和Transformer网络的空战智能机动决策方法。
Entropy (Basel). 2024 Nov 29;26(12):1036. doi: 10.3390/e26121036.
10
Intelligent air defense task assignment based on hierarchical reinforcement learning.基于分层强化学习的智能防空任务分配
Front Neurorobot. 2022 Dec 1;16:1072887. doi: 10.3389/fnbot.2022.1072887. eCollection 2022.

本文引用的文献

1
Metalearning-Based Alternating Minimization Algorithm for Nonconvex Optimization.基于元学习的非凸优化交替最小化算法
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5366-5380. doi: 10.1109/TNNLS.2022.3165627. Epub 2023 Sep 1.

Intelligent decision for joint operations based on improved proximal policy optimization.

作者信息

Li Chen, Dong Wenhan, He Lei, Cai Ming, Wang Dafei

机构信息

Air Force Engineering University, Xi'an, 710038, China.

出版信息

Sci Rep. 2025 Mar 19;15(1):9418. doi: 10.1038/s41598-025-86229-y.

DOI:10.1038/s41598-025-86229-y
PMID:40108227
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11923204/
Abstract

To tackle challenges such as convergence difficulties and suboptimal performance in the application of reinforcement learning to intelligent decision-making for joint operations, this study introduces an enhanced decision-making approach for joint operations utilizing an improved Proximal Policy Optimization (PPO) algorithm. We propose a structured intelligent decision-making model designed to execute decision-making functions effectively. The strategy loss mechanism is improved by constraining the upper limit of the strategy loss function. Furthermore, a priority sampling mechanism, is developed to assess sample values, thereby enhancing the efficiency of sampling training. Additionally, a network structure facilitating distributed interaction and centralized learning is designed to expedite the training process. The proposed method is then applied to a joint operations simulation platform for intelligent decision-making. Simulation results demonstrate that our algorithm successfully addresses the aforementioned issues, enabling autonomous decisions based on battlefield dynamics, and ultimately leading to victory.

摘要