• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于新型方法的强化学习,结合深度时间差分网络用于柔性双车间调度问题。

A novel method-based reinforcement learning with deep temporal difference network for flexible double shop scheduling problem.

作者信息

Wang Xiao, Zhong Peisi, Liu Mei, Zhang Chao, Yang Shihao

机构信息

College of Mechanical and Electronic Engineering, Shandong University of Science and Technology, Qingdao, 266590, China.

Advanced Manufacturing Technology Centre, Shandong University of Science and Technology, Qingdao, 266590, China.

出版信息

Sci Rep. 2024 Apr 20;14(1):9047. doi: 10.1038/s41598-024-59414-8.

DOI:10.1038/s41598-024-59414-8
PMID:38641689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11031591/
Abstract

This paper studies the flexible double shop scheduling problem (FDSSP) that considers simultaneously job shop and assembly shop. It brings about the problem of scheduling association of the related tasks. To this end, a reinforcement learning algorithm with a deep temporal difference network is proposed to minimize the makespan. Firstly, the FDSSP is defined as the mathematical model of the flexible job-shop scheduling problem joined to the assembly constraint level. It is translated into a Markov decision process that directly selects behavioral strategies according to historical machining state data. Secondly, the proposed ten generic state features are input into the deep neural network model to fit the state value function. Similarly, eight simple constructive heuristics are used as candidate actions for scheduling decisions. From the greedy mechanism, optimally combined actions of all machines are obtained for each decision step. Finally, a deep temporal difference reinforcement learning framework is established, and a large number of comparative experiments are designed to analyze the basic performance of this algorithm. The results showed that the proposed algorithm was better than most other methods, which contributed to solving the practical production problem of the manufacturing industry.

摘要

本文研究了同时考虑作业车间和装配车间的柔性双车间调度问题(FDSSP)。它带来了相关任务调度关联的问题。为此,提出了一种具有深度时序差分网络的强化学习算法,以最小化完工时间。首先,将FDSSP定义为与装配约束水平相结合的柔性作业车间调度问题的数学模型。它被转化为一个马尔可夫决策过程,该过程根据历史加工状态数据直接选择行为策略。其次,将所提出的十个通用状态特征输入到深度神经网络模型中,以拟合状态值函数。同样,八个简单的构造启发式算法被用作调度决策的候选动作。从贪心机制出发,为每个决策步骤获得所有机器的最优组合动作。最后,建立了一个深度时序差分强化学习框架,并设计了大量对比实验来分析该算法的基本性能。结果表明,所提出的算法优于大多数其他方法,这有助于解决制造业的实际生产问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/c15109d94c37/41598_2024_59414_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/fc6fb4eddcce/41598_2024_59414_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/f6d5badab8bb/41598_2024_59414_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/e5e440ce4180/41598_2024_59414_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/cd1cda8e7ece/41598_2024_59414_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/33f79ad7b645/41598_2024_59414_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/c15109d94c37/41598_2024_59414_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/fc6fb4eddcce/41598_2024_59414_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/f6d5badab8bb/41598_2024_59414_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/e5e440ce4180/41598_2024_59414_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/cd1cda8e7ece/41598_2024_59414_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/33f79ad7b645/41598_2024_59414_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a709/11031591/c15109d94c37/41598_2024_59414_Fig4_HTML.jpg

相似文献

1
A novel method-based reinforcement learning with deep temporal difference network for flexible double shop scheduling problem.一种基于新型方法的强化学习,结合深度时间差分网络用于柔性双车间调度问题。
Sci Rep. 2024 Apr 20;14(1):9047. doi: 10.1038/s41598-024-59414-8.
2
An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems.一种基于深度强化学习的演员-评论家框架,用于解决柔性作业车间调度问题。
Math Biosci Eng. 2024 Jan;21(1):1445-1471. doi: 10.3934/mbe.2024062. Epub 2022 Dec 28.
3
Solving flexible job shop scheduling problems with transportation time based on improved genetic algorithm.基于改进遗传算法的带运输时间的柔性作业车间调度问题求解
Math Biosci Eng. 2019 Feb 20;16(3):1334-1347. doi: 10.3934/mbe.2019065.
4
A Reinforcement Learning Approach for Flexible Job Shop Scheduling Problem With Crane Transportation and Setup Times.一种用于具有起重机运输和准备时间的柔性作业车间调度问题的强化学习方法。
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5695-5709. doi: 10.1109/TNNLS.2022.3208942. Epub 2024 Apr 4.
5
An Improved Genetic Algorithm for Solving the Multi-AGV Flexible Job Shop Scheduling Problem.一种改进的遗传算法求解多 AGV 柔性作业车间调度问题。
Sensors (Basel). 2023 Apr 7;23(8):3815. doi: 10.3390/s23083815.
6
Flexible Job Shop Scheduling via Dual Attention Network-Based Reinforcement Learning.基于双注意力网络强化学习的柔性作业车间调度
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3091-3102. doi: 10.1109/TNNLS.2023.3306421. Epub 2024 Feb 29.
7
Hybrid Deep Neural Network Scheduler for Job-Shop Problem Based on Convolution Two-Dimensional Transformation.基于卷积二维变换的作业车间问题混合深度神经网络调度器。
Comput Intell Neurosci. 2019 Jul 10;2019:7172842. doi: 10.1155/2019/7172842. eCollection 2019.
8
Minimizing the makespan and carbon emissions in the green flexible job shop scheduling problem with learning effects.最小化学习效应的绿色柔性作业车间调度问题的最大完工时间和碳排放。
Sci Rep. 2023 Apr 19;13(1):6369. doi: 10.1038/s41598-023-33615-z.
9
Dynamic Intelligent Scheduling in Low-Carbon Heterogeneous Distributed Flexible Job Shops with Job Insertions and Transfers.具有作业插入和转移的低碳异构分布式柔性作业车间中的动态智能调度
Sensors (Basel). 2024 Mar 31;24(7):2251. doi: 10.3390/s24072251.
10
A Novel Approach to the Job Shop Scheduling Problem Based on the Deep Q-Network in a Cooperative Multi-Access Edge Computing Ecosystem.基于深度 Q 网络的协同多接入边缘计算环境下作业车间调度问题的新方法。
Sensors (Basel). 2021 Jul 2;21(13):4553. doi: 10.3390/s21134553.

本文引用的文献

1
Multi-objective flexible job-shop scheduling problem using modified discrete particle swarm optimization.基于改进离散粒子群优化算法的多目标柔性作业车间调度问题
Springerplus. 2016 Aug 30;5(1):1432. doi: 10.1186/s40064-016-3054-z. eCollection 2016.
2
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
3
A fast learning algorithm for deep belief nets.一种用于深度信念网络的快速学习算法。
Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.