一种基于强化学习的作业车间动态调度方法

魏英姿; 赵明扬

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名

邮箱

手机号码

标题

留言内容

验证码

一种基于强化学习的作业车间动态调度方法

1.
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016

详细信息

通讯作者:
魏英姿

计量
- 文章访问数: 3293
- HTML全文浏览量: 190
- PDF下载量: 3439
- 被引次数: 0
出版历程
- 收稿日期: 2004-08-12
- 修回日期: 2005-06-26
- 刊出日期: 2005-09-20

A Reinforcement Learning-based Approach to DynamicJob-shop Scheduling

1.
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016

More Information

Corresponding author: WEI Ying-Zi

摘要

摘要: Production scheduling is critical to manufacturing system. Dispatching rules are usually applied dynamically to schedule the job in a dynamic job-shop. Existing scheduling approaches sel- dom address machine selection in the scheduling process. Composite rules, considering both machine selection and job selection, are proposed in this paper. The dynamic system is trained to enhance its learning and adaptive capability by a reinforcement learning (RL) algorithm. We define the conception of pressure to describe the system feature. Designing a reward function should be guided by the scheduling goal to accurately record the learning progress. Competitive results with the RL-based approach show that it can be used as real-time scheduling technology.
- Reinforcement learning /
- composite rules /
- mean tardiness /
- dynamic job-shop scheduling
Abstract: Production scheduling is critical to manufacturing system. Dispatching rules are usually applied dynamically to schedule the job in a dynamic job-shop. Existing scheduling approaches sel- dom address machine selection in the scheduling process. Composite rules, considering both machine selection and job selection, are proposed in this paper. The dynamic system is trained to enhance its learning and adaptive capability by a reinforcement learning (RL) algorithm. We define the conception of pressure to describe the system feature. Designing a reward function should be guided by the scheduling goal to accurately record the learning progress. Competitive results with the RL-based approach show that it can be used as real-time scheduling technology.
- Reinforcement learning /
- composite rules /
- mean tardiness /
- dynamic job-shop scheduling