View article

[PDF] from psu.edu

A reinforcement learning approach to job-shop scheduling

Authors

Wei Zhang, Thomas G Dietterich

Publication date

1995/8/20

Journal

Ijcai

Volume

Pages

1114-1120

Description

We apply reinforcement learning methods to learn domain-speci c heuristics for job shop scheduling. A repair-based scheduler starts with a critical-path schedule and incrementally repairs constraint violations with the goal of nding a short con ict-free schedule. The temporal di erence algorithm TD () is applied to train a neural network to learn a heuristic evaluation function over states. This evaluation function is used by a one-step lookahead search procedure to nd good solutions to new scheduling problems. We evaluate this approach on synthetic problems and on problems from a NASA space shuttle payload processing task. The evaluation function is trained on problems involving a small number of jobs and then tested on larger problems. The TD scheduler performs better than the best known existing algorithm for this task| Zweben's iterative repair method based on simulated annealing. The results suggest that reinforcement learning can provide a new method for constructing high-performance scheduling systems.

Total citations

Cited by 625

1995199619971998199920002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022202320244 13 20 15 20 20 20 20 23 15 17 26 36 21 27 17 16 15 14 15 12 14 14 22 21 34 36 42 33 11

Scholar articles

A reinforcement learning approach to job-shop scheduling

W Zhang, TG Dietterich - Ijcai, 1995