11 8 the performance of reinforcement learning is demonstrated by solving several dynamic scheduling problems