June 23, 2019
How reward functions affect our agents' decisions - Pedantic Reinforcement Learning (pt 5)
This post is part of a series. If you haven’t read the introduction yet, you might want to go back and read it so you can understand why we are doing this.
So far, we’ve created an environment that simulates a cluster attempting to handle HTTP requests that come in varying volumes throughout the day. Our agent is responsible for adding and removing servers from our cluster to handle the traffic.
Read more