This is an implementation of
# In project root folder
pip install -r requirements.txt
# In project root folder
./run.sh
A plot of reward over time (averaged over 100 runs each) on the same axes, for
A summary comparison plot of rewards over first 1000 steps for the three algorithms with different values of the hyperparameters.