Skip to content

Python Implementation for problems in Reinforcement Learning - An Introduction book

Notifications You must be signed in to change notification settings

trunghng/reinforcement_learning_an_introduction

Repository files navigation

Reinforcement Learning: An Introduction

Python Implementation for problems in the Reinforcement Learning: An Introduction written by Richard S. Sutton & Andrew G. Barto.

Many ideas in the solutions are inspired by or taken from this well-known repo of Shangtong Zhang

DETAILS

Chapter 3: Finite MDPs

  • Gridworld - Bellman equation: source code, result (shown on the command prompt after running the code).

Chapter 4: Dynamic Programming

  • Gridworld - Iterative Policy Evaluation: source code, result (shown on the command prompt after running the code).
  • Jack's Car Rental - Policy Iteration: source code, result.
  • Gambler's Problem - Value Iteration: source code, result.

Chapter 5: Monte Carlo Methods

Chapter 6: TD Learning

Chapter 7: n-step Bootstrapping

Chapter 8: Planning and Learning with Tabular Methods

Chapter 9: On-policy Prediction with Approximation

Chapter 10: On-policy Control with Approximation

Chapter 11: Off-policy Methods with Approximation

Chapter 12: Eligible Traces

Chapter 13: Policy Gradient

About

Python Implementation for problems in Reinforcement Learning - An Introduction book

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages