Skip to content

htdt/ppo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Proximal Policy Optimization

  • Python 3.7, PyTorch 1.2
  • Neat, simple and efficient code
  • atari pacman score ≈4200 after 24h training on T4 GPU

Start

pip install -r requirements.txt
tensorboard --logdir runs
python -m train cartpole

Dependencies

git clone https://github.com/openai/baselines.git
pip install -e baselines

About

Proximal Policy Optimization

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages