I cannot find the implementation of PPO in this project. From the docs I know the policy is compatible with Tianshou, but what about the trainer? How can I use PPO to train in psro_scenario? I would appreciate it if you could answer my question.
@donotbelieveit PPO is not ready yet, as further tests are required, but you can follow our submission to malib.rl.ppo (coming soon). Btw, you can refer to the given training example (here) for using RL subroutines in PSRO. If you want to understand the mechanism of the RL trainer, please refer to this MARL example: examples/run_gym.py. Also, please feel free to open a PR if you have any ideas to enrich our (MA)RL algorithm lib under malib/rl. :)
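To make the PSRO mechanics concrete, here is a minimal, self-contained sketch of the outer PSRO loop on rock-paper-scissors. Everything below is illustrative and is not malib's actual API: in a real run, the `best_response` step is where an RL trainer such as PPO would learn an approximate best response, while here it is computed exactly for clarity.

```python
# Minimal PSRO skeleton on rock-paper-scissors (illustrative, not malib's API).
# Row player's payoff matrix: actions 0=rock, 1=paper, 2=scissors.
PAYOFF = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]

def expected_payoff(action, opponent_strategy):
    """Expected payoff of a pure action against a mixed opponent strategy."""
    return sum(p * PAYOFF[action][a] for a, p in enumerate(opponent_strategy))

def best_response(opponent_strategy):
    # In a real PSRO run, this is where a PPO (or other RL) trainer would
    # train a new policy against the current meta-strategy; here we compute
    # the exact best response since the game is a tiny matrix game.
    return max(range(3), key=lambda a: expected_payoff(a, opponent_strategy))

def psro(iterations=5):
    population = [0]  # start the policy population with a single pure policy
    for _ in range(iterations):
        # Meta-strategy: uniform over the current population (a common
        # default; a Nash or replicator-dynamics solver could be used instead).
        meta = [0.0, 0.0, 0.0]
        for a in population:
            meta[a] += 1 / len(population)
        # Expand the population with a best response to the meta-strategy.
        population.append(best_response(meta))
    return population

print(psro())
```

The loop mirrors what psro_scenario orchestrates at a larger scale: evaluate the population to build a meta-game, solve for a meta-strategy, then call an RL subroutine to add a best-response policy.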