This repo dedicates to the task of stress testing a black-box autnomous vehicle system using multi-agent reinforcement learnign algorithms under an adversarial environment.
The simulation environment used are included as a submodule, which is adapted from highway-env.
The codes in this repo follow the suggested one-file implementation in reinforcement learning community.
There are four algorithms experimented in this repo:
- MACPO (Multi-Agent Constrained Policy Optimization)
- HATRPO (Heterogeneous-Agent Trust Region Policy Optimization)
- CPO (Constrained Policy Optimization)
- TRPO (Trust Region Policy Optimization)
The results are summarized in this thesis
The python environemnt can be created using conda:
conda env create -f environment.yml
To run the training script of an algorithm ALGO
listed above ALGO.py
:
python ALGO.py
To run the testing script of an algorithm ALGO
listed above ALGO_vis.py
:
python ALGO_vis.py
The code within this repository was developed with assistance from MACPO and CleanRL.