AI Agent Learns To Walk

Project Overview

This project demonstrates the process of training a reinforcement learning agent to walk using the BipedalWalker-v3 environment from OpenAI Gym. The agent employs the Soft Actor-Critic (SAC) algorithm, a model-free, off-policy actor-critic method, to learn how to navigate and balance in a simulated environment.

Environment Setup

The project utilizes several libraries and tools to set up the environment, manage virtual displays, and facilitate rendering. Key libraries include gymnasium, pybullet, pyvirtualdisplay, and tf-agents. The environment setup involves installing dependencies and creating a virtual display for rendering purposes in headless systems.

Training the Agent

The agent is trained using the SAC algorithm, which is implemented using TensorFlow Agents (tf-agents). The training process involves:

Initializing the Environment: The BipedalWalker-v3 environment is loaded, and its specifications (actions, observations, rewards) are obtained.
Defining the Model: The actor and critic networks are defined, specifying the architecture and parameters for each. The SAC agent is then initialized with these networks.
Data Collection: A random policy is used to collect initial experiences, which are stored in a replay buffer managed by Reverb.
Training Loop: The agent is trained over a specified number of iterations. During each iteration, the agent collects new experiences, updates its policy based on these experiences, and evaluates its performance periodically.
Evaluation: The agent's performance is evaluated periodically, and metrics such as average return are logged to track progress.
Saving the Model: The trained policy is saved at regular intervals for future use or deployment.

Evaluation and Visualization

After training, the agent's performance is evaluated by running several episodes in the environment. A video is generated to visually demonstrate the agent's walking behavior. The video is embedded in the project documentation to provide an illustrative example of the agent's capabilities.

Results and Metrics

The performance of the agent is measured using metrics such as average return over multiple episodes. These metrics are logged and plotted to visualize the agent's learning progress over time. The training and evaluation results are crucial for understanding the effectiveness of the reinforcement learning algorithm and the improvements made by the agent.

How to Run the Project

Clone the Repository: Clone the project repository from GitHub to your local machine.
```
git clone https://github.com/your-username/ai-agent-learns-to-walk.git
```
Install Dependencies: Navigate to the project directory and install the required dependencies using pip.
```
cd ai-agent-learns-to-walk
pip install -r requirements.txt
```
Run the Training Script: Execute the training script to start the training process.
```
python train.py
```
Evaluate the Agent: After training, run the evaluation script to generate performance metrics and visualize the agent's behavior.
```
python evaluate.py
```

Conclusion

This project demonstrates the application of the SAC algorithm to train a bipedal walker agent in a simulated environment. By leveraging reinforcement learning techniques, the agent learns to navigate and balance, showcasing the potential of AI in solving complex control tasks.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
AIES MINI PROJECT REPORT.docx		AIES MINI PROJECT REPORT.docx
AIES MINI PROJECT.pptx		AIES MINI PROJECT.pptx
AI_Walker_normal.mp4		AI_Walker_normal.mp4
Agent_AIES.ipynb		Agent_AIES.ipynb
README.md		README.md
graph.png		graph.png
requirements.txt		requirements.txt
walk.png		walk.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Agent Learns To Walk

Project Overview

Environment Setup

Training the Agent

Evaluation and Visualization

Results and Metrics

How to Run the Project

Conclusion

References

About

Releases

Packages

Languages

Doombotino/AI_Learns_to_Walk

Folders and files

Latest commit

History

Repository files navigation

AI Agent Learns To Walk

Project Overview

Environment Setup

Training the Agent

Evaluation and Visualization

Results and Metrics

How to Run the Project

Conclusion

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages