Bees

Swarm intelligence through RL

Project with ETH's DisCo group

This project aims to explore the capabilities of reinforcement learning in our custom bee-themed environment, by training bees to accomplish simple tasks. These include bringing nectar from flowers back to the hive, and eliminating wasps by surrounding them.

Installation

The ETH ITET cluster Arton uses conda to manage Python packages, but some dependencies in this project, such as PettingZoo, have no conda package. Mixing packages installed in conda and pip is not recommended, so we recommend setting up conda once with Python 3.10.3 and pip 23.1.2, and then using pip to install all packages at once:

conda create -n bees python=3.10.3 pip=23.1.2
conda activate bees
pip install -r requirements.txt --no-cache-dir

You might need to follow the instructions here to install conda on Arton, with the packages stored on /itet-stor/USERNAME/net_scratch.

Training

Define the appropriate config dicts directly in train.py or through experiments.py
Configure the location of the ray_results directory in train.py's RESULTS_DIR, ideally on itet-storage
Optionally log in to WandB to visualize the results in real time and set the corresponding LOG_TO_WANDB
Run python train.py EXPERIMENT_ID locally, or sbatch bees_slurm.sh on Arton (some paths are hardcoded, make sure to change those)

Inference

Download the generated checkpoint file (usually having the name checkpoint_001000 or similar) from the ray_results folder
Set the TRAINING_CHECKPOINT_FILE variable in server.py and adapt the config dicts to match those from training
Run mesa runserver

Project structure

agents.py: The Mesa agents.
model.py: The Mesa model, where agents are positioned and ran.
train.py: The RLlib training code.
experiments.py: The config dicts to reproduce the experiments from the paper.
environments.py: A PettingZoo-API wrapper around the Mesa environment to make it compatible with RLlib.
action_mask_model.py: The wrapper around the model to apply action masks.
comm_net.py: The custom neural networks for communication between the agents.
bees_slurm.sh: The SLURM script to run the training on Arton CPU nodes.
test.py: Some tests for the PettingZoo environment.
visualization/: Visualization code of the game, extracted here to add features in the future, such as animating the agent movement.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bees

Swarm intelligence through RL

Project with ETH's DisCo group

Installation

Training

Inference

Project structure

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
resources		resources
visualization		visualization
.gitignore		.gitignore
README.md		README.md
action_mask_model.py		action_mask_model.py
agents.py		agents.py
bees_slurm.sh		bees_slurm.sh
comm_net.py		comm_net.py
environment.py		environment.py
experiments.py		experiments.py
model.py		model.py
pettingzoo_env.py		pettingzoo_env.py
report.pdf		report.pdf
requirements.txt		requirements.txt
run.py		run.py
server.py		server.py
test.py		test.py
train.py		train.py

lundwall/bees

Folders and files

Latest commit

History

Repository files navigation

Bees

Swarm intelligence through RL

Project with ETH's DisCo group

Installation

Training

Inference

Project structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages