Earl: a framework for scalable reinforcement learning research

Earl is a library of reinforcement learning (RL) building blocks that strives to makes it easy to build simple, efficient, and readable agents.

Earl is built on JAX and Equinox.

Earl implements the two architectures described in "Podracer architectures for scalable Reinforcement Learning", which were used at DeepMind to scale training to very large batch sizes across many chips. This repository includes a few agents (AKA RL algorithms), notably R2D2 as described in "Recurrent Experience Replay In Distributed Reinforcement Learning".

The most important parts of Earl are:

The Agent abstract base class. It is designed to be flexible enough to allow implementation of a wide variety of RL algorithm, but structured enough to allow for standardized environment loops to be used for training all such agents. Earl agents are implemented using Equinox.
GymnaxLoop. For jax.jit-compatible environments, Earl supports the Gymnax interface. GymnaxLoop implements distributed training for these environments. This implements the Anakin architecture from the Podracer paper.
GymnasiumLoop. For other environments, Earl will support the Gymnasium interface. GymnasiumLoop implements distributed training for these environments. This implements the Sebulba architecture from the Podracer paper.

For an example of training an agent using both loops, see this notebook.

Included example agent implemented in Earl:

Simple Policy Gradient (very simple).
R2D2 (quite complicated).

There is currently no package on PyPi, but Earl is pure Python, so it can be installed from source, e.g.:

uv pip install "earl @ git+https://github.com/garymm/earl.git"

Here's a blog post that discusses some of the rationale and lessons learned developing Earl.

Development

Testing

There are two ways to run tests:

Bazel

Testing with Bazel is what happens in GitHub workflows. It's good for running lots of tests in parallel and it intelligently caches results. However it has a high overhead so running an individual test is slower and it doesn't have the same level of control as Pytest.

Currently running tests with Bazel is only supported on Linux x86_64.

Install Bazelisk, name it bazel.

Then run tests with:

bazel test //...

Pytest

To set this up first create a virtual environment (see section below), then source .venv/bin/activate. Then you can run pytest normally.

VEnv / IDE Setup

When using VS Code intall the recommended extensions by searching for @recommended.

Set up a virtual environment with:

bazel run //:dot_venv_linux_x86_64

This will create a .venv directory with the dependencies so you can use it with your IDE.

Citation

If you use Earl in your research, please cite it:

@software{miguel2024earl,
  author = {Miguel, Gary},
  title = {Earl: A Framework for Scalable Reinforcement Learning Research},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  url = {https://github.com/garymm/earl},
  description = {A library of reinforcement learning (RL) building blocks that strives to makes it easy to build simple, efficient, and readable agents}
}

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
.github/workflows		.github/workflows
.vscode		.vscode
earl		earl
tools		tools
.bazelignore		.bazelignore
.bazeliskrc		.bazeliskrc
.bazelrc		.bazelrc
.gitignore		.gitignore
BUILD.bazel		BUILD.bazel
LICENSE		LICENSE
MODULE.bazel		MODULE.bazel
MODULE.bazel.lock		MODULE.bazel.lock
NOTES.md		NOTES.md
README.md		README.md
pyproject.toml		pyproject.toml
requirements_linux_x86_64.txt		requirements_linux_x86_64.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Earl: a framework for scalable reinforcement learning research

Development

Testing

Bazel

Pytest

VEnv / IDE Setup

Citation

About

Releases

Packages

Contributors 4

Languages

License

garymm/earl

Folders and files

Latest commit

History

Repository files navigation

Earl: a framework for scalable reinforcement learning research

Development

Testing

Bazel

Pytest

VEnv / IDE Setup

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages