L2M2: A Simple Python LLM Manager 💬👍

L2M2 ("LLM Manager" → "LLMM" → "L2M2") is a tiny and very simple LLM manager for Python that exposes lots of models through a unified API. This is useful for evaluation, demos, production applications etc. that need to easily be model-agnostic.

Advantages

  • Simple: Completely unified interface – just swap out the model name.
  • Tiny: Only one external dependency (httpx). No BS dependency graph.
  • Private: Compatible with self-hosted models on your own infrastructure.
  • Fast: Fully asynchronous and non-blocking if concurrent calls are needed.

Features

  • 30+ supported models from popular hosted providers, updated regularly.
  • Support for self-hosted models via Ollama.
  • Manageable chat memory – even across multiple models or with concurrent memory streams (see the sketch after this list).
  • JSON mode.
  • Prompt loading tools.
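
As a quick illustration of chat memory, here is a minimal sketch. The MemoryType option and its import path are assumptions – the usage guide documents the exact interface.

from l2m2.client import LLMClient
from l2m2.memory import MemoryType  # assumed import path

# Assumed option: enable rolling chat memory on the client.
client = LLMClient(memory=MemoryType.CHAT)

client.call(model="gpt-4o", prompt="My name is Pierce")
response = client.call(model="gpt-4o", prompt="What's my name?")
print(response)  # should mention "Pierce" if memory is active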

Supported API-based Models

L2M2 supports 38 models from OpenAI, Google, Anthropic, Cohere, Mistral, Groq, Replicate, and Cerebras. The full list of supported models can be found here.

Usage (Full Docs)

Requirements

  • Python >= 3.9
  • At least one valid API key for a supported provider, or a working Ollama installation (their docs).

Installation

pip install l2m2

Environment Setup

If you plan to use an API-based model, make sure at least one of the following environment variables is set so that L2M2 can automatically activate the provider.

Provider                  Environment Variable
------------------------  --------------------
OpenAI                    OPENAI_API_KEY
Anthropic                 ANTHROPIC_API_KEY
Cohere                    CO_API_KEY
Google                    GOOGLE_API_KEY
Groq                      GROQ_API_KEY
Replicate                 REPLICATE_API_TOKEN
Mistral (La Plateforme)   MISTRAL_API_KEY
Cerebras                  CEREBRAS_API_KEY

Otherwise, ensure Ollama is running – by default L2M2 looks for it at http://localhost:11434, but this can be configured.
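
As a sketch, a key can also be supplied from Python before the client is created (the value below is a placeholder):

import os

from l2m2.client import LLMClient

# Setting any variable from the table above activates that provider.
os.environ["OPENAI_API_KEY"] = "sk-..."  # placeholder key

client = LLMClient()  # OpenAI models are now available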

Basic Usage

from l2m2.client import LLMClient

client = LLMClient()

response = client.call(model="gpt-4o", prompt="Hello world")
print(response)
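
JSON mode, listed under Features, follows the same call pattern. A minimal sketch, assuming it is toggled with a json_mode flag on call – the usage guide documents the exact parameter:

response = client.call(
    model="gpt-4o",
    prompt="Return a JSON object with keys 'city' and 'country' for Paris.",
    json_mode=True,  # assumed flag name
)
print(response)  # a JSON string, e.g. parseable with json.loads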

For the full usage guide, including memory, asynchronous usage, local models, JSON mode, and more, see Usage Guide.
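
For instance, concurrent calls might look like the following. This is a minimal sketch assuming AsyncLLMClient mirrors the synchronous call signature and works as an async context manager; the model IDs are illustrative.

import asyncio

from l2m2.client import AsyncLLMClient

async def main() -> None:
    async with AsyncLLMClient() as client:
        # Fire both calls concurrently; neither blocks the other.
        responses = await asyncio.gather(
            client.call(model="gpt-4o", prompt="Hello world"),
            client.call(model="claude-3.5-sonnet", prompt="Hello world"),
        )
    for response in responses:
        print(response)

asyncio.run(main())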

Planned Features

  • Streaming responses
  • Support for other self-hosted providers (vLLM, GPT4all, LMStudio, etc.)
  • Basic tools for common application workflows: RAG, prompt management, search, etc.
  • More customization with response formats
  • Basic agent & multi-agent system features (a lightweight version of something like LangGraph but with stuff I want)
  • Support for batch APIs where available (OpenAI, Anthropic, etc.)
  • Support for embeddings as well as inference
  • Support for structured outputs where available (Just OpenAI as far as I know)
  • Port this project over to other languages (TypeScript and Go, maybe Rust)
  • ...etc.

Contributing

Contributions are welcome! Please see the contribution guide below.

  • Requirements
  • Setup
    • Clone this repository and create a Python virtual environment.
    • Install dependencies: make init.
    • Create a feature branch and an issue with a description of the feature or bug fix.
  • Develop
    • Run lint, typecheck and tests: make (make lint, make type, and make test can also be run individually).
    • Generate test coverage: make coverage.
    • If you've updated the supported models, run make update-docs to reflect those changes in the README.
  • Integration Test
    • cd into integration_tests.
    • Create a .env file with your API keys, and copy itests.example.py to itests.py.
    • Write your integration tests in itests.py.
    • Run locally with python itests.py -l.
      • Note: make sure to pass the -l flag or else it will look for an L2M2 distribution. Additionally, make sure l2m2 is not installed with pip when running the integration tests locally.
      • A shortcut to do this from the top-level directory is make itl (integration test local).
    • Once your changes are ready, from the top-level directory run make build to create the distribution and make itest to run your integration tests against the distribution.
      • Note: in order to ensure a clean test environment, make itest uninstalls all third-party Python packages before running the tests, so make sure to run make init when you're done working on integration tests.
  • Contribute
    • Create a PR and ping me for a review.
    • Merge!

Contact

If you have requests, suggestions, or any other questions about l2m2, please shoot me a note at [email protected], open an issue on GitHub, or DM me on Slack.
