Hand-picked awesome Python libraries and frameworks, organised by category
Interactive version: www.awesomepython.org
Updated 06 Feb 2025
- Newly Created Repositories - Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here (10 repos)
- Agentic AI - Agentic AI libraries, frameworks and tools: AI agents, workflows, autonomous decision-making, goal-oriented tasks, and API integrations (51 repos)
- Code Quality - Code quality tooling: linters, formatters, pre-commit hooks, unused code removal (16 repos)
- Crypto and Blockchain - Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity (14 repos)
- Data - General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks (115 repos)
- Debugging - Debugging and tracing tools (10 repos)
- Diffusion Text to Image - Text-to-image diffusion model libraries, tools and apps for generating images from natural language (42 repos)
- Finance - Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives (34 repos)
- Game Development - Game development tools, engines and libraries (8 repos)
- GIS - Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections (28 repos)
- Graph - Graphs and network libraries: network analysis, graph machine learning, visualisation (6 repos)
- GUI - Graphical user interface libraries and toolkits (8 repos)
- Jupyter - Jupyter and JupyterLab and Notebook tools, libraries and plugins (27 repos)
- LLMs and ChatGPT - Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover (310 repos)
- Math and Science - Mathematical, numerical and scientific libraries (30 repos)
- Machine Learning - General - General and classical machine learning libraries. See below for other sections covering specialised ML areas (160 repos)
- Machine Learning - Deep Learning - Machine learning libraries that cross over with deep learning in some way (79 repos)
- Machine Learning - Interpretability - Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training (22 repos)
- Machine Learning - Ops - MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models (43 repos)
- Machine Learning - Reinforcement - Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF (23 repos)
- Machine Learning - Time Series - Machine learning and classical time series libraries: forecasting, seasonality, anomaly detection, econometrics (19 repos)
- Natural Language Processing - Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover (87 repos)
- Packaging - Python packaging, dependency management and bundling (28 repos)
- Pandas - Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations (24 repos)
- Performance - Performance, parallelisation and low level libraries (28 repos)
- Profiling - Memory and CPU/GPU profiling tools and libraries (11 repos)
- Security - Security related libraries: vulnerability discovery, SQL injection, environment auditing (14 repos)
- Simulation - Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover (37 repos)
- Study - Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials (60 repos)
- Template - Template tools and libraries: cookiecutter repos, generators, quick-starts (10 repos)
- Terminal - Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars (15 repos)
- Testing - Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins (24 repos)
- Typing - Typing libraries: static and run-time type checking, annotations (12 repos)
- Utility - General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools (210 repos)
- Visualisation - Visualisation tools and libraries: application frameworks, 2D/3D plotting, dashboards, WebGL (36 repos)
- Web - Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management (58 repos)
Newly Created Repositories
Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here.
- deepseek-ai/DeepSeek-V3 ⭐ 72,238
  A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
- huggingface/open-r1 ⭐ 15,441
  The goal of this repo is to build the missing pieces of the R1 pipeline such that everybody can reproduce and build on top of it
- jiayi-pan/TinyZero ⭐ 7,718
  TinyZero is a reproduction of DeepSeek R1 Zero in countdown and multiplication tasks.
- nvidia/Cosmos ⭐ 7,364
  NVIDIA Cosmos is a developer-first world foundation model platform designed to help Physical AI developers build their Physical AI systems better and faster.
- novasky-ai/SkyThought ⭐ 2,353
  Sky-T1: Train your own O1 preview model within $450
  🔗 novasky-ai.github.io
- langchain-ai/executive-ai-assistant ⭐ 1,487
  Executive AI Assistant (EAIA) is an AI agent that attempts to do the job of an Executive Assistant (EA).
- deep-agent/R1-V ⭐ 1,481
  We are building a general framework for Reinforcement Learning with Verifiable Rewards (RLVR) in VLM. RLVR outperforms chain-of-thought supervised fine-tuning (CoT-SFT) in both effectiveness and out-of-distribution (OOD) robustness for vision language models.
- thytu/Agentarium ⭐ 853
  Framework for managing and orchestrating AI agents with ease. Agentarium provides a flexible and intuitive way to create, manage, and coordinate interactions between multiple AI agents in various environments.
- developersdigest/llm-api-engine ⭐ 622
  Build and deploy AI-powered APIs in seconds. This project allows you to create custom APIs that extract structured data from websites using natural language descriptions, powered by LLMs and web scraping technology.
  🔗 www.youtube.com/watch?v=8kuek1bo4mm
- whitead/paper-qa ⭐ 3
  High accuracy RAG for answering questions from scientific documents with citations
Agentic AI
Agentic AI libraries, frameworks and tools: AI agents, workflows, autonomous decision-making, goal-oriented tasks, and API integrations.
- langchain-ai/langchain ⭐ 99,521
  🦜🔗 Build context-aware reasoning applications
  🔗 python.langchain.com
- langgenius/dify ⭐ 61,597
  Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
  🔗 dify.ai
- logspace-ai/langflow ⭐ 46,385
  Langflow is a low-code app builder for RAG and multi-agent AI applications. It's Python-based and agnostic to any model, API, or database.
  🔗 www.langflow.org
- microsoft/autogen ⭐ 38,740
  A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
  🔗 microsoft.github.io/autogen
- run-llama/llama_index ⭐ 38,553
  LlamaIndex is the leading framework for building LLM-powered agents over your data.
  🔗 docs.llamaindex.ai
- openbmb/ChatDev ⭐ 26,053
  ChatDev stands as a virtual software company that operates through various intelligent agents holding different roles, including Chief Executive Officer, Chief Product Officer etc
  🔗 arxiv.org/abs/2307.07924
- joaomdmoura/crewAI ⭐ 25,897
  Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
  🔗 crewai.com
- browser-use/browser-use ⭐ 23,992
  Browser use is the easiest way to connect your AI agents with the browser.
  🔗 browser-use.com
- stanford-oval/storm ⭐ 21,553
  An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
  🔗 storm.genie.stanford.edu
- yoheinakajima/babyagi ⭐ 20,920
  GPT-4 powered task-driven autonomous agent
  🔗 babyagi.org
- phidatahq/agno ⭐ 18,534
  Phidata is a toolkit for building AI Assistants using function calling.
  🔗 docs.agno.com
- openai/swarm ⭐ 18,450
  A framework exploring ergonomic, lightweight multi-agent orchestration.
- unity-technologies/ml-agents ⭐ 17,548
  The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
  🔗 unity.com/products/machine-learning-agents
- assafelovic/gpt-researcher ⭐ 16,326
  LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
  🔗 gptr.dev
- letta-ai/letta ⭐ 14,316
  Letta (formerly MemGPT) is a framework for creating LLM services with memory.
  🔗 docs.letta.com
- smol-ai/developer ⭐ 11,866
  the first library to let you embed a developer agent in your own app!
  🔗 twitter.com/smolmodels
- sakanaai/AI-Scientist ⭐ 8,832
  The AI Scientist, the first comprehensive system for fully automatic scientific discovery, enabling Foundation Models such as Large Language Models (LLMs) to perform research independently.
- langchain-ai/langgraph ⭐ 8,646
  LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.
  🔗 langchain-ai.github.io/langgraph
- meta-llama/llama-stack ⭐ 7,140
  Llama Stack standardizes the building blocks needed to bring genai applications to market. These blocks cover model training and fine-tuning, evaluation, and running AI agents in production
- huggingface/smolagents ⭐ 6,649
  🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
  🔗 huggingface.co/docs/smolagents
- pydantic/pydantic-ai ⭐ 6,023
  PydanticAI is a Python Agent Framework designed to make it less painful to build production grade applications with Generative AI.
  🔗 ai.pydantic.dev
- nirdiamant/GenAI_Agents ⭐ 5,830
  Tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.
- prefecthq/marvin ⭐ 5,452
  ✨ AI agents that spark joy
  🔗 askmarvin.ai
- mnotgod96/AppAgent ⭐ 5,432
  AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
  🔗 appagent-official.github.io
- kyegomez/swarms ⭐ 4,428
  The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
  🔗 docs.swarms.world
- meta-llama/llama-stack-apps ⭐ 4,107
  Agentic components of the Llama Stack APIs
- crewaiinc/crewAI-examples ⭐ 3,569
  A collection of examples that show how to use CrewAI framework to automate workflows.
- langroid/langroid ⭐ 3,005
  Harness LLMs with Multi-Agent Programming
  🔗 langroid.github.io/langroid
- facebookresearch/Pearl ⭐ 2,758
  A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
- brainblend-ai/atomic-agents ⭐ 2,433
  Atomic Agents provides a set of tools and agents that can be combined to create powerful applications. It is built on top of Instructor and leverages the power of Pydantic for data and schema validation and serialization.
- griptape-ai/griptape ⭐ 2,156
  Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
  🔗 www.griptape.ai
- joshuac215/agent-service-toolkit ⭐ 2,033
  A full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit.
  🔗 agent-service-toolkit.streamlit.app
- run-llama/llama_deploy ⭐ 1,931
  Async-first framework for deploying, scaling, and productionizing agentic multi-service systems based on workflows from llama_index.
  🔗 docs.llamaindex.ai/en/stable/module_guides/llama_deploy
- landing-ai/vision-agent ⭐ 1,789
  VisionAgent is a library that helps you utilize agent frameworks to generate code to solve your vision task
- om-ai-lab/OmAgent ⭐ 1,537
  OmAgent is a Python library for building multimodal language agents with ease. We try to keep the library simple without too much overhead like other agent frameworks.
  🔗 om-agent.com
- langchain-ai/executive-ai-assistant ⭐ 1,487
  Executive AI Assistant (EAIA) is an AI agent that attempts to do the job of an Executive Assistant (EA).
- openautocoder/Agentless ⭐ 1,386
  Agentless🐱: an agentless approach to automatically solve software development problems
- pyspur-dev/pyspur ⭐ 1,369
  Minimalist Graph Editor for AI Agents
  🔗 pyspur.dev
- link-agi/AutoAgents ⭐ 1,270
  [IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
  🔗 huggingface.co/spaces/linksoul/autoagents
- emcie-co/parlant ⭐ 1,181
  The heavy-duty guidance framework for customer-facing LLM agents
  🔗 www.parlant.io
- shengranhu/ADAS ⭐ 1,157
  Automated Design of Agentic Systems using Meta Agent Search to show agents can invent novel and powerful agent designs
  🔗 www.shengranhu.com/adas
- prefecthq/ControlFlow ⭐ 1,118
  ControlFlow provides a structured, developer-focused framework for defining workflows and delegating work to LLMs, without sacrificing control or transparency
  🔗 controlflow.ai
- thytu/Agentarium ⭐ 853
  Framework for managing and orchestrating AI agents with ease. Agentarium provides a flexible and intuitive way to create, manage, and coordinate interactions between multiple AI agents in various environments.
- victordibia/autogen-ui ⭐ 838
  Web UI for AutoGen (A Framework for Multi-Agent LLM Applications)
- szczyglis-dev/py-gpt ⭐ 828
  Desktop AI Assistant powered by o1, o3-mini, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, DeepSeek, Bielik, DALL-E, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and...
  🔗 pygpt.net
- google-deepmind/concordia ⭐ 759
  Concordia is a library to facilitate construction and use of generative agent-based models to simulate interactions of agents in grounded physical, social, or digital space.
- deedy/mac_computer_use ⭐ 734
  A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
  🔗 x.com/deedydas/status/1849481225041559910
- thudm/CogAgent ⭐ 658
  An open-sourced end-to-end VLM-based GUI Agent
- strnad/CrewAI-Studio ⭐ 572
  A user-friendly, multi-platform GUI for managing and running CrewAI agents and tasks. Supports Conda and virtual environments, no coding needed.
- salesforceairesearch/AgentLite ⭐ 568
  AgentLite is a research-oriented library designed for building and advancing LLM-based task-oriented agent systems. It simplifies the implementation of new agent/multi-agent architectures, enabling easy orchestration of multiple agents through a manager agent.
- plurai-ai/intellagent ⭐ 507
  Simulate interactions, analyze performance, and gain actionable insights for conversational agents. Test, evaluate, and optimize your agent to ensure reliable real-world deployment.
  🔗 intellagent-doc.plurai.ai
Code Quality
Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.
- psf/black ⭐ 39,517
  The uncompromising Python code formatter
  🔗 black.readthedocs.io/en/stable
- astral-sh/ruff ⭐ 35,419
  An extremely fast Python linter and code formatter, written in Rust.
  🔗 docs.astral.sh/ruff
- pre-commit/pre-commit ⭐ 13,310
  A framework for managing and maintaining multi-language pre-commit hooks.
  🔗 pre-commit.com
- sqlfluff/sqlfluff ⭐ 8,534
  A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
  🔗 www.sqlfluff.com
- pycqa/isort ⭐ 6,593
  A Python utility / library to sort imports.
  🔗 pycqa.github.io/isort
- davidhalter/jedi ⭐ 5,864
  Awesome autocompletion, static analysis and refactoring library for python
  🔗 jedi.readthedocs.io
- pycqa/pylint ⭐ 5,385
  It's not just a linter that annoys you!
  🔗 pylint.readthedocs.io/en/latest
- asottile/pyupgrade ⭐ 3,677
  A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.
- jendrikseipp/vulture ⭐ 3,653
  Find dead Python code
- pycqa/flake8 ⭐ 3,525
  flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
  🔗 flake8.pycqa.org
- wemake-services/wemake-python-styleguide ⭐ 2,650
  The strictest and most opinionated python linter ever!
  🔗 wemake-python-styleguide.rtfd.io
- python-lsp/python-lsp-server ⭐ 2,050
  Fork of the python-language-server project, maintained by the Spyder IDE team and the community
- codespell-project/codespell ⭐ 2,002
  check code for common misspellings
- sourcery-ai/sourcery ⭐ 1,592
  Instant AI code reviews
  🔗 sourcery.ai
- tconbeer/sqlfmt ⭐ 428
  sqlfmt formats your dbt SQL files so you don't have to
  🔗 sqlfmt.com
Crypto and Blockchain
Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.
- freqtrade/freqtrade ⭐ 35,834
  Free, open source crypto trading bot
  🔗 www.freqtrade.io
- ccxt/ccxt ⭐ 34,373
  A JavaScript / TypeScript / Python / C# / PHP / Go cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
  🔗 docs.ccxt.com
- crytic/slither ⭐ 5,472
  Static Analyzer for Solidity and Vyper
  🔗 blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework
- ethereum/web3.py ⭐ 5,135
  A python interface for interacting with the Ethereum blockchain and ecosystem.
  🔗 web3py.readthedocs.io
- ethereum/consensus-specs ⭐ 3,626
  Ethereum Proof-of-Stake Consensus Specifications
- cyberpunkmetalhead/Binance-volatility-trading-bot ⭐ 3,438
  This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins. If you like this project, consider donating through the Brave browser to allow me to continuously improve the script.
- bmoscon/cryptofeed ⭐ 2,333
  Cryptocurrency Exchange Websocket Data Feed Handler
- ethereum/py-evm ⭐ 2,298
  A Python implementation of the Ethereum Virtual Machine
  🔗 py-evm.readthedocs.io/en/latest
- binance/binance-public-data ⭐ 1,701
  Details on how to get Binance public data
- ofek/bit ⭐ 1,269
  Bitcoin made easy.
  🔗 ofek.dev/bit
- man-c/pycoingecko ⭐ 1,063
  Python wrapper for the CoinGecko API
- coinbase/agentkit ⭐ 451
  AgentKit is Coinbase Developer Platform's framework for easily enabling AI agents to take actions onchain. It is designed to be framework-agnostic, so you can use it with any AI framework, and wallet-agnostic
- dylanhogg/awesome-crypto ⭐ 73
  A list of awesome crypto and blockchain projects
  🔗 www.awesomecrypto.xyz
Data
General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.
- scrapy/scrapy ⭐ 54,011
  Scrapy, a fast high-level web crawling & scraping framework for Python.
  🔗 scrapy.org
- apache/spark ⭐ 40,453
  Apache Spark - A unified analytics engine for large-scale data processing
  🔗 spark.apache.org
- microsoft/markitdown ⭐ 36,143
  A utility for converting files to Markdown, supports: PDF, PPT, Word, Excel, Images etc
- mindsdb/mindsdb ⭐ 27,140
  AGI's query engine - Platform for building AI that can learn and answer questions over federated data.
  🔗 mindsdb.com
- getredash/redash ⭐ 26,834
  Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
  🔗 redash.io
- jaidedai/EasyOCR ⭐ 25,408
  Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
  🔗 www.jaided.ai
- qdrant/qdrant ⭐ 21,607
  Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
  🔗 qdrant.tech
- humansignal/label-studio ⭐ 20,567
  Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats.
  🔗 labelstud.io
- ds4sd/docling ⭐ 19,796
  Docling parses documents and exports them to the desired format with ease and speed.
  🔗 ds4sd.github.io/docling
- joke2k/faker ⭐ 17,993
  Faker is a Python package that generates fake data for you.
  🔗 faker.readthedocs.io
- avaiga/taipy ⭐ 17,758
  Turns Data and AI algorithms into production-ready web applications in no time.
  🔗 www.taipy.io
- chroma-core/chroma ⭐ 17,386
  the AI-native open-source embedding database
  🔗 www.trychroma.com
- airbytehq/airbyte ⭐ 17,118
  The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
  🔗 airbyte.com
- binux/pyspider ⭐ 16,540
  A Powerful Spider (Web Crawler) System in Python.
  🔗 docs.pyspider.org
- twintproject/twint ⭐ 15,930
  An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
- tiangolo/sqlmodel ⭐ 15,125
  SQL databases in Python, designed for simplicity, compatibility, and robustness.
  🔗 sqlmodel.tiangolo.com
- apache/arrow ⭐ 14,949
  Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
  🔗 arrow.apache.org
- pathwaycom/pathway ⭐ 13,493
  Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
  🔗 pathway.com
- weaviate/weaviate ⭐ 12,208
  Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
  🔗 weaviate.io/developers/weaviate
- coleifer/peewee ⭐ 11,344
  a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
  🔗 docs.peewee-orm.com
- s0md3v/Photon ⭐ 11,291
  Incredibly fast crawler designed for OSINT.
- sqlalchemy/sqlalchemy ⭐ 9,977
  The Database Toolkit for Python
  🔗 www.sqlalchemy.org
- simonw/datasette ⭐ 9,776
  An open source multi-tool for exploring and publishing data
  🔗 datasette.io
- bigscience-workshop/petals ⭐ 9,395
  🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
  🔗 petals.dev
- voxel51/fiftyone ⭐ 9,132
  Refine high-quality datasets and visual AI models
  🔗 fiftyone.ai
- yzhao062/pyod ⭐ 8,781
  A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
  🔗 pyod.readthedocs.io
- gristlabs/grist-core ⭐ 7,762
  Grist is the evolution of spreadsheets.
  🔗 www.getgrist.com
- tobymao/sqlglot ⭐ 7,070
  Python SQL Parser and Transpiler
  🔗 sqlglot.com
- alirezamika/autoscraper ⭐ 6,612
  A Smart, Automatic, Fast and Lightweight Web Scraper for Python
- kaggle/kaggle-api ⭐ 6,411
  Official Kaggle API
- madmaze/pytesseract ⭐ 5,996
  A Python wrapper for Google Tesseract
- vi3k6i5/flashtext ⭐ 5,612
  Extract Keywords from sentence or Replace keywords in sentences.
- airbnb/knowledge-repo ⭐ 5,500
  A next-generation curated knowledge sharing platform for data scientists and other technical professions.
- ibis-project/ibis ⭐ 5,490
  Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
  🔗 ibis-project.org
- lancedb/lancedb ⭐ 5,409
  Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
  🔗 lancedb.github.io/lancedb
- cyclotruc/gitingest ⭐ 5,408
  Turn any Git repository into a prompt-friendly text ingest for LLMs.
  🔗 gitingest.com
- facebookresearch/AugLy ⭐ 4,988
  A data augmentations library for audio, image, text, and video.
  🔗 ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models
- superduperdb/superduper ⭐ 4,941
  Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
  🔗 superduper.io
- jazzband/tablib ⭐ 4,654
  Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
  🔗 tablib.readthedocs.io
- amundsen-io/amundsen ⭐ 4,487
  Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
  🔗 www.amundsen.io/amundsen
- lk-geimfari/mimesis ⭐ 4,484
  Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
  🔗 mimesis.name
- giskard-ai/giskard ⭐ 4,266
  🐢 Open-Source Evaluation & Testing for AI & LLM systems
  🔗 docs.giskard.ai
- mongodb/mongo-python-driver ⭐ 4,178
  PyMongo - the Official MongoDB Python driver
  🔗 www.mongodb.com/docs/languages/python/pymongo-driver/current
- adbar/trafilatura ⭐ 3,893
  Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  🔗 trafilatura.readthedocs.io
- rom1504/img2dataset ⭐ 3,874
  Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
- andialbrecht/sqlparse ⭐ 3,809
  A non-validating SQL parser module for Python
- jmcnamara/XlsxWriter ⭐ 3,699
  A Python module for creating Excel XLSX files.
  🔗 xlsxwriter.readthedocs.io
- deepchecks/deepchecks ⭐ 3,698
  Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
  🔗 docs.deepchecks.com/stable
- praw-dev/praw ⭐ 3,574
  PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
  🔗 praw.readthedocs.io
- run-llama/llama-hub ⭐ 3,465
  A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
  🔗 llamahub.ai
- rapidai/RapidOCR ⭐ 3,435
  Awesome OCR multiple programming languages toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.
  🔗 rapidai.github.io/rapidocrdocs
- pyeve/cerberus ⭐ 3,189
  Lightweight, extensible data validation library for Python
  🔗 python-cerberus.org
- zoomeranalytics/xlwings ⭐ 3,063
  xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
  🔗 www.xlwings.org
- sqlalchemy/alembic ⭐ 3,056
  A database migrations tool for SQLAlchemy.
- dlt-hub/dlt ⭐ 3,055
  data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
  🔗 dlthub.com/docs
- docarray/docarray ⭐ 3,007
  Represent, send, store and search multimodal data
  🔗 docs.docarray.org
- pallets/itsdangerous ⭐ 2,966
  Safely pass trusted data to untrusted environments and back.
  🔗 itsdangerous.palletsprojects.com
- datafold/data-diff ⭐ 2,959
  Compare tables within or across databases
  🔗 docs.datafold.com
- goldsmith/Wikipedia ⭐ 2,916
  A Pythonic wrapper for the Wikipedia API
  🔗 wikipedia.readthedocs.org
- awslabs/amazon-redshift-utils ⭐ 2,787
  Amazon Redshift Utils contains utilities, scripts and views which are useful in a Redshift environment
- mlabonne/llm-datasets ⭐ 2,615
  Curated list of datasets and tools for post-training.
  🔗 mlabonne.github.io/blog
- kayak/pypika ⭐ 2,613
  PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
  🔗 pypika.readthedocs.io/en/latest
- sdv-dev/SDV ⭐ 2,479
  Synthetic data generation for tabular data
  🔗 docs.sdv.dev/sdv
- pynamodb/PynamoDB ⭐ 2,479
  A pythonic interface to Amazon's DynamoDB
  🔗 pynamodb.readthedocs.io
- uqfoundation/dill ⭐ 2,310
  serialize all of Python
  🔗 dill.rtfd.io
- emirozer/fake2db ⭐ 2,288
  Generate fake but valid data filled databases for test purposes using most popular patterns (AFAIK). Current support is sqlite, mysql, postgresql, mongodb, redis, couchdb.
- samuelcolvin/arq ⭐ 2,281
  Fast job queuing and RPC in python with asyncio and redis.
  🔗 arq-docs.helpmanual.io
- pikepdf/pikepdf ⭐ 2,252
  A Python library for reading and writing PDF, powered by QPDF
  🔗 pikepdf.readthedocs.io
- graphistry/pygraphistry ⭐ 2,203
  PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
- accenture/AmpliGraph ⭐ 2,183
  Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
- huggingface/datatrove ⭐ 2,173
  DataTrove is a library to process, filter and deduplicate text data at a very large scale. It provides a set of prebuilt commonly used processing blocks with a framework to easily add custom functionality
- sfu-db/connector-x ⭐ 2,095
  Fastest library to load data from DB to DataFrames in Rust and Python
  🔗 sfu-db.github.io/connector-x
- aminalaee/sqladmin ⭐ 2,025
  SQLAlchemy Admin for FastAPI and Starlette
  🔗 aminalaee.dev/sqladmin
- milvus-io/bootcamp ⭐ 1,988
  Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
  🔗 milvus.io
- agronholm/sqlacodegen ⭐ 1,977
  Automatic model code generator for SQLAlchemy
- uber/petastorm ⭐ 1,812
  Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
- aio-libs/aiomysql ⭐ 1,780
  aiomysql is a library for accessing a MySQL database from asyncio
  🔗 aiomysql.rtfd.io
- simonw/sqlite-utils ⭐ 1,748
  Python CLI utility and library for manipulating SQLite databases
  🔗 sqlite-utils.datasette.io
- simple-salesforce/simple-salesforce ⭐ 1,734
  A very simple Salesforce.com REST API client for Python
- collerek/ormar ⭐ 1,699
  python async orm with fastapi in mind and pydantic validation
  🔗 collerek.github.io/ormar
- zarr-developers/zarr-python ⭐ 1,590
  An implementation of chunked, compressed, N-dimensional arrays for Python.
  🔗 zarr.readthedocs.io
- eleutherai/the-pile ⭐ 1,529
  The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together.
- scholarly-python-package/scholarly ⭐ 1,498
  Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
  🔗 scholarly.readthedocs.io
- ydataai/ydata-synthetic ⭐ 1,483
  Synthetic data generators for tabular and time-series data
  🔗 docs.synthetic.ydata.ai
- sdispater/orator ⭐ 1,422
  The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
  🔗 orator-orm.com
mchong6/JoJoGAN โญ 1,419
Official PyTorch repo for JoJoGAN: One Shot Face Stylization -
google/tensorstore โญ 1,376
Library for reading and writing large multi-dimensional arrays.
๐ google.github.io/tensorstore -
quixio/quix-streams โญ 1,289
Python stream processing for Kafka
๐ docs.quix.io -
aio-libs/aiocache โญ 1,208
Asyncio cache manager for redis, memcached and memory
๐ aiocache.readthedocs.io -
eliasdabbas/advertools โญ 1,181
advertools - online marketing productivity and analysis tools
๐ advertools.readthedocs.io -
pytorch/data โญ 1,163
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. -
d-star-ai/dsRAG โญ 1,139
A retrieval engine for unstructured data. It is especially good at handling challenging queries over dense text, like financial reports, legal documents, and academic papers. -
brettkromkamp/contextualise โญ 1,068
Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
๐ contextualise.dev -
uber/fiber โญ 1,041
Distributed Computing for AI Made Simple
๐ uber.github.io/fiber -
intake/intake โญ 1,025
Intake is a lightweight package for finding, investigating, loading and disseminating data.
๐ intake.readthedocs.io -
duckdb/dbt-duckdb ⭐ 976
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) -
igorbenav/fastcrud ⭐ 931
FastCRUD is a Python package for FastAPI, offering robust async CRUD operations and flexible endpoint creation utilities. -
goccy/bigquery-emulator ⭐ 890
BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development. -
scikit-hep/awkward ⭐ 861
Manipulate JSON-like data with NumPy-like idioms.
🔗 awkward-array.org -
macbre/sql-metadata ⭐ 834
Uses tokenized query returned by python-sqlparse and generates query metadata
🔗 pypi.python.org/pypi/sql-metadata -
koaning/human-learn ⭐ 801
Natural Intelligence is still a pretty good idea.
🔗 koaning.github.io/human-learn -
googleapis/python-bigquery ⭐ 749
Python Client for Google BigQuery -
hyperqueryhq/whale ⭐ 725
🐳 The stupidly simple CLI workspace for your data warehouse.
🔗 rsyi.gitbook.io/whale -
dgarnitz/vectorflow ⭐ 682
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
🔗 www.getvectorflow.com -
kagisearch/vectordb ⭐ 673
A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
🔗 vectordb.com -
weaviate/recipes ⭐ 660
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations! -
unstructured-io/unstructured-api ⭐ 627
API for Open-Source Pre-Processing Tools for Unstructured Data -
apache/iceberg-python ⭐ 586
PyIceberg is a Python library for programmatic access to Iceberg table metadata as well as to table data in Iceberg format.
🔗 py.iceberg.apache.org -
jina-ai/vectordb ⭐ 586
A Python vector database you just need - no more, no less. -
koaning/bulk ⭐ 563
Bulk is a quick UI developer tool to apply some bulk labels. -
koaning/doubtlab ⭐ 508
Doubt your data, find bad labels.
🔗 koaning.github.io/doubtlab -
ibm/data-prep-kit ⭐ 462
Data Prep Kit is a community project to democratize and accelerate unstructured data preparation for LLM app developers
🔗 ibm.github.io/data-prep-kit -
titan-systems/titan ⭐ 450
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. -
stackloklabs/promptwright ⭐ 371
Promptwright is a Python library designed for generating large synthetic datasets using LLMs
Debugging and tracing tools.
-
cool-rr/PySnooper ⭐ 16,422
Never use print for debugging again -
gruns/icecream ⭐ 9,477
🍦 Never use print() to debug again. -
shobrook/rebound ⭐ 4,123
Get Stack Overflow results in your terminal whenever an error is thrown -
inducer/pudb ⭐ 3,030
Full-screen console debugger for Python
🔗 documen.tician.de/pudb -
alexmojaki/heartrate ⭐ 1,806
Simple real time visualisation of the execution of a Python program. -
alexmojaki/birdseye ⭐ 1,667
Graphical Python debugger which lets you easily view the values of all evaluated expressions
🔗 birdseye.readthedocs.io -
pdbpp/pdbpp ⭐ 1,328
pdb++, a drop-in replacement for pdb (the Python debugger) -
alexmojaki/snoop ⭐ 1,320
A powerful set of Python debugging tools, based on PySnooper -
samuelcolvin/python-devtools ⭐ 1,004
Dev tools for Python
🔗 python-devtools.helpmanual.io
Text-to-image diffusion model libraries, tools and apps for generating images from natural language.
-
automatic1111/stable-diffusion-webui ⭐ 146,849
Stable Diffusion web UI -
compvis/stable-diffusion ⭐ 69,375
A latent text-to-image diffusion model
🔗 ommer-lab.com/research/latent-diffusion-models -
comfyanonymous/ComfyUI ⭐ 65,770
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
🔗 www.comfy.org -
stability-ai/stablediffusion ⭐ 39,940
High-Resolution Image Synthesis with Latent Diffusion Models -
lllyasviel/ControlNet ⭐ 31,345
Let us control diffusion models! -
huggingface/diffusers ⭐ 27,374
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
🔗 huggingface.co/docs/diffusers -
invoke-ai/InvokeAI ⭐ 24,328
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
🔗 invoke-ai.github.io/invokeai -
openbmb/MiniCPM-o ⭐ 18,085
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone -
apple/ml-stable-diffusion ⭐ 17,113
Stable Diffusion with Core ML on Apple Silicon -
borisdayma/dalle-mini ⭐ 14,784
DALL·E Mini - Generate images from a text prompt
🔗 www.craiyon.com -
divamgupta/diffusionbee-stable-diffusion-ui ⭐ 12,959
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
🔗 diffusionbee.com -
compvis/latent-diffusion ⭐ 12,265
High-Resolution Image Synthesis with Latent Diffusion Models -
instantid/InstantID ⭐ 11,360
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
🔗 instantid.github.io -
lucidrains/DALLE2-pytorch ⭐ 11,207
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch -
facebookresearch/dinov2 ⭐ 9,734
PyTorch code and models for the DINOv2 self-supervised learning method. -
ashawkey/stable-dreamfusion ⭐ 8,446
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion. -
carson-katri/dream-textures ⭐ 7,919
Stable Diffusion built-in to Blender -
xavierxiao/Dreambooth-Stable-Diffusion ⭐ 7,652
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion -
idea-research/GroundingDINO ⭐ 7,297
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
🔗 arxiv.org/abs/2303.05499 -
opengvlab/InternVL ⭐ 6,925
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
🔗 internvl.readthedocs.io/en/latest -
timothybrooks/instruct-pix2pix ⭐ 6,491
PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo. -
openai/consistency_models ⭐ 6,236
Official repo for consistency models. -
salesforce/BLIP ⭐ 4,989
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation -
nateraw/stable-diffusion-videos ⭐ 4,496
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts -
lkwq007/stablediffusion-infinity ⭐ 3,865
Outpainting with Stable Diffusion on an infinite canvas -
jina-ai/discoart ⭐ 3,846
🪩 Create Disco Diffusion artworks in one line -
mlc-ai/web-stable-diffusion ⭐ 3,632
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
🔗 mlc.ai/web-stable-diffusion -
openai/glide-text2im ⭐ 3,577
GLIDE: a diffusion-based text-conditional image synthesis model -
openai/improved-diffusion ⭐ 3,409
Release for Improved Denoising Diffusion Probabilistic Models -
saharmor/dalle-playground ⭐ 2,765
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini) -
google-research/big_vision ⭐ 2,547
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. -
stability-ai/stability-sdk ⭐ 2,429
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
🔗 platform.stability.ai -
thudm/CogVLM2 ⭐ 2,234
GPT4V-level open-source multi-modal model based on Llama3-8B -
open-compass/VLMEvalKit ⭐ 1,767
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
🔗 huggingface.co/spaces/opencompass/open_vlm_leaderboard -
coyote-a/ultimate-upscale-for-automatic1111 ⭐ 1,691
Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI -
divamgupta/stable-diffusion-tensorflow ⭐ 1,593
Stable Diffusion in TensorFlow / Keras -
nvlabs/prismer ⭐ 1,307
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
🔗 shikun.io/projects/prismer -
chenyangqiqi/FateZero ⭐ 1,130
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
🔗 fate-zero-edit.github.io -
thereforegames/unprompted ⭐ 793
Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI. -
tanelp/tiny-diffusion ⭐ 702
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets. -
sharonzhou/long_stable_diffusion ⭐ 684
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion) -
laion-ai/dalle2-laion ⭐ 501
Pretrained DALL-E 2 from LAION
Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.
-
openbb-finance/OpenBB ⭐ 35,969
Investment Research for Everyone, Everywhere.
🔗 openbb.co -
quantopian/zipline ⭐ 17,992
Zipline, a Pythonic Algorithmic Trading Library
🔗 www.zipline.io -
microsoft/qlib ⭐ 16,303
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, ...
🔗 qlib.readthedocs.io/en/latest -
mementum/backtrader ⭐ 15,681
Python Backtesting library for trading strategies
🔗 www.backtrader.com -
ranaroussi/yfinance ⭐ 15,596
Download market data from Yahoo! Finance's API
🔗 ranaroussi.github.io/yfinance -
ai4finance-foundation/FinGPT ⭐ 14,815
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
🔗 ai4finance.org -
ai4finance-foundation/FinRL ⭐ 10,561
FinRL: Financial Reinforcement Learning. 🔥
🔗 ai4finance.org -
quantconnect/Lean ⭐ 10,422
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
🔗 lean.io -
ta-lib/ta-lib-python ⭐ 10,124
Python wrapper for TA-Lib (http://ta-lib.org/).
🔗 ta-lib.github.io/ta-lib-python -
goldmansachs/gs-quant ⭐ 8,304
Python toolkit for quantitative finance
🔗 developer.gs.com/discover/products/gs-quant -
virattt/ai-hedge-fund ⭐ 7,455
AI-powered hedge fund. The goal of this project is to explore the use of AI to make trading decisions. -
kernc/backtesting.py ⭐ 5,877
Backtest trading strategies in Python.
🔗 kernc.github.io/backtesting.py -
quantopian/pyfolio ⭐ 5,806
Portfolio and risk analytics in Python
🔗 quantopian.github.io/pyfolio -
twopirllc/pandas-ta ⭐ 5,728
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
🔗 twopirllc.github.io/pandas-ta -
ranaroussi/quantstats ⭐ 5,254
Portfolio analytics for quants, written in Python -
polakowo/vectorbt ⭐ 4,713
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
🔗 vectorbt.dev -
google/tf-quant-finance ⭐ 4,686
High-performance TensorFlow library for quantitative finance. -
gbeced/pyalgotrade ⭐ 4,471
Python Algorithmic Trading Library
🔗 gbeced.github.io/pyalgotrade -
borisbanushev/stockpredictionai ⭐ 4,415
In this notebook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ... -
matplotlib/mplfinance ⭐ 3,831
Financial Markets Data Visualization using Matplotlib
🔗 pypi.org/project/mplfinance -
quantopian/alphalens ⭐ 3,516
Performance analysis of predictive (alpha) stock factors
🔗 quantopian.github.io/alphalens -
cuemacro/finmarketpy ⭐ 3,514
Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
🔗 www.cuemacro.com -
zvtvz/zvt ⭐ 3,369
Modular quant framework.
🔗 zvt.readthedocs.io/en/latest -
robcarver17/pysystemtrade ⭐ 2,754
Systematic trading in Python -
quantopian/research_public ⭐ 2,498
Quantitative research and educational materials
🔗 www.quantopian.com/lectures -
pmorissette/bt ⭐ 2,378
bt - flexible backtesting for Python
🔗 pmorissette.github.io/bt -
domokane/FinancePy ⭐ 2,249
A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives. -
blankly-finance/blankly ⭐ 2,220
Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
🔗 package.blankly.finance -
pmorissette/ffn ⭐ 2,109
ffn - a financial function library for Python
🔗 pmorissette.github.io/ffn -
cuemacro/findatapy ⭐ 1,754
Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc. -
quantopian/empyrical ⭐ 1,329
Common financial risk and performance metrics. Used by zipline and pyfolio.
🔗 quantopian.github.io/empyrical -
idanya/algo-trader ⭐ 810
Trading bot with support for realtime trading, backtesting, custom strategies and much more. -
gbeced/basana ⭐ 647
A Python async and event driven framework for algorithmic trading, with a focus on cryptocurrencies. -
chancefocus/PIXIU ⭐ 614
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
Game development tools, engines and libraries.
-
pygame/pygame ⭐ 7,722
🐍🎮 pygame (the library) is a Free and Open Source Python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
🔗 www.pygame.org -
microsoft/TRELLIS ⭐ 7,413
A large 3D asset generation model. It takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes.
🔗 trellis3d.github.io -
panda3d/panda3d ⭐ 4,634
Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
🔗 www.panda3d.org -
niklasf/python-chess ⭐ 2,504
python-chess is a chess library for Python, with move generation, move validation, and support for common formats
🔗 python-chess.readthedocs.io/en/latest -
pokepetter/ursina ⭐ 2,277
A game engine powered by Python and Panda3D.
🔗 pokepetter.github.io/ursina -
pyglet/pyglet ⭐ 1,955
pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
🔗 pyglet.org -
pythonarcade/arcade ⭐ 1,747
Easy to use Python library for creating 2D arcade games.
🔗 arcade.academy
Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.
-
domlysz/BlenderGIS ⭐ 7,984
Blender addons to make the bridge between Blender and geographic data -
python-visualization/folium ⭐ 7,025
Python Data. Leaflet.js Maps.
🔗 python-visualization.github.io/folium -
osgeo/gdal ⭐ 5,075
GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
🔗 gdal.org -
gboeing/osmnx ⭐ 4,988
Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
🔗 osmnx.readthedocs.io -
geopandas/geopandas ⭐ 4,623
Python tools for geographic data
🔗 geopandas.org -
shapely/shapely ⭐ 3,998
Manipulation and analysis of geometric objects
🔗 shapely.readthedocs.io/en/stable -
giswqs/geemap ⭐ 3,553
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
🔗 geemap.org -
holoviz/datashader ⭐ 3,371
Quickly and accurately render even the largest data.
🔗 datashader.org -
opengeos/leafmap ⭐ 3,263
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
🔗 leafmap.org -
microsoft/torchgeo ⭐ 3,189
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
🔗 www.osgeo.org/projects/torchgeo -
opengeos/segment-geospatial ⭐ 3,148
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
🔗 samgeo.gishub.org -
google/earthengine-api ⭐ 2,752
Python and JavaScript bindings for calling the Earth Engine API. -
rasterio/rasterio ⭐ 2,304
Rasterio reads and writes geospatial raster datasets
🔗 rasterio.readthedocs.io -
mcordts/cityscapesScripts ⭐ 2,203
README and scripts for the Cityscapes Dataset -
azavea/raster-vision ⭐ 2,111
An open source library and framework for deep learning on satellite and aerial imagery.
🔗 docs.rastervision.io -
apache/sedona ⭐ 1,991
A cluster computing framework for processing large-scale geospatial data
🔗 sedona.apache.org -
gboeing/osmnx-examples ⭐ 1,616
Gallery of OSMnx tutorials, usage examples, and feature demonstrations.
🔗 osmnx.readthedocs.io -
jupyter-widgets/ipyleaflet ⭐ 1,504
A Jupyter - Leaflet.js bridge
🔗 ipyleaflet.readthedocs.io -
microsoft/GlobalMLBuildingFootprints ⭐ 1,467
Worldwide building footprints derived from satellite imagery -
pysal/pysal ⭐ 1,358
PySAL: Python Spatial Analysis Library Meta-Package
🔗 pysal.org/pysal -
anitagraser/movingpandas ⭐ 1,266
Movement trajectory classes and functions built on top of GeoPandas
🔗 movingpandas.org -
residentmario/geoplot ⭐ 1,164
High-level geospatial data visualization library for Python.
🔗 residentmario.github.io/geoplot/index.html -
sentinel-hub/eo-learn ⭐ 1,145
Earth observation processing framework for machine learning in Python
🔗 eo-learn.readthedocs.io/en/latest -
opengeos/streamlit-geospatial ⭐ 904
A multi-page Streamlit app for geospatial
🔗 huggingface.co/spaces/giswqs/streamlit -
osgeo/grass ⭐ 876
GRASS GIS - free and open-source geospatial processing engine
🔗 grass.osgeo.org -
makepath/xarray-spatial ⭐ 860
Raster-based Spatial Analytics for Python
🔗 xarray-spatial.readthedocs.io -
developmentseed/titiler ⭐ 823
Build your own Raster dynamic map tile services
🔗 developmentseed.org/titiler
Graphs and network libraries: network analysis, graph machine learning, visualisation.
-
networkx/networkx ⭐ 15,349
Network Analysis in Python
🔗 networkx.org -
stellargraph/stellargraph ⭐ 2,967
StellarGraph - Machine Learning on Graphs
🔗 stellargraph.readthedocs.io -
westhealth/pyvis ⭐ 1,051
Python package for creating and visualizing interactive network graphs.
🔗 pyvis.readthedocs.io/en/latest -
microsoft/graspologic ⭐ 853
graspologic is a package for graph statistical algorithms
🔗 graspologic-org.github.io/graspologic -
rampasek/GraphGPS ⭐ 699
Recipe for a General, Powerful, Scalable Graph Transformer -
dylanhogg/llmgraph ⭐ 370
Create knowledge graphs with LLMs
Graphical user interface libraries and toolkits.
-
hoffstadt/DearPyGui ⭐ 13,672
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
🔗 dearpygui.readthedocs.io/en/latest -
pysimplegui/PySimpleGUI ⭐ 13,542
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
🔗 www.pysimplegui.com -
parthjadhav/Tkinter-Designer ⭐ 9,511
An easy and fast way to create a Python GUI -
samuelcolvin/FastUI ⭐ 8,650
FastUI is a new way to build web application user interfaces defined by declarative Python code.
🔗 fastui-demo.onrender.com -
r0x0r/pywebview ⭐ 4,944
Build GUI for your Python program with JavaScript, HTML, and CSS
🔗 pywebview.flowrl.com -
beeware/toga ⭐ 4,486
A Python native, OS native GUI toolkit.
🔗 toga.readthedocs.io/en/latest -
dddomodossola/remi ⭐ 3,549
Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet. -
wxwidgets/Phoenix ⭐ 2,381
wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
🔗 wxpython.org
Jupyter and JupyterLab and Notebook tools, libraries and plugins.
-
jupyterlab/jupyterlab ⭐ 14,356
JupyterLab computational environment.
🔗 jupyterlab.readthedocs.io -
jupyter/notebook ⭐ 11,991
Jupyter Interactive Notebook
🔗 jupyter-notebook.readthedocs.io -
marimo-team/marimo ⭐ 10,162
A reactive Python notebook: run a cell or interact with a UI element, and marimo automatically runs dependent cells, keeping code and outputs consistent. marimo notebooks are stored as pure Python, executable as scripts, and deployable as apps.
🔗 marimo.io -
mwouts/jupytext ⭐ 6,728
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
🔗 jupytext.readthedocs.io -
nteract/papermill ⭐ 6,069
Parameterize, execute, and analyze notebooks
🔗 papermill.readthedocs.io/en/latest -
connorferster/handcalcs ⭐ 5,699
Python library for converting Python calculations into rendered LaTeX. -
voila-dashboards/voila ⭐ 5,558
Voilà turns Jupyter notebooks into standalone web applications
🔗 voila.readthedocs.io -
jupyterlite/jupyterlite ⭐ 3,992
Wasm powered Jupyter running in the browser 💡
🔗 jupyterlite.rtfd.io/en/stable/try/lab -
executablebooks/jupyter-book ⭐ 3,956
Create beautiful, publication-quality books and documents from computational content.
🔗 jupyterbook.org -
jupyterlab/jupyterlab-desktop ⭐ 3,855
JupyterLab desktop application, based on Electron. -
jupyterlab/jupyter-ai ⭐ 3,382
A generative AI extension for JupyterLab
🔗 jupyter-ai.readthedocs.io -
jupyter-widgets/ipywidgets ⭐ 3,190
Interactive Widgets for the Jupyter Notebook
🔗 ipywidgets.readthedocs.io -
quantopian/qgrid ⭐ 3,061
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks -
jupyter/nbdime ⭐ 2,696
Tools for diffing and merging of Jupyter notebooks.
🔗 nbdime.readthedocs.io -
mito-ds/mito ⭐ 2,342
Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet
🔗 trymito.io -
jupyter/nbviewer ⭐ 2,224
nbconvert as a web service: Render Jupyter Notebooks as static web pages
🔗 nbviewer.jupyter.org -
maartenbreddels/ipyvolume ⭐ 1,953
3D plotting for Python in the Jupyter notebook based on IPython widgets using WebGL -
jupyter-lsp/jupyterlab-lsp ⭐ 1,842
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
🔗 jupyterlab-lsp.readthedocs.io -
jupyter/nbconvert ⭐ 1,783
Jupyter Notebook Conversion
🔗 nbconvert.readthedocs.io -
koaning/drawdata ⭐ 1,222
Draw datasets from within Jupyter. -
8080labs/pyforest ⭐ 1,109
With pyforest you can use all your favorite Python libraries without importing them before. If you use a package that is not imported yet, pyforest imports the package for you and adds the code to the first Jupyter cell.
🔗 8080labs.com -
nbqa-dev/nbQA ⭐ 1,076
Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
🔗 nbqa.readthedocs.io/en/latest/index.html -
vizzuhq/ipyvizzu ⭐ 959
Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
🔗 ipyvizzu.vizzuhq.com -
aws/graph-notebook ⭐ 750
Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
🔗 github.com/aws/graph-notebook -
linealabs/lineapy ⭐ 665
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
🔗 lineapy.org -
xiaohk/stickyland ⭐ 540
Break the linear presentation of Jupyter Notebooks with sticky cells!
🔗 xiaohk.github.io/stickyland -
infuseai/colab-xterm ⭐ 423
Open a terminal in Colab, including the free tier.
Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover.
-
significant-gravitas/AutoGPT ⭐ 171,082
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🔗 agpt.co -
ggerganov/llama.cpp ⭐ 72,991
LLM inference in C/C++ -
deepseek-ai/DeepSeek-V3 ⭐ 72,238
A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. -
nomic-ai/gpt4all ⭐ 72,183
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
🔗 nomic.ai/gpt4all -
open-webui/open-webui ⭐ 67,137
Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline. It supports various LLM runners like Ollama and OpenAI-compatible APIs, with a built-in inference engine for RAG
🔗 openwebui.com -
xtekky/gpt4free ⭐ 63,309
The official gpt4free repository | a collection of various powerful language models | gpt-4o and deepseek v3 & r1
🔗 t.me/g4f_channel -
killianlucas/open-interpreter ⭐ 58,149
A natural language interface for computers
🔗 openinterpreter.com -
facebookresearch/llama ⭐ 57,463
Inference code for Llama models -
imartinez/private-gpt ⭐ 55,091
Interact with your documents using the power of GPT, 100% privately, no data leaks
🔗 privategpt.dev -
gpt-engineer-org/gpt-engineer ⭐ 53,005
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app -
xai-org/grok-1 ⭐ 49,890
This repository contains JAX example code for loading and running the Grok-1 open-weights model. -
geekan/MetaGPT ⭐ 45,927
The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🔗 deepwisdom.ai -
oobabooga/text-generation-webui ⭐ 42,187
A Gradio web UI for Large Language Models with support for multiple inference backends. -
thudm/ChatGLM-6B ⭐ 41,017
ChatGLM-6B: An Open Bilingual Dialogue Language Model -
hiyouga/LLaMA-Factory ⭐ 39,299
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🔗 huggingface.co/papers/2403.13372 -
hpcaitech/ColossalAI ⭐ 39,042
Making large AI models cheaper, faster and more accessible
🔗 www.colossalai.org -
karpathy/nanoGPT ⭐ 39,008
The simplest, fastest repository for training/finetuning medium-sized GPTs. -
lm-sys/FastChat ⭐ 37,647
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. -
quivrhq/quivr ⭐ 37,201
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
🔗 core.quivr.com -
laion-ai/Open-Assistant ⭐ 37,193
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
🔗 open-assistant.io -
vllm-project/vllm ⭐ 36,204
A high-throughput and memory-efficient inference and serving engine for LLMs
🔗 docs.vllm.ai -
moymix/TaskMatrix ⭐ 34,542
Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting. -
pythagora-io/gpt-pilot ⭐ 32,286
The first real AI developer -
infiniflow/ragflow ⭐ 31,032
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🔗 ragflow.io -
tatsu-lab/stanford_alpaca ⭐ 29,777
Code and documentation to train Stanford's Alpaca models, and generate the data.
🔗 crfm.stanford.edu/2023/03/13/alpaca.html -
unclecode/crawl4ai ⭐ 28,696
AI-ready web crawling tailored for LLMs, AI agents, and data pipelines. Open source, flexible, and built for real-time performance, Crawl4AI empowers developers with unmatched speed, precision, and deployment ease.
🔗 crawl4ai.com -
danielmiessler/fabric ⭐ 28,305
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
🔗 danielmiessler.com/p/fabric-origin-story -
meta-llama/llama3 ⭐ 28,175
The official Meta Llama 3 GitHub site -
khoj-ai/khoj ⭐ 25,860
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI
🔗 khoj.dev -
vision-cair/MiniGPT-4 ⭐ 25,543
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
🔗 minigpt-4.github.io -
karpathy/llm.c ⭐ 25,221
LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of CPython -
embedchain/mem0 ⭐ 24,373
The Memory layer for AI Agents
🔗 mem0.ai -
microsoft/JARVIS ⭐ 23,904
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf -
unslothai/unsloth ⭐ 23,135
Finetune Llama 3.3, DeepSeek-R1, Mistral, Phi-4 & Gemma 2 LLMs 2-5x faster with 70% less memory
🔗 unsloth.ai -
microsoft/semantic-kernel ⭐ 22,913
Integrate cutting-edge LLM technology quickly and easily into your apps
🔗 aka.ms/semantic-kernel -
openai/gpt-2 ⭐ 22,899
Code for the paper "Language Models are Unsupervised Multitask Learners"
🔗 openai.com/blog/better-language-models -
microsoft/graphrag ⭐ 22,084
A modular graph-based Retrieval-Augmented Generation (RAG) system
🔗 microsoft.github.io/graphrag -
stanfordnlp/dspy ⭐ 21,643
DSPy: The framework for programming, not prompting, language models
๐ dspy.ai -
haotian-liu/LLaVA โญ 21,285
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
๐ llava.hliu.cc -
karpathy/minGPT โญ 21,242
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training -
openai/chatgpt-retrieval-plugin โญ 21,105
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. -
cinnamon/kotaemon โญ 20,768
An open-source RAG UI for chatting with your documents. Built with both end users and developers in mind
๐ cinnamon.github.io/kotaemon -
mlc-ai/mlc-llm โญ 19,850
Universal LLM Deployment Engine with ML Compilation
๐ llm.mlc.ai -
guidance-ai/guidance โญ 19,576
A guidance language for controlling large language models. -
rasahq/rasa โญ 19,335
๐ฌ Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
๐ rasa.com/docs/rasa -
deepset-ai/haystack โญ 18,969
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversatio...
๐ haystack.deepset.ai -
stitionai/devika โญ 18,851
Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. -
tloen/alpaca-lora โญ 18,792
Instruct-tune LLaMA on consumer hardware -
karpathy/llama2.c โญ 17,970
Inference Llama 2 in one file of pure C -
huggingface/peft โญ 17,146
PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
๐ huggingface.co/docs/peft -
berriai/litellm โญ 16,967
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
๐ docs.litellm.ai/docs -
qwenlm/Qwen โญ 16,475
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. -
facebookresearch/codellama โญ 16,175
Inference code for CodeLlama models -
facebookresearch/llama-cookbook โญ 16,111
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to use the models on various provider services
๐ www.llama.com -
transformeroptimus/SuperAGI โญ 15,815
SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
๐ superagi.com -
thudm/ChatGLM2-6B โญ 15,759
ChatGLM2-6B: An Open Bilingual Chat LLM -
idea-research/Grounded-Segment-Anything โญ 15,658
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
๐ arxiv.org/abs/2401.14159 -
huggingface/open-r1 โญ 15,441
The goal of this repo is to build the missing pieces of the R1 pipeline such that everybody can reproduce and build on top of it -
openai/evals โญ 15,436
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. -
dao-ailab/flash-attention โญ 15,290
Fast and memory-efficient exact attention -
mayooear/gpt4-pdf-chatbot-langchain โญ 15,046
GPT4 & LangChain Chatbot for large PDF docs
๐ www.youtube.com/watch?v=ih9pbgvvoo4 -
fauxpilot/fauxpilot โญ 14,660
FauxPilot - an open-source alternative to GitHub Copilot server -
mlc-ai/web-llm โญ 14,544
High-performance In-browser LLM Inference Engine
๐ webllm.mlc.ai -
blinkdl/RWKV-LM โญ 13,081
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and f... -
vanna-ai/vanna โญ 12,942
Chat with your SQL database. Accurate Text-to-SQL Generation via LLMs using RAG.
๐ vanna.ai/docs -
microsoft/BitNet โญ 12,691
Official inference framework for 1-bit LLMs (e.g., BitNet b1.58). It offers a suite of optimized kernels that support fast and lossless inference of 1.58-bit models -
pathwaycom/llm-app โญ 12,582
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. Docker-friendly. Always in sync with SharePoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
๐ pathway.com/developers/templates -
paddlepaddle/PaddleNLP โญ 12,312
Easy-to-use and powerful NLP and LLM library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including Text Classification, Neural Search, Question Answering, Information Extraction, Document Intelligence, Sentiment Analysis, etc.
๐ paddlenlp.readthedocs.io -
openlmlab/MOSS โญ 12,024
An open-source tool-augmented conversational language model from Fudan University
๐ txsun1997.github.io/blogs/moss.html -
skyvern-ai/skyvern โญ 11,908
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
๐ www.skyvern.com -
shishirpatil/gorilla โญ 11,743
Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
๐ gorilla.cs.berkeley.edu -
nirdiamant/RAG_Techniques โญ 11,713
The most comprehensive and dynamic collection of Retrieval-Augmented Generation (RAG) tutorials available today. This repository serves as a hub for cutting-edge techniques aimed at enhancing the accuracy, efficiency, and contextual richness of RAG systems. -
h2oai/h2ogpt โญ 11,626
Private chat with local GPT with documents, images, video, etc. 100% private, Apache 2.0. Supports Ollama, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
๐ h2o.ai -
lightning-ai/litgpt โญ 11,435
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
๐ lightning.ai -
nvidia/Megatron-LM โญ 11,243
Ongoing research training transformer models at scale
๐ docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start -
microsoft/LoRA โญ 11,216
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
๐ arxiv.org/abs/2106.09685 -
lvwerra/trl โญ 11,053
Train transformer language models with reinforcement learning.
๐ hf.co/docs/trl -
google-research/vision_transformer โญ 10,845
Vision Transformer and MLP-Mixer Architectures -
databrickslabs/dolly โญ 10,805
Databricks' Dolly, a large language model trained on the Databricks Machine Learning Platform
๐ www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html -
outlines-dev/outlines โญ 10,572
Structured Text Generation from LLMs
๐ dottxt-ai.github.io/outlines -
artidoro/qlora โญ 10,210
QLoRA: Efficient Finetuning of Quantized LLMs
๐ arxiv.org/abs/2305.14314 -
anthropics/anthropic-cookbook โญ 10,161
Provides code and guides designed to help developers build with Claude, offering copy-able code snippets that you can easily integrate into your own projects. -
andrewyng/aisuite โญ 9,911
Simple, unified interface to multiple Generative AI providers. aisuite makes it easy for developers to use multiple LLMs through a standardized interface. -
mistralai/mistral-inference โญ 9,908
Official inference library for Mistral models
๐ mistral.ai -
microsoft/promptflow โญ 9,870
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
๐ microsoft.github.io/promptflow -
prompt-toolkit/python-prompt-toolkit โญ 9,511
Library for building powerful interactive command line applications in Python
๐ python-prompt-toolkit.readthedocs.io -
mshumer/gpt-prompt-engineer โญ 9,463
Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best. -
blinkdl/ChatRWKV โญ 9,448
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. -
karpathy/minbpe โญ 9,362
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. -
swivid/F5-TTS โญ 9,339
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
๐ arxiv.org/abs/2410.06885 -
jxnl/instructor โญ 9,231
Instructor is a Python library that makes it a breeze to work with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API to manage validation, retries, and streaming responses.
๐ python.useinstructor.com -
llmware-ai/llmware โญ 8,578
Unified framework for building enterprise RAG pipelines with small, specialized models
๐ llmware-ai.github.io/llmware -
apple/ml-ferret โญ 8,556
Ferret: Refer and Ground Anything Anywhere at Any Granularity -
abetlen/llama-cpp-python โญ 8,536
Simple Python bindings for @ggerganov's llama.cpp library.
๐ llama-cpp-python.readthedocs.io -
axolotl-ai-cloud/axolotl โญ 8,472
Go ahead and axolotl questions
๐ axolotl-ai-cloud.github.io/axolotl -
sgl-project/sglang โญ 8,442
SGLang is a fast serving framework for large language models and vision language models.
๐ docs.sglang.ai -
chainlit/chainlit โญ 8,434
Build Conversational AI in minutes
๐ docs.chainlit.io -
thudm/CodeGeeX โญ 8,353
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
๐ codegeex.cn -
optimalscale/LMFlow โญ 8,339
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
๐ optimalscale.github.io/lmflow -
eleutherai/gpt-neo โญ 8,266
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
๐ www.eleuther.ai -
jzhang38/TinyLlama โญ 8,162
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. -
sjtu-ipads/PowerInfer โญ 8,072
High-speed Large Language Model Serving for Local Deployment -
explodinggradients/ragas โญ 8,053
Supercharge Your LLM Application Evaluations
๐ docs.ragas.io -
lianjiatech/BELLE โญ 8,029
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model) -
vaibhavs10/insanely-fast-whisper โญ 8,028
An opinionated CLI to transcribe audio files with Whisper on-device! Powered by Transformers, Optimum & flash-attn -
01-ai/Yi โญ 7,802
The Yi series models are the next generation of open-source large language models trained from scratch by 01.AI.
๐ 01.ai -
plachtaa/VALL-E-X โญ 7,767
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/ -
jiayi-pan/TinyZero โญ 7,718
TinyZero is a reproduction of DeepSeek R1 Zero in countdown and multiplication tasks. -
thudm/GLM-130B โญ 7,679
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023) -
eleutherai/lm-evaluation-harness โญ 7,637
A framework for few-shot evaluation of language models.
๐ www.eleuther.ai -
anthropics/anthropic-quickstarts โญ 7,578
A collection of projects designed to help developers quickly get started with building applications using the Anthropic API. Each quickstart provides a foundation that you can easily build upon and customize for your specific needs. -
sweepai/sweep โญ 7,496
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
๐ sweep.dev -
openlm-research/open_llama โญ 7,434
OpenLLaMA: An Open Reproduction of LLaMA -
e2b-dev/E2B โญ 7,413
E2B is an open-source infrastructure that allows you to run AI-generated code in secure isolated sandboxes in the cloud
๐ e2b.dev/docs -
zilliztech/GPTCache โญ 7,362
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
๐ gptcache.readthedocs.io -
bigcode-project/starcoder โญ 7,355
Home of StarCoder: fine-tuning & inference! -
vikhyat/moondream โญ 7,225
A tiny open-source computer-vision language model designed to run efficiently on edge devices
๐ moondream.ai -
skypilot-org/skypilot โญ 7,110
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
๐ docs.skypilot.co -
eleutherai/gpt-neox โญ 7,075
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
๐ www.eleuther.ai -
bhaskatripathi/pdfGPT โญ 7,041
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
๐ huggingface.co/spaces/bhaskartripathi/pdfchatter -
apple/corenet โญ 7,000
CoreNet is a deep neural network toolkit that allows researchers and engineers to train standard and novel small and large-scale models for a variety of tasks, including foundation models (e.g., CLIP and LLM), object classification, object detection, and semantic segmentation. -
future-house/paper-qa โญ 6,870
High-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature -
mit-han-lab/streaming-llm โญ 6,780
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
๐ arxiv.org/abs/2309.17453 -
weaviate/Verba โญ 6,746
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate -
internlm/InternLM โญ 6,734
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
๐ internlm.intern-ai.org.cn -
langchain-ai/opengpts โญ 6,546
An open source effort to create a similar experience to OpenAI's GPTs and Assistants API. -
run-llama/rags โญ 6,376
RAGs is a Streamlit app that lets you create a RAG pipeline from a data source using natural language. -
nat/openplayground โญ 6,318
An LLM playground you can run on your laptop -
lightning-ai/lit-llama โญ 6,027
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. -
simonw/llm โญ 5,909
Access large language models from the command-line
๐ llm.datasette.io -
minedojo/Voyager โญ 5,865
An Open-Ended Embodied Agent with Large Language Models
๐ voyager.minedojo.org -
pytorch-labs/gpt-fast โญ 5,774
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. -
langchain-ai/chat-langchain โญ 5,663
Locally hosted chatbot specifically focused on question answering over the LangChain documentation
๐ chat.langchain.com -
lyogavin/airllm โญ 5,640
AirLLM optimizes inference memory usage, allowing 70B large language models to run inference on a single 4GB GPU card without quantization, distillation and pruning. And you can run 405B Llama3.1 on 8GB vram now. -
canner/WrenAI โญ 5,612
Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI.
๐ getwren.ai/oss -
microsoft/promptbase โญ 5,520
promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models. -
promptfoo/promptfoo โญ 5,410
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
๐ promptfoo.dev -
qwenlm/Qwen-VL โญ 5,379
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. -
dsdanielpark/Bard-API โญ 5,298
The unofficial Python package that returns responses from Google Bard using a cookie value.
๐ pypi.org/project/bardapi -
modelscope/ms-swift โญ 5,209
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
๐ swift.readthedocs.io/zh-cn/latest -
arcee-ai/mergekit โญ 5,197
Tools for merging pretrained large language models. -
allenai/OLMo โญ 5,115
OLMo is a repository for training and using AI2's state-of-the-art open language models. It is designed by scientists, for scientists.
๐ allenai.org/olmo -
openbmb/ToolBench โญ 4,867
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
๐ openbmb.github.io/toolbench -
microsoft/LLMLingua โญ 4,843
[EMNLP'23, ACL'24] Compresses the prompt and KV-cache to speed up LLM inference and improve the model's perception of key information, achieving up to 20x compression with minimal performance loss.
๐ llmlingua.com -
togethercomputer/RedPajama-Data โญ 4,633
The RedPajama-Data repository contains code for preparing large datasets for training large language models. -
open-compass/opencompass โญ 4,579
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
๐ opencompass.org.cn -
1rgs/jsonformer โญ 4,562
A Bulletproof Way to Generate Structured JSON from Language Models -
pipecat-ai/pipecat โญ 4,526
Open Source framework for voice and multimodal conversational AI -
guardrails-ai/guardrails โญ 4,439
Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
๐ www.guardrailsai.com/docs -
kyegomez/tree-of-thoughts โญ 4,435
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
๐ discord.gg/qutxnk2nmf -
nvidia/NeMo-Guardrails โญ 4,383
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. -
microsoft/BioGPT โญ 4,354
Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining -
linkedin/Liger-Kernel โญ 4,288
Efficient Triton Kernels for LLM Training
๐ arxiv.org/pdf/2410.10989 -
instruction-tuning-with-gpt-4/GPT-4-LLM โญ 4,259
Instruction Tuning with GPT-4
๐ instruction-tuning-with-gpt-4.github.io -
yizhongw/self-instruct โญ 4,254
Aligning pretrained language models with instruction data generated by themselves. -
katanaml/sparrow โญ 4,198
Sparrow is a solution for efficient data extraction and processing from various documents and images like invoices and receipts
๐ sparrow.katanaml.io -
h2oai/h2o-llmstudio โญ 4,150
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
๐ h2o.ai -
mshumer/gpt-llm-trainer โญ 4,071
Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you -
ragapp/ragapp โญ 4,011
The easiest way to use Agentic RAG in any enterprise -
turboderp/exllamav2 โญ 3,915
A fast inference library for running LLMs locally on modern consumer-class GPUs -
microsoft/LMOps โญ 3,824
General technology for enabling AI capabilities w/ LLMs and MLLMs
๐ aka.ms/generalai -
ravenscroftj/turbopilot โญ 3,818
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU -
eth-sri/lmql โญ 3,793
A language for constraint-guided and efficient LLM programming.
๐ lmql.ai -
agiresearch/AIOS โญ 3,773
AIOS, a Large Language Model (LLM) Agent operating system, embeds large language model into Operating Systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.
๐ aios.foundation -
mmabrouk/llm-workflow-engine โญ 3,679
Power CLI and Workflow manager for LLMs (core package) -
truefoundry/cognita โญ 3,558
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
๐ cognita.truefoundry.com -
defog-ai/sqlcoder โญ 3,528
SoTA LLM for converting natural language questions to SQL queries -
lm-sys/RouteLLM โญ 3,519
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality -
marker-inc-korea/AutoRAG โญ 3,515
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
๐ auto-rag.com -
minimaxir/simpleaichat โญ 3,499
Python package for easily interfacing with chat apps, with robust features and minimal code complexity. -
iryna-kondr/scikit-llm โญ 3,411
Seamlessly integrate LLMs into scikit-learn.
๐ beastbyte.ai -
next-gpt/NExT-GPT โญ 3,405
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
๐ next-gpt.github.io -
minimaxir/gpt-2-simple โญ 3,400
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts -
jaymody/picoGPT โญ 3,303
An unnecessarily tiny implementation of GPT-2 in NumPy. -
deep-diver/LLM-As-Chatbot โญ 3,299
LLM as a Chatbot Service -
luodian/Otter โญ 3,228
Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
๐ otter-ntu.github.io -
bclavie/RAGatouille โญ 3,214
Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices. -
huggingface/text-embeddings-inference โญ 3,105
A blazing fast inference solution for text embeddings models
๐ huggingface.co/docs/text-embeddings-inference/quick_tour -
microsoft/torchscale โญ 3,044
Foundation Architecture for (M)LLMs
๐ aka.ms/generalai -
baichuan-inc/Baichuan-13B โญ 2,973
A 13B large language model developed by Baichuan Intelligent Technology
๐ huggingface.co/baichuan-inc/baichuan-13b-chat -
li-plus/chatglm.cpp โญ 2,964
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V) -
cohere-ai/cohere-toolkit โญ 2,934
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. -
freedomintelligence/LLMZoo โญ 2,928
LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models. -
verazuo/jailbreak_llms โญ 2,917
Official repo for the ACM CCS 2024 paper "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts
๐ jailbreak-llms.xinyueshen.me -
meta-llama/PurpleLlama โญ 2,870
Set of tools to assess and improve LLM security. -
mistralai/mistral-finetune โญ 2,825
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA. -
lightning-ai/LitServe โญ 2,812
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
๐ lightning.ai/docs/litserve -
juncongmoo/pyllama โญ 2,806
LLaMA: Open and Efficient Foundation Language Models -
hegelai/prompttools โญ 2,779
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
๐ prompttools.readthedocs.io -
alpha-vllm/LLaMA2-Accessory โญ 2,750
An Open-source Toolkit for LLM Development
๐ llama2-accessory.readthedocs.io -
mit-han-lab/llm-awq โญ 2,703
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration -
paperswithcode/galai โญ 2,702
Model API for GALACTICA -
nirdiamant/Prompt_Engineering โญ 2,695
A comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. -
sylphai-inc/AdalFlow โญ 2,650
Unified auto-differentiative framework for both zero-shot prompt optimization and few-shot optimization. It advances existing auto-optimization research, including Text-Grad and DSPy
๐ adalflow.sylph.ai -
cheshire-cat-ai/core โญ 2,607
AI agent microservice
๐ cheshirecat.ai -
noahshinn/reflexion โญ 2,567
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning -
databricks/dbrx โญ 2,531
Code examples and resources for DBRX, a large language model developed by Databricks
๐ www.databricks.com -
pytorch/executorch โญ 2,461
An end-to-end solution for enabling on-device inference capabilities across mobile and edge devices including wearables, embedded devices and microcontrollers. It is part of the PyTorch Edge ecosystem and enables efficient deployment of PyTorch models to edge devices.
๐ pytorch.org/executorch -
ofa-sys/OFA โญ 2,457
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework -
young-geng/EasyLM โญ 2,448
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax. -
janhq/cortex.cpp โญ 2,429
Cortex is a Local AI API Platform that is used to run and customize LLMs.
๐ cortex.so -
novasky-ai/SkyThought โญ 2,353
Sky-T1: Train your own O1 preview model within $450
๐ novasky-ai.github.io -
civitai/sd_civitai_extension โญ 2,350
All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI -
predibase/lorax โญ 2,333
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
๐ loraexchange.ai -
intel/neural-compressor โญ 2,312
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
๐ intel.github.io/neural-compressor -
truera/trulens โญ 2,307
Evaluation and Tracking for LLM Experiments
๐ www.trulens.org -
spcl/graph-of-thoughts โญ 2,266
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
๐ arxiv.org/pdf/2308.09687.pdf -
openai/simple-evals โญ 2,260
Lightweight library for evaluating language models -
argilla-io/distilabel โญ 2,227
Distilabel is the framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
๐ distilabel.argilla.io -
openai/finetune-transformer-lm โญ 2,185
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
๐ s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf -
volcengine/verl โญ 2,116
veRL is a flexible, efficient and production-ready RL training library for large language models (LLMs).
๐ verl.readthedocs.io/en/latest/index.html -
tairov/llama2.mojo โญ 2,108
Inference Llama 2 in one file of pure 🔥
๐ www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov -
azure-samples/graphrag-accelerator โญ 2,101
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
๐ github.com/microsoft/graphrag -
evolvinglmms-lab/lmms-eval โญ 2,051
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
๐ lmms-lab.framer.ai -
openai/image-gpt โญ 2,049
Archived. Code and models from the paper "Generative Pretraining from Pixels" -
agenta-ai/agenta โญ 2,036
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM Observability all in one place.
๐ www.agenta.ai -
ist-daslab/gptq โญ 2,016
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
๐ arxiv.org/abs/2210.17323 -
lucidrains/toolformer-pytorch โญ 1,998
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI -
neulab/prompt2model โญ 1,981
prompt2model - Generate Deployable Models from Natural Language Instructions -
microsoft/Megatron-DeepSpeed โญ 1,972
Ongoing research training transformer language models at scale, including: BERT & GPT-2 -
openai/gpt-2-output-dataset โญ 1,958
Dataset of GPT-2 outputs for research in detection, biases, and more -
akariasai/self-rag โญ 1,952
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
๐ selfrag.github.io -
epfllm/meditron โญ 1,945
Meditron is a suite of open-source medical Large Language Models (LLMs).
๐ huggingface.co/epfl-llm -
flashinfer-ai/flashinfer โญ 1,926
FlashInfer is a library and kernel generator for Large Language Models that provides high-performance implementation of LLM GPU kernels such as FlashAttention, SparseAttention, PageAttention, Sampling
๐ flashinfer.ai -
casper-hansen/AutoAWQ โญ 1,924
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
๐ casper-hansen.github.io/autoawq -
facebookresearch/chameleon โญ 1,916
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
๐ arxiv.org/abs/2405.09818 -
facebookresearch/large_concept_model โญ 1,860
Large Concept Models: Language modeling in a sentence representation space -
minimaxir/aitextgen โญ 1,843
A robust Python tool for text-based AI training and generation using GPT-2.
๐ docs.aitextgen.io -
openai/gpt-discord-bot โญ 1,801
Example Discord bot written in Python that uses the completions API to have conversations with the text-davinci-003 model, and the moderations API to filter the messages. -
huggingface/smollm โญ 1,785
Everything about the SmolLM2 and SmolVLM family of models
๐ huggingface.co/huggingfacetb -
ray-project/llm-applications โญ 1,753
A comprehensive guide to building RAG-based LLM applications for production. -
ruc-nlpir/FlashRAG โญ 1,715
FlashRAG is a Python toolkit for the reproduction and development of RAG research. Our toolkit includes 36 pre-processed benchmark RAG datasets and 15 state-of-the-art RAG algorithms.
๐ arxiv.org/abs/2405.13576 -
noamgat/lm-format-enforcer โญ 1,687
Enforce the output format (JSON Schema, Regex etc) of a language model -
modelcontextprotocol/python-sdk โญ 1,636
The Model Context Protocol allows applications to provide context for LLMs in a standardized way, separating the concerns of providing context from the actual LLM interaction.
๐ modelcontextprotocol.io -
qwenlm/Qwen-Audio โญ 1,575
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud. -
jina-ai/thinkgpt โญ 1,565
Agent techniques to augment your LLM and push it beyond its limits -
agentops-ai/tokencost โญ 1,546
Easy token price estimates for 400+ LLMs. TokenOps.
๐ agentops.ai -
meetkai/functionary โญ 1,508
Chat language model that can use tools and interpret the results -
deep-agent/R1-V โญ 1,481
We are building a general framework for Reinforcement Learning with Verifiable Rewards (RLVR) in VLM. RLVR outperforms chain-of-thought supervised fine-tuning (CoT-SFT) in both effectiveness and out-of-distribution (OOD) robustness for vision language models. -
roboflow/maestro โญ 1,460
Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
๐ maestro.roboflow.com -
cstankonrad/long_llama โญ 1,448
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method. -
farizrahman4u/loopgpt โญ 1,443
Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind. -
run-llama/llama-lab โญ 1,440
Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex -
huggingface/nanotron โญ 1,414
Minimalistic large language model 3D-parallelism training -
chatarena/chatarena โญ 1,406
ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs. -
explosion/spacy-transformers โญ 1,360
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
๐ spacy.io/usage/embeddings-transformers -
bigscience-workshop/Megatron-DeepSpeed โญ 1,358
Ongoing research training transformer language models at scale, including: BERT & GPT-2 -
karpathy/nano-llama31 โญ 1,303
This repo is to Llama 3.1 what nanoGPT is to GPT-2. i.e. it is a minimal, dependency-free implementation of the Llama 3.1 architecture -
answerdotai/rerankers โญ 1,266
Welcome to rerankers! Our goal is to provide users with a simple API to use any reranking models. -
ray-project/ray-llm โญ 1,252
RayLLM - LLMs on Ray
๐ aviary.anyscale.com -
facebookresearch/MobileLLM โญ 1,233
Training code of MobileLLM introduced in our work: "MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases" -
srush/MiniChain โญ 1,220
A tiny library for coding with large language models.
๐ srush-minichain.hf.space -
mlfoundations/dclm โญ 1,213
DataComp for Language Models -
keirp/automatic_prompt_engineer โญ 1,205
Large Language Models Are Human-Level Prompt Engineers -
hao-ai-lab/LookaheadDecoding โญ 1,184
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
๐ arxiv.org/abs/2402.02057 -
explosion/spacy-llm โญ 1,180
Integrating LLMs into structured NLP pipelines
๐ spacy.io/usage/large-language-models -
ibm/Dromedary โญ 1,133
Dromedary: towards helpful, ethical and reliable LLMs. -
topoteretes/cognee โญ 1,108
Reliable LLM Memory for AI Applications and AI Agents
๐ www.cognee.ai -
lupantech/chameleon-llm โญ 1,108
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
๐ chameleon-llm.github.io -
rlancemartin/auto-evaluator โญ 1,067
Evaluation tool for LLM QA chains
๐ autoevaluator.langchain.com -
huggingface/lighteval โญ 1,061
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron. -
ctlllll/LLM-ToolMaker โญ 1,025
Large Language Models as Tool Makers -
microsoft/Llama-2-Onnx โญ 1,024
A Microsoft optimized version of the Llama 2 model, available from Meta -
nomic-ai/pygpt4all โญ 1,020
Officially supported Python bindings for llama.cpp + gpt4all
๐ nomic-ai.github.io/pygpt4all -
cerebras/modelzoo โญ 1,011
Examples of common deep learning models that can be trained on Cerebras hardware -
nirdiamant/Controllable-RAG-Agent โญ 1,010
An advanced Retrieval-Augmented Generation (RAG) solution designed to tackle complex questions that simple semantic similarity-based retrieval cannot solve -
minishlab/model2vec โญ 1,006
Model2Vec is a technique to turn any sentence transformer into a really small static model, reducing model size by 15x and making the models up to 500x faster, with a small drop in performance
๐ minishlab.github.io -
pinecone-io/canopy โญ 998
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
๐ www.pinecone.io -
ajndkr/lanarky โญ 986
The web framework for building LLM microservices
๐ lanarky.ajndkr.com -
huggingface/evaluation-guidebook โญ 981
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval! -
likejazz/llama3.np โญ 972
llama3.np is a pure NumPy implementation of the Llama 3 model. -
datadreamer-dev/DataDreamer โญ 949
DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
๐ datadreamer.dev -
huggingface/optimum-nvidia โญ 927
Optimum-NVIDIA delivers the best inference performance on the NVIDIA platform through Hugging Face. Run LLaMA 2 at 1,200 tokens/second (up to 28x faster than the framework) -
soulter/hugging-chat-api โญ 898
HuggingChat Python API -
muennighoff/sgpt โญ 861
SGPT: GPT Sentence Embeddings for Semantic Search
๐ arxiv.org/abs/2202.08904 -
prometheus-eval/prometheus-eval โญ 855
Evaluate your LLM's response with Prometheus and GPT4 -
langchain-ai/langsmith-cookbook โญ 833
LangSmith is a platform for building production-grade LLM applications.
๐ langsmith-cookbook.vercel.app -
wandb/weave โญ 797
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.
๐ wandb.me/weave -
junruxiong/IncarnaMind โญ 789
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
๐ www.incarnamind.com -
nousresearch/Hermes-Function-Calling โญ 788
Code for the Hermes Pro Large Language Model to perform function calling based on the provided schema. It allows users to query the model and retrieve information related to stock prices, company fundamentals, financial statements -
oliveirabruno01/babyagi-asi โญ 788
BabyAGI: an Autonomous and Self-Improving agent, or BASI -
opengvlab/OmniQuant โญ 759
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. -
opengenerativeai/GenossGPT โญ 751
One API for all LLMs, private or public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...). Replace OpenAI GPT with any LLM in your app with one line.
๐ genoss.ai -
utkusen/promptmap โญ 726
Vulnerability scanning tool that automatically tests prompt injection attacks on your LLM applications. It analyzes your LLM system prompts, runs them, and sends attack prompts to them. -
salesforce/xgen โญ 717
Salesforce open-source LLMs with 8k sequence length. -
squeezeailab/SqueezeLLM โญ 671
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
๐ arxiv.org/abs/2306.07629 -
tag-research/TAG-Bench โญ 661
Table-Augmented Generation (TAG) is a unified and general-purpose paradigm for answering natural language questions over databases
๐ arxiv.org/pdf/2408.14717 -
mlc-ai/xgrammar โญ 639
XGrammar is an open-source library for efficient, flexible, and portable structured generation. It supports general context-free grammar to enable a broad range of structures while bringing careful system optimizations to enable fast executions.
๐ xgrammar.mlc.ai -
lupantech/ScienceQA โญ 627
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering". -
developersdigest/llm-api-engine โญ 622
Build and deploy AI-powered APIs in seconds. This project allows you to create custom APIs that extract structured data from websites using natural language descriptions, powered by LLMs and web scraping technology.
๐ www.youtube.com/watch?v=8kuek1bo4mm -
tsinghuadatabasegroup/DB-GPT โญ 597
LLM As Database Administrator
๐ dbgpt.dbmind.cn -
microsoft/VPTQ โญ 575
Extreme Low-bit Vector Post-Training Quantization for Large Language Models -
zhudotexe/kani โญ 570
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
๐ kani.readthedocs.io -
modal-labs/llm-finetuning โญ 564
Guide for fine-tuning Llama/Mistral/CodeLlama models and more -
magnivorg/prompt-layer-library โญ 544
PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
๐ www.promptlayer.com -
hazyresearch/ama_prompting โญ 544
Ask Me Anything language model prompting -
declare-lab/instruct-eval โญ 540
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
๐ declare-lab.github.io/instruct-eval -
vahe1994/SpQR โญ 538
Quantization algorithm and the model evaluation code for SpQR method for LLM compression -
eugeneyan/obsidian-copilot โญ 523
A prototype assistant for writing and thinking
๐ eugeneyan.com/writing/obsidian-copilot -
continuum-llms/chatgpt-memory โญ 523
Scales the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory, powered by GPT and a Redis datastore. -
judahpaul16/gpt-home โญ 519
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
๐ hub.docker.com/r/judahpaul/gpt-home -
hazyresearch/H3 โญ 516
Language Modeling with the H3 State Space Model -
kbressem/medAlpaca โญ 504
LLM finetuned for medical question answering -
huggingface/text-clustering โญ 495
Easily embed, cluster and semantically label text datasets -
stanford-oval/suql โญ 241
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
๐ arxiv.org/abs/2311.09818 -
dottxt-ai/outlines-core โญ 173
Core functionality for structured generation, formerly implemented in Outlines, with a focus on performance and portability. -
prithivirajdamodaran/Route0x โญ 92
A production-grade query routing solution, leveraging LLMs while optimizing for cost per query -
whitead/paper-qa โญ 3
High accuracy RAG for answering questions from scientific documents with citations
Mathematical, numerical and scientific libraries.
-
numpy/numpy โญ 28,718
The fundamental package for scientific computing with Python.
๐ numpy.org -
camdavidsonpilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers โญ 27,158
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
๐ camdavidsonpilon.github.io/probabilistic-programming-and-bayesian-methods-for-hackers -
taichi-dev/taichi โญ 26,664
Productive, portable, and performant GPU programming in Python: Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation.
๐ taichi-lang.org -
experience-monks/math-as-code โญ 15,277
This is a reference to ease developers into mathematical notation by showing comparisons with Python code -
scipy/scipy โญ 13,347
SciPy library main repository
๐ scipy.org -
sympy/sympy โญ 13,272
A computer algebra system written in pure Python
๐ sympy.org -
google/or-tools โญ 11,581
Google Optimization Tools (a.k.a., OR-Tools) is an open-source, fast and portable software suite for solving combinatorial optimization problems.
๐ developers.google.com/optimization -
z3prover/z3 โญ 10,612
Z3 is a theorem prover from Microsoft Research with a Python language binding. -
google-deepmind/alphageometry โญ 4,259
Solving Olympiad Geometry without Human Demonstrations -
pim-book/programmers-introduction-to-mathematics โญ 3,555
Code for A Programmer's Introduction to Mathematics
๐ pimbook.org -
mikedh/trimesh โญ 3,119
Python library for loading and using triangular meshes.
๐ trimesh.org -
talalalrawajfeh/mathematics-roadmap โญ 2,787
A Comprehensive Roadmap to Mathematics -
pyro-ppl/numpyro โญ 2,381
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
๐ num.pyro.ai -
mckinsey/causalnex โญ 2,274
A Python library that helps data scientists to infer causation rather than observing correlation.
๐ causalnex.readthedocs.io -
pyomo/pyomo โญ 2,100
An object-oriented algebraic modeling language in Python for structured optimization problems.
๐ www.pyomo.org -
facebookresearch/theseus โญ 1,826
A library for differentiable nonlinear optimization -
arviz-devs/arviz โญ 1,644
Exploratory analysis of Bayesian models with Python
๐ python.arviz.org -
google-research/torchsde โญ 1,600
Differentiable SDE solvers with GPU support and efficient sensitivity analysis. -
dynamicslab/pysindy โญ 1,510
A package for the sparse identification of nonlinear dynamical systems from data
๐ pysindy.readthedocs.io/en/latest -
geomstats/geomstats โญ 1,289
Computations and statistics on manifolds with geometric structures.
๐ geomstats.ai -
cma-es/pycma โญ 1,139
pycma is a Python implementation of CMA-ES and a few related numerical optimization tools. -
pymc-labs/CausalPy โญ 945
A Python package for causal inference in quasi-experimental settings
๐ causalpy.readthedocs.io -
sj001/AI-Feynman โญ 661
Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression -
willianfuks/tfcausalimpact โญ 627
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability. -
lean-dojo/LeanDojo โญ 606
Tool for data extraction and interacting with Lean programmatically.
๐ leandojo.org -
brandondube/prysm โญ 280
Prysm is an open-source library for physical and first-order modeling of optical systems and analysis of related data: numerical and physical optics, integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing.
๐ prysm.readthedocs.io/en/stable -
lean-dojo/ReProver โญ 246
Retrieval-Augmented Theorem Provers for Lean
๐ leandojo.org -
albahnsen/pycircular โญ 104
pycircular is a Python module for circular data analysis -
gbillotey/Fractalshades โญ 28
Arbitrary-precision fractal explorer - Python package
General and classical machine learning libraries. See below for other sections covering specialised ML areas.
-
openai/openai-cookbook โญ 61,497
Examples and guides for using the OpenAI API
๐ cookbook.openai.com -
scikit-learn/scikit-learn โญ 60,964
scikit-learn: machine learning in Python
๐ scikit-learn.org -
suno-ai/bark โญ 36,830
Text-Prompted Generative Audio Model -
tencentarc/GFPGAN โญ 36,245
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration. -
google-research/google-research โญ 34,815
This repository contains code released by Google Research
๐ research.google -
facebookresearch/faiss โญ 32,737
A library for efficient similarity search and clustering of dense vectors.
๐ faiss.ai -
google/jax โญ 31,147
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
๐ jax.readthedocs.io -
open-mmlab/mmdetection โญ 30,143
OpenMMLab Detection Toolbox and Benchmark
๐ mmdetection.readthedocs.io -
lutzroeder/netron โญ 29,254
Visualizer for neural network, deep learning and machine learning models
๐ netron.app -
google/mediapipe โญ 28,457
Cross-platform, customizable ML solutions for live and streaming media.
๐ ai.google.dev/edge/mediapipe -
ageron/handson-ml2 โญ 28,334
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2. -
dmlc/xgboost โญ 26,551
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
๐ xgboost.readthedocs.io/en/stable -
roboflow/supervision โญ 24,769
We write your reusable computer vision tools.
๐ supervision.roboflow.com -
harisiqbal88/PlotNeuralNet โญ 22,657
LaTeX code for drawing neural network diagrams -
jina-ai/serve โญ 21,274
Build multimodal AI applications with a cloud-native stack
๐ jina.ai/serve -
ml-explore/mlx โญ 18,811
MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
๐ ml-explore.github.io/mlx -
onnx/onnx โญ 18,347
Open standard for machine learning interoperability
๐ onnx.ai -
microsoft/LightGBM โญ 16,932
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
๐ lightgbm.readthedocs.io/en/latest -
ddbourgin/numpy-ml โญ 15,889
Machine learning, in numpy
๐ numpy-ml.readthedocs.io -
tensorflow/tensor2tensor โญ 15,781
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. -
microsoft/onnxruntime โญ 15,466
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
๐ onnxruntime.ai -
aleju/imgaug โญ 14,500
Image augmentation for machine learning experiments.
๐ imgaug.readthedocs.io -
microsoft/nni โญ 14,109
An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
๐ nni.readthedocs.io -
jindongwang/transferlearning โญ 13,679
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials.
๐ transferlearning.xyz -
neonbjb/tortoise-tts โญ 13,624
A multi-voice TTS system trained with an emphasis on quality -
spotify/annoy โญ 13,463
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk -
deepmind/deepmind-research โญ 13,459
This repository contains implementations and illustrative code to accompany DeepMind publications -
deepmind/alphafold โญ 13,131
Implementation of the inference pipeline of AlphaFold v2 -
facebookresearch/AnimatedDrawings โญ 12,208
Code to accompany "A Method for Animating Children's Drawings of the Human Figure" -
ggerganov/ggml โญ 11,724
Tensor library for machine learning -
optuna/optuna โญ 11,303
A hyperparameter optimization framework
๐ optuna.org -
google-gemini/cookbook โญ 10,568
A collection of guides and examples for the Gemini API, including quickstart tutorials for writing prompts.
๐ ai.google.dev/gemini-api/docs -
thudm/CogVideo โญ 10,499
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) -
statsmodels/statsmodels โญ 10,408
Statsmodels: statistical modeling and econometrics in Python
๐ www.statsmodels.org/devel -
twitter/the-algorithm-ml โญ 10,178
Source code for Twitter's Recommendation Algorithm
๐ blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm -
cleanlab/cleanlab โญ 10,137
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
๐ cleanlab.ai -
epistasislab/tpot โญ 9,831
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
๐ epistasislab.github.io/tpot -
megvii-basedetection/YOLOX โญ 9,614
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/ -
wandb/wandb โญ 9,429
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
๐ wandb.ai -
pycaret/pycaret โญ 9,105
An open-source, low-code machine learning library in Python
๐ www.pycaret.org -
facebookresearch/xformers โญ 8,964
Hackable and optimized Transformers building blocks, supporting a composable construction.
๐ facebookresearch.github.io/xformers -
pymc-devs/pymc โญ 8,855
Bayesian Modeling and Probabilistic Programming in Python
๐ docs.pymc.io -
uberi/speech_recognition โญ 8,573
Speech recognition module for Python, supporting several engines and APIs, online and offline.
๐ pypi.python.org/pypi/speechrecognition -
open-mmlab/mmsegmentation โญ 8,513
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
๐ mmsegmentation.readthedocs.io/en/main -
awslabs/autogluon โญ 8,324
Fast and Accurate ML in 3 Lines of Code
๐ auto.gluon.ai -
huggingface/accelerate โญ 8,263
A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
๐ huggingface.co/docs/accelerate -
catboost/catboost โญ 8,225
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
๐ catboost.ai -
automl/auto-sklearn โญ 7,714
Automated Machine Learning with scikit-learn
๐ automl.github.io/auto-sklearn -
lmcinnes/umap โญ 7,611
Uniform Manifold Approximation and Projection -
featurelabs/featuretools โญ 7,354
An open source Python library for automated feature engineering
๐ www.featuretools.com -
hyperopt/hyperopt โญ 7,332
Distributed Asynchronous Hyperparameter Optimization in Python
๐ hyperopt.github.io/hyperopt -
py-why/dowhy โญ 7,255
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
๐ www.pywhy.org/dowhy -
hips/autograd โญ 7,117
Efficiently computes derivatives of NumPy code. -
open-mmlab/mmagic โญ 7,047
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models for text-to-image generation, image/video restoration/enhancement, etc.
๐ mmagic.readthedocs.io/en/latest -
scikit-learn-contrib/imbalanced-learn โญ 6,908
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
๐ imbalanced-learn.org -
ml-explore/mlx-examples โญ 6,815
Examples in the MLX framework -
probml/pyprobml โญ 6,654
Python code for "Probabilistic Machine learning" book by Kevin Murphy -
nicolashug/Surprise โญ 6,488
A Python scikit for building and analyzing recommender systems
๐ surpriselib.com -
yangchris11/samurai โญ 6,435
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
๐ yangchris11.github.io/samurai -
cleverhans-lab/cleverhans โญ 6,236
An adversarial example library for constructing attacks, building defenses, and benchmarking both -
project-monai/MONAI โญ 6,079
AI Toolkit for Healthcare Imaging
๐ monai.io -
kevinmusgrave/pytorch-metric-learning โญ 6,078
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
๐ kevinmusgrave.github.io/pytorch-metric-learning -
open-mmlab/mmcv โญ 5,993
OpenMMLab Computer Vision Foundation
๐ mmcv.readthedocs.io/en/latest -
google-deepmind/graphcast โญ 5,793
GraphCast: Learning skillful medium-range global weather forecasting -
uber/causalml โญ 5,199
Uplift modeling and causal inference with machine learning algorithms -
online-ml/river โญ 5,182
Online machine learning in Python
๐ riverml.xyz -
mdbloice/Augmentor โญ 5,092
Image augmentation library in Python for machine learning.
๐ augmentor.readthedocs.io/en/stable -
rasbt/mlxtend โญ 4,960
A library of extension and helper modules for Python's data analysis and machine learning libraries.
๐ rasbt.github.io/mlxtend -
marqo-ai/marqo โญ 4,748
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
๐ www.marqo.ai -
skvark/opencv-python โญ 4,677
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
๐ pypi.org/project/opencv-python -
apple/coremltools โญ 4,549
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
๐ coremltools.readme.io -
sanchit-gandhi/whisper-jax โญ 4,530
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. -
nmslib/hnswlib โญ 4,517
Header-only C++/python library for fast approximate nearest neighbors
๐ github.com/nmslib/hnswlib -
lucidrains/deep-daze โญ 4,368
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun -
districtdatalabs/yellowbrick โญ 4,313
Visual analysis and diagnostic tools to facilitate machine learning model selection.
๐ www.scikit-yb.org -
nv-tlabs/GET3D โญ 4,286
Generative Model of High Quality 3D Textured Shapes Learned from Images -
huggingface/autotrain-advanced โญ 4,228
AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
๐ huggingface.co/autotrain -
microsoft/FLAML โญ 4,035
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
๐ microsoft.github.io/flaml -
cmusphinx/pocketsphinx โญ 4,014
A small speech recognizer -
ourownstory/neural_prophet โญ 3,975
NeuralProphet: A simple forecasting package
๐ neuralprophet.com -
py-why/EconML โญ 3,948
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
๐ www.microsoft.com/en-us/research/project/alice -
huggingface/notebooks โญ 3,840
Notebooks using the Hugging Face libraries -
zjunlp/DeepKE โญ 3,711
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
๐ deepke.zjukg.cn -
huggingface/speech-to-speech โญ 3,703
Speech To Speech: an effort for an open-sourced and modular GPT-4o -
rucaibox/RecBole โญ 3,567
A unified, comprehensive and efficient recommendation library
๐ recbole.io -
yoheinakajima/instagraph โญ 3,498
Converts text input or a URL into a knowledge graph and displays it -
pytorch/glow โญ 3,267
Compiler for Neural Network hardware accelerators -
lightly-ai/lightly โญ 3,266
A Python library for self-supervised learning on images.
๐ docs.lightly.ai/self-supervised-learning -
facebookresearch/vissl โญ 3,265
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
๐ vissl.ai -
lucidrains/musiclm-pytorch โญ 3,221
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch -
hrnet/HRNet-Semantic-Segmentation โญ 3,193
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919 -
mljar/mljar-supervised โญ 3,106
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
๐ mljar.com -
shankarpandala/lazypredict โญ 3,083
Lazy Predict helps build a lot of basic models without much code and helps understand which models work better without any parameter tuning -
huggingface/safetensors โญ 3,060
Implements a new simple format for storing tensors safely (as opposed to pickle) and that is still fast (zero-copy).
๐ huggingface.co/docs/safetensors -
scikit-learn-contrib/hdbscan โญ 2,845
A high performance implementation of HDBSCAN clustering.
๐ hdbscan.readthedocs.io/en/latest -
scikit-optimize/scikit-optimize โญ 2,754
Sequential model-based optimization with a scipy.optimize interface
๐ scikit-optimize.github.io -
google-research/t5x โญ 2,733
T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales. -
huggingface/optimum โญ 2,719
Accelerate inference and training of Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
๐ huggingface.co/docs/optimum/main -
apple/ml-ane-transformers โญ 2,587
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE) -
freedmand/semantra โญ 2,558
Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text. -
rom1504/clip-retrieval โญ 2,479
Easily compute clip embeddings and build a clip retrieval system with them
๐ rom1504.github.io/clip-retrieval -
scikit-learn-contrib/category_encoders โญ 2,422
A library of sklearn compatible categorical variable encoders
๐ contrib.scikit-learn.org/category_encoders -
neuraloperator/neuraloperator โญ 2,398
Comprehensive library for learning neural operators in PyTorch. It is the official implementation for Fourier Neural Operators and Tensorized Neural Operators.
๐ neuraloperator.github.io/dev/index.html -
priorlabs/TabPFN โญ 2,396
TabPFN is a neural network that learned to do tabular data prediction. This is the original CUDA-supporting PyTorch implementation.
๐ priorlabs.ai -
eric-mitchell/direct-preference-optimization โญ 2,347
Reference implementation for DPO (Direct Preference Optimization) -
huggingface/huggingface_hub โญ 2,297
The official Python client for the Hugging Face Hub.
๐ huggingface.co/docs/huggingface_hub -
aws/sagemaker-python-sdk โญ 2,126
A library for training and deploying machine learning models on Amazon SageMaker
๐ sagemaker.readthedocs.io -
huggingface/evaluate โญ 2,095
Evaluate: A library for easily evaluating machine learning models and datasets.
๐ huggingface.co/docs/evaluate -
contextlab/hypertools โญ 1,831
A Python toolbox for gaining geometric insights into high-dimensional data
๐ hypertools.readthedocs.io/en/latest -
linkedin/greykite โญ 1,824
A flexible, intuitive and fast forecasting library -
rentruewang/koila โญ 1,822
Prevent PyTorch's "CUDA error: out of memory" in just 1 line of code.
๐ koila.rentruewang.com -
bmabey/pyLDAvis โญ 1,812
Python library for interactive topic model visualization. Port of the R LDAvis package. -
microsoft/Olive โญ 1,734
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
๐ microsoft.github.io/olive -
scikit-learn-contrib/lightning โญ 1,732
Large-scale linear classification, regression and ranking in Python
๐ contrib.scikit-learn.org/lightning -
qdrant/fastembed โญ 1,729
Fast, accurate, lightweight Python library for generating state-of-the-art embeddings
๐ qdrant.github.io/fastembed -
castorini/pyserini โญ 1,729
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
๐ pyserini.io -
tensorflow/addons โญ 1,694
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons -
microsoft/i-Code โญ 1,683
The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning. -
stanfordmlgroup/ngboost โญ 1,677
Natural Gradient Boosting for Probabilistic Prediction -
visual-layer/fastdup โญ 1,651
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability. -
laekov/fastmoe โญ 1,610
A fast Mixture-of-Experts (MoE) implementation for PyTorch
๐ fastmoe.ai -
kubeflow/katib โญ 1,536
Automated Machine Learning on Kubernetes
๐ www.kubeflow.org/docs/components/katib -
google/vizier โญ 1,531
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
๐ oss-vizier.readthedocs.io -
jina-ai/finetuner โญ 1,486
Task-oriented embedding tuning for BERT, CLIP, etc.
๐ finetuner.jina.ai -
csinva/imodels โญ 1,421
Interpretable ML package for concise, transparent, and accurate predictive modeling (sklearn-compatible).
๐ csinva.io/imodels -
microsoft/Semi-supervised-learning โญ 1,408
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
๐ usb.readthedocs.io -
patchy631/machine-learning โญ 1,401
Machine Learning Tutorials Repository -
spotify/voyager โญ 1,383
An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
๐ spotify.github.io/voyager -
borealisai/advertorch โญ 1,319
A Toolbox for Adversarial Robustness Research -
koaning/scikit-lego โญ 1,299
Extra blocks for scikit-learn pipelines.
๐ koaning.github.io/scikit-lego -
awslabs/dgl-ke โญ 1,283
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
๐ dglke.dgl.ai/doc -
lightning-ai/lightning-thunder โญ 1,276
Thunder is a source-to-source compiler for PyTorch. It makes PyTorch programs faster by combining and using different hardware executors at once -
pytorch/FBGEMM โญ 1,250
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ -
nvidia/cuda-python โญ 1,081
CUDA Python: Performance meets Productivity
๐ nvidia.github.io/cuda-python -
davidmrau/mixture-of-experts โญ 1,031
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538 -
google-research/deeplab2 โญ 1,011
DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks. -
opentensor/bittensor โญ 997
Internet-scale Neural Networks
๐ www.bittensor.com -
oml-team/open-metric-learning โญ 902
OML is a PyTorch-based framework to train and validate models that produce high-quality embeddings.
๐ open-metric-learning.readthedocs.io/en/latest/index.html -
hazyresearch/safari โญ 876
Convolutions for Sequence Modeling -
huggingface/optimum-quanto โญ 871
A PyTorch quantization backend for optimum -
criteo/autofaiss โญ 832
Automatically create Faiss knn indices with optimal similarity search parameters.
๐ criteo.github.io/autofaiss -
replicate/replicate-python โญ 792
Python client for Replicate
๐ replicate.com -
pymc-labs/pymc-marketing โญ 770
Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
๐ www.pymc-marketing.io -
awslabs/python-deequ โญ 739
Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets -
facebookresearch/balance โญ 691
The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
๐ import-balance.org -
googleapis/python-aiplatform โญ 686
A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning. -
nicolas-hbt/pygraft โญ 680
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
๐ pygraft.readthedocs.io/en/latest -
qdrant/quaterion โญ 647
Blazing fast framework for fine-tuning similarity learning models
๐ quaterion.qdrant.tech -
huggingface/exporters โญ 640
Export Hugging Face models to Core ML and TensorFlow Lite -
hpcaitech/EnergonAI โญ 628
Large-scale model inference. -
intel/intel-npu-acceleration-library โญ 609
The Intel NPU Acceleration Library is a Python library designed to boost the efficiency of your applications by leveraging the power of the Intel Neural Processing Unit (NPU) to perform high-speed computations on compatible hardware. -
nomic-ai/contrastors โญ 577
Contrastive learning toolkit that enables researchers and engineers to train and evaluate contrastive models efficiently. -
intellabs/bayesian-torch โญ 565
A library for Bayesian neural network layers and uncertainty estimation in Deep Learning extending the core of PyTorch -
microsoft/Focal-Transformer โญ 549
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" -
linkedin/FastTreeSHAP โญ 532
Fast SHAP value computation for interpreting tree-based models -
mrdbourke/m1-machine-learning-test โญ 531
Code for testing various M1 Chip benchmarks with TensorFlow. -
nevronai/MetisFL โญ 526
The first open Federated Learning framework implemented in C++ and Python.
๐ metisfl.org -
deepgraphlearning/ULTRA โญ 505
A foundation model for knowledge graph reasoning -
dylanhogg/gptauthor โญ 69
GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.
Machine learning libraries that cross over with deep learning in some way.
-
tensorflow/tensorflow โญ 187,738
An Open Source Machine Learning Framework for Everyone
๐ tensorflow.org -
pytorch/pytorch โญ 86,488
Tensors and Dynamic neural networks in Python with strong GPU acceleration
๐ pytorch.org -
openai/whisper โญ 75,567
Robust Speech Recognition via Large-Scale Weak Supervision -
keras-team/keras โญ 62,452
Deep Learning for humans
๐ keras.io -
deepfakes/faceswap โญ 53,156
Deepfakes Software For All
๐ www.faceswap.dev -
facebookresearch/segment-anything โญ 48,687
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. -
microsoft/DeepSpeed โญ 36,513
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
๐ www.deepspeed.ai -
rwightman/pytorch-image-models โญ 33,053
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
๐ huggingface.co/docs/timm -
facebookresearch/detectron2 โญ 31,142
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
๐ detectron2.readthedocs.io/en/latest -
xinntao/Real-ESRGAN โญ 29,449
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. -
lightning-ai/pytorch-lightning โญ 28,899
The deep learning framework to pretrain, finetune and deploy AI models. PyTorch Lightning is just organized PyTorch - Lightning disentangles PyTorch code to decouple the science from the engineering.
๐ lightning.ai -
google-research/tuning_playbook โญ 27,934
A playbook for systematically maximizing the performance of deep learning models. -
openai/CLIP โญ 27,193
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image -
facebookresearch/Detectron โญ 26,300
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet. -
matterport/Mask_RCNN โญ 24,875
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow -
paddlepaddle/Paddle โญ 22,439
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)
๐ www.paddlepaddle.org -
pyg-team/pytorch_geometric โญ 21,828
Graph Neural Network Library for PyTorch
๐ pyg.org -
lucidrains/vit-pytorch โญ 21,621
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch -
apache/mxnet โญ 20,787
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
๐ mxnet.apache.org -
sanster/IOPaint โญ 20,309
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
๐ www.iopaint.com -
danielgatis/rembg โญ 17,862
Rembg is a tool to remove image backgrounds -
rasbt/deeplearning-models โญ 16,866
A collection of various deep learning architectures, models, and tips -
albumentations-team/albumentations โญ 14,541
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
๐ albumentations.ai -
microsoft/Swin-Transformer โญ 14,253
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
๐ arxiv.org/abs/2103.14030 -
facebookresearch/detr โญ 13,938
End-to-End Object Detection with Transformers -
nvidia/DeepLearningExamples โญ 13,895
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure. -
dmlc/dgl โญ 13,691
Python package built to ease deep learning on graph, on top of existing DL frameworks.
๐ dgl.ai -
mlfoundations/open_clip โญ 10,906
Open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training). -
kornia/kornia โญ 10,186
Geometric Computer Vision Library for Spatial AI
๐ kornia.readthedocs.io -
modelscope/facechain โญ 9,252
FaceChain is a deep-learning toolchain for generating your Digital-Twin. -
keras-team/autokeras โญ 9,194
AutoML library for deep learning
๐ autokeras.com -
facebookresearch/pytorch3d โญ 9,016
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
๐ pytorch3d.org -
arogozhnikov/einops โญ 8,694
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
๐ einops.rocks -
pyro-ppl/pyro โญ 8,644
Deep universal probabilistic programming with Python and PyTorch
๐ pyro.ai -
nvidia/apex โญ 8,523
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch -
bytedance/monolith โญ 8,515
A deep learning framework for large scale recommendation modeling with collisionless embedding and real time training captures. -
facebookresearch/ImageBind โญ 8,493
ImageBind: One Embedding Space to Bind Them All -
lucidrains/imagen-pytorch โญ 8,164
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch -
google/trax โญ 8,155
Trax: Deep Learning with Clear Code and Speed -
tencent/HunyuanVideo โญ 8,066
HunyuanVideo: A Systematic Framework For Large Video Generation Model
๐ aivideo.hunyuan.tencent.com -
xpixelgroup/BasicSR โญ 7,131
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2 and DFDNet.
๐ basicsr.readthedocs.io/en/latest -
google/flax โญ 6,304
Flax is a neural network library for JAX that is designed for flexibility.
๐ flax.readthedocs.io -
skorch-dev/skorch โญ 5,956
A scikit-learn compatible neural network library that wraps PyTorch -
facebookresearch/mmf โญ 5,525
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
๐ mmf.sh -
mosaicml/composer โญ 5,256
Supercharge Your Model Training
๐ docs.mosaicml.com -
deci-ai/super-gradients โญ 4,667
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
๐ www.supergradients.com -
nvidiagameworks/kaolin โญ 4,603
A PyTorch Library for Accelerating 3D Deep Learning Research -
facebookincubator/AITemplate โญ 4,591
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference. -
pytorch/ignite โญ 4,576
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
๐ pytorch-ignite.ai -
cvg/LightGlue โญ 3,575
LightGlue: Local Feature Matching at Light Speed (ICCV 2023) -
williamyang1991/VToonify โญ 3,559
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer -
google-research/scenic โญ 3,408
Scenic: A Jax Library for Computer Vision Research and Beyond -
facebookresearch/PyTorch-BigGraph โญ 3,395
Generate embeddings from large-scale graph-structured data.
๐ torchbiggraph.readthedocs.io -
pytorch/botorch โญ 3,166
Bayesian optimization in PyTorch
๐ botorch.org -
alpa-projects/alpa โญ 3,096
Training and serving large-scale neural networks with auto parallelization.
๐ alpa.ai -
deepmind/dm-haiku โญ 2,950
JAX-based neural network library
๐ dm-haiku.readthedocs.io -
explosion/thinc โญ 2,829
A refreshing functional take on deep learning, compatible with your favorite libraries
๐ thinc.ai -
nerdyrodent/VQGAN-CLIP โญ 2,632
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab. -
danielegrattarola/spektral โญ 2,375
Graph Neural Networks with Keras and Tensorflow 2.
๐ graphneural.network -
google-research/electra โญ 2,345
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators -
modelscope/ClearerVoice-Studio โญ 2,150
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc. -
fepegar/torchio โญ 2,121
Medical imaging processing for deep learning.
๐ torchio.org -
neuralmagic/sparseml โญ 2,098
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models -
pytorch/torchrec โญ 2,027
Pytorch domain library for recommendation systems
๐ pytorch.org/torchrec -
tensorflow/mesh โญ 1,600
Mesh TensorFlow: Model Parallelism Made Easier -
tensorly/tensorly โญ 1,592
TensorLy: Tensor Learning in Python.
๐ tensorly.org -
vt-vl-lab/FGVC โญ 1,557
[ECCV 2020] Flow-edge Guided Video Completion -
calculatedcontent/WeightWatcher โญ 1,533
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks -
jeshraghian/snntorch โญ 1,431
Deep and online learning with spiking neural networks in Python
๐ snntorch.readthedocs.io/en/latest -
hysts/pytorch_image_classification โญ 1,379
PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet -
xl0/lovely-tensors โญ 1,180
Tensors, for human consumption
๐ xl0.github.io/lovely-tensors -
deepmind/android_env โญ 1,043
RL research on Android devices. -
keras-team/keras-cv โญ 1,018
Industry-strength Computer Vision workflows with Keras -
tensorflow/similarity โญ 1,016
TensorFlow Similarity is a python package focused on making similarity learning quick and easy. -
kakaobrain/rq-vae-transformer โญ 828
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22) -
deepmind/chex โญ 814
Chex is a library of utilities for helping to write reliable JAX code
๐ chex.readthedocs.io -
mlfoundations/datacomp โญ 676
DataComp: In search of the next generation of multimodal datasets
๐ datacomp.ai -
whitead/dmol-book โญ 635
Deep learning for molecules and materials book
๐ dmol.pub -
allenai/reward-bench โญ 498
RewardBench is a benchmark designed to evaluate the capabilities and safety of reward models (including those trained with Direct Preference Optimization, DPO)
๐ huggingface.co/spaces/allenai/reward-bench
Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training.
-
slundberg/shap โญ 23,330
A game theoretic approach to explain the output of any machine learning model.
๐ shap.readthedocs.io -
marcotcr/lime โญ 11,733
Lime: Explaining the predictions of any machine learning classifier -
interpretml/interpret โญ 6,378
Fit interpretable models. Explain blackbox machine learning.
๐ interpret.ml/docs -
pytorch/captum โญ 5,062
Model interpretability and understanding for PyTorch
๐ captum.ai -
tensorflow/lucid โญ 4,685
A collection of infrastructure and tools for research in neural network interpretability. -
arize-ai/phoenix โญ 4,627
AI Observability & Evaluation
๐ docs.arize.com/phoenix -
pair-code/lit โญ 3,520
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
๐ pair-code.github.io/lit -
maif/shapash โญ 2,770
Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
๐ maif.github.io/shapash -
teamhg-memex/eli5 โญ 2,763
A library for debugging/inspecting machine learning classifiers and explaining their predictions
๐ eli5.readthedocs.io -
seldonio/alibi โญ 2,438
Algorithms for explaining machine learning models
๐ docs.seldon.io/projects/alibi/en/stable -
eleutherai/pythia โญ 2,355
Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers -
oegedijk/explainerdashboard โญ 2,349
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
๐ explainerdashboard.readthedocs.io -
jalammar/ecco โญ 2,005
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
๐ ecco.readthedocs.io -
transformerlensorg/TransformerLens โญ 1,819
A library for mechanistic interpretability of GPT-style language models
๐ transformerlensorg.github.io/transformerlens -
google-deepmind/penzai โญ 1,725
A JAX library for writing models as legible, functional pytree data structures, along with tools for visualizing, modifying, and analyzing them. Penzai focuses on making it easy to do stuff with models after they have been trained
๐ penzai.readthedocs.io -
trusted-ai/AIX360 โญ 1,654
Interpretability and explainability of data and machine learning models
๐ aix360.res.ibm.com -
cdpierse/transformers-interpret โญ 1,318
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. -
selfexplainml/PiML-Toolbox โญ 1,234
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
๐ selfexplainml.github.io/piml-toolbox -
ethicalml/xai โญ 1,154
XAI is a Machine Learning library that is designed with AI explainability at its core. XAI contains various tools that enable analysis and evaluation of data and models
๐ ethical.institute/principles.html#commitment-3 -
salesforce/OmniXAI โญ 896
OmniXAI: A Library for eXplainable AI -
andyzoujm/representation-engineering โญ 782
Representation Engineering: A Top-Down Approach to AI Transparency
๐ www.ai-transparency.org -
jbloomaus/SAELens โญ 605
Training Sparse Autoencoders on LLMs. Analyse sparse autoencoders and neural network internals.
๐ jbloomaus.github.io/saelens
MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.
-
apache/airflow โญ 38,571
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
๐ airflow.apache.org -
ray-project/ray โญ 35,169
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
๐ ray.io -
mlflow/mlflow โญ 19,391
Open source platform for the machine learning lifecycle
๐ mlflow.org -
prefecthq/prefect โญ 18,189
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
๐ prefect.io -
spotify/luigi โญ 18,062
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in. -
kestra-io/kestra โญ 15,729
Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
๐ kestra.io -
horovod/horovod โญ 14,360
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
๐ horovod.ai -
iterative/dvc โญ 14,134
Data Versioning and ML Experiments
๐ dvc.org -
dagster-io/dagster โญ 12,440
An orchestration platform for the development, production, and observation of data assets.
๐ dagster.io -
ludwig-ai/ludwig โญ 11,300
Low-code framework for building custom LLMs, neural networks, and other AI models
๐ ludwig.ai -
bentoml/OpenLLM โญ 10,503
Run any open-source LLM, such as Llama or Mistral, as an OpenAI-compatible API endpoint in the cloud.
๐ bentoml.com -
dbt-labs/dbt-core โญ 10,313
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
๐ getdbt.com -
great-expectations/great_expectations โญ 10,163
Always know what to expect from your data.
๐ docs.greatexpectations.io -
kedro-org/kedro โญ 10,141
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
๐ kedro.org -
huggingface/text-generation-inference โญ 9,694
A Rust, Python and gRPC server for text generation inference. Used in production at HuggingFace to power Hugging Chat, the Inference API and Inference Endpoint.
๐ hf.co/docs/text-generation-inference -
netflix/metaflow โญ 8,505
Open Source AI/ML Platform
๐ metaflow.org -
activeloopai/deeplake โญ 8,342
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow.
๐ activeloop.ai -
langfuse/langfuse โญ 8,258
Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. YC W23
๐ langfuse.com/docs -
mage-ai/mage-ai โญ 8,120
Build, run, and manage data pipelines for integrating and transforming data.
๐ www.mage.ai -
bentoml/BentoML โญ 7,322
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
๐ bentoml.com -
flyteorg/flyte โญ 5,974
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
๐ flyte.org -
allegroai/clearml โญ 5,799
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
๐ clear.ml/docs -
feast-dev/feast โญ 5,775
The Open Source Feature Store for Machine Learning
๐ feast.dev -
evidentlyai/evidently โญ 5,666
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
๐ discord.gg/xzjkranp8b -
internlm/lmdeploy โญ 5,417
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
๐ lmdeploy.readthedocs.io/en/latest -
adap/flower โญ 5,394
Flower: A Friendly Federated AI Framework
๐ flower.ai -
aimhubio/aim โญ 5,337
Aim: An easy-to-use & supercharged open-source experiment tracker.
๐ aimstack.io -
zenml-io/zenml โญ 4,377
ZenML: The bridge between ML and Ops.
๐ zenml.io -
internlm/xtuner โญ 4,188
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
๐ xtuner.readthedocs.io/zh-cn/latest -
orchest/orchest โญ 4,105
Build data pipelines, the easy way
๐ orchest.readthedocs.io/en/stable -
kubeflow/pipelines โญ 3,665
Machine Learning Pipelines for Kubeflow
๐ www.kubeflow.org/docs/components/pipelines -
polyaxon/polyaxon โญ 3,600
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
๐ polyaxon.com -
ploomber/ploomber โญ 3,537
The fastest way to build data pipelines. Develop iteratively, deploy anywhere.
๐ docs.ploomber.io -
towhee-io/towhee โญ 3,294
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
๐ towhee.io -
determined-ai/determined โญ 3,093
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
๐ determined.ai -
leptonai/leptonai โญ 2,681
A Pythonic framework to simplify AI service building
๐ lepton.ai -
azure/PyRIT โญ 2,161
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.
๐ azure.github.io/pyrit -
dagworks-inc/hamilton โญ 2,010
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
๐ hamilton.dagworks.io/en/latest -
meltano/meltano โญ 1,934
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
๐ meltano.com -
dstackai/dstack โญ 1,663
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel accelerators.
๐ dstack.ai/docs -
hi-primus/optimus โญ 1,492
Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
๐ hi-optimus.com -
dagworks-inc/burr โญ 1,486
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
๐ burr.dagworks.io -
kubeflow/examples โญ 1,425
A repository to host extended examples and tutorials
Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF
-
openai/gym โญ 35,275
A toolkit for developing and comparing reinforcement learning algorithms.
๐ www.gymlibrary.dev -
openai/baselines โญ 15,988
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms -
google/dopamine โญ 10,630
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
๐ github.com/google/dopamine -
thu-ml/tianshou โญ 8,155
An elegant PyTorch deep reinforcement learning library.
๐ tianshou.org -
farama-foundation/Gymnasium โญ 8,067
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
๐ gymnasium.farama.org -
lucidrains/PaLM-rlhf-pytorch โญ 7,746
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM -
tensorlayer/TensorLayer โญ 7,342
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
๐ tensorlayerx.com -
keras-rl/keras-rl โญ 5,538
Deep Reinforcement Learning for Keras.
๐ keras-rl.readthedocs.io -
deepmind/dm_control โญ 3,907
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo. -
ai4finance-foundation/ElegantRL โญ 3,835
Massively Parallel Deep Reinforcement Learning.
๐ ai4finance.org -
facebookresearch/ReAgent โญ 3,585
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
๐ reagent.ai -
deepmind/acme โญ 3,573
A library of reinforcement learning components and agents -
opendilab/DI-engine โญ 3,209
DI-engine is a generalized decision intelligence engine for PyTorch and JAX. It provides python-first and asynchronous-native task and middleware abstractions
๐ di-engine-docs.readthedocs.io -
eureka-research/Eureka โญ 2,885
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
๐ eureka-research.github.io -
pettingzoo-team/PettingZoo โญ 2,748
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
๐ pettingzoo.farama.org -
pytorch/rl โญ 2,509
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
๐ pytorch.org/rl -
kzl/decision-transformer โญ 2,467
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. -
anthropics/hh-rlhf โญ 1,675
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
๐ arxiv.org/abs/2204.05862 -
arise-initiative/robosuite โญ 1,454
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
๐ robosuite.ai -
humancompatibleai/imitation โญ 1,392
Clean PyTorch implementations of imitation and reward learning algorithms
๐ imitation.readthedocs.io -
denys88/rl_games โญ 1,000
RL Games: High performance RL library -
google-deepmind/meltingpot โญ 645
A suite of test scenarios for multi-agent reinforcement learning.
Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.
-
huggingface/transformers โญ 138,541
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
๐ huggingface.co/transformers -
pytorch/fairseq โญ 30,880
Facebook AI Research Sequence-to-Sequence Toolkit written in Python. -
explosion/spaCy โญ 30,803
Industrial-strength Natural Language Processing (NLP) in Python
๐ spacy.io -
myshell-ai/OpenVoice โญ 30,706
Instant voice cloning by MIT and MyShell. Audio foundation model.
๐ research.myshell.ai/open-voice -
microsoft/unilm โญ 20,669
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
๐ aka.ms/generalai -
vikparuchuri/marker โญ 20,263
Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.
๐ www.datalab.to -
huggingface/datasets โญ 19,544
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
๐ huggingface.co/docs/datasets -
vikparuchuri/surya โญ 16,037
OCR, layout analysis, reading order, table recognition in 90+ languages
๐ www.datalab.to -
ukplab/sentence-transformers โญ 15,895
State-of-the-Art Text Embeddings
๐ www.sbert.net -
rare-technologies/gensim โญ 15,821
Topic Modelling for Humans
๐ radimrehurek.com/gensim -
gunthercox/ChatterBot โญ 14,178
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
๐ chatterbot.readthedocs.io -
flairnlp/flair โญ 14,042
A very simple framework for state-of-the-art Natural Language Processing (NLP)
๐ flairnlp.github.io/flair -
nltk/nltk โญ 13,817
NLTK Source
๐ www.nltk.org -
m-bain/whisperX โญ 13,665
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) -
openai/tiktoken โญ 13,231
tiktoken is a fast BPE tokeniser for use with OpenAI's models. -
nvidia/NeMo โญ 13,015
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
๐ docs.nvidia.com/nemo-framework/user-guide/latest/overview.html -
jina-ai/clip-as-service โญ 12,550
Scalable embedding, reasoning, ranking for images and sentences with CLIP
๐ clip-as-service.jina.ai -
allenai/allennlp โญ 11,791
An open-source NLP research library, built on PyTorch.
๐ www.allennlp.org -
facebookresearch/seamless_communication โญ 11,268
Foundational Models for State-of-the-Art Speech and Text Translation -
google/sentencepiece โญ 10,544
Unsupervised text tokenizer for Neural Network-based text generation. -
facebookresearch/ParlAI โญ 10,497
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
๐ parl.ai -
neuml/txtai โญ 10,238
All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
๐ neuml.github.io/txtai -
doccano/doccano โญ 9,737
Open source annotation tool for machine learning practitioners. -
speechbrain/speechbrain โญ 9,302
A PyTorch-based Speech Toolkit
๐ speechbrain.github.io -
sloria/TextBlob โญ 9,235
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
๐ textblob.readthedocs.io -
facebookresearch/nougat โญ 9,190
Implementation of Nougat: Neural Optical Understanding for Academic Documents
๐ facebookresearch.github.io/nougat -
togethercomputer/OpenChatKit โญ 9,024
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots -
clips/pattern โญ 8,763
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
๐ github.com/clips/pattern/wiki -
espnet/espnet โญ 8,741
End-to-End Speech Processing Toolkit
๐ espnet.github.io/espnet -
deeppavlov/DeepPavlov โญ 6,777
An open source library for deep learning end-to-end dialog systems and chatbots.
๐ deeppavlov.ai -
facebookresearch/metaseq โญ 6,519
A codebase for working with Open Pre-trained Transformers, originally forked from fairseq. -
maartengr/BERTopic โญ 6,368
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
๐ maartengr.github.io/bertopic -
kingoflolz/mesh-transformer-jax โญ 6,322
Model parallel transformers in JAX and Haiku -
aiwaves-cn/agents โญ 5,434
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents -
quivrhq/MegaParse โญ 5,226
File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
๐ megaparse.com -
layout-parser/layout-parser โญ 5,056
A Unified Toolkit for Deep Learning Based Document Image Analysis
๐ layout-parser.github.io -
salesforce/CodeGen โญ 4,994
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. -
minimaxir/textgenrnn โญ 4,936
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code. -
makcedward/nlpaug โญ 4,502
Data augmentation for NLP
๐ makcedward.github.io -
facebookresearch/DrQA โญ 4,481
Reading Wikipedia to Answer Open-Domain Questions -
argilla-io/argilla โญ 4,283
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
๐ docs.argilla.io -
thilinarajapakse/simpletransformers โญ 4,143
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
๐ simpletransformers.ai -
maartengr/KeyBERT โญ 3,682
Minimal keyword extraction with BERT
๐ maartengr.github.io/keybert -
life4/textdistance โญ 3,436
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. -
promptslab/Promptify โญ 3,387
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
๐ discord.gg/m88xfymbk6 -
jsvine/markovify โญ 3,320
A simple, extensible Markov chain generator. -
bytedance/lightseq โญ 3,244
LightSeq: A High Performance Library for Sequence Processing and Generation -
errbotio/errbot โญ 3,156
Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools and some fun into the conversation.
๐ errbot.io -
neuralmagic/deepsparse โญ 3,091
Sparsity-aware deep learning inference runtime for CPUs
๐ neuralmagic.com/deepsparse -
huawei-noah/Pretrained-Language-Model โญ 3,054
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab. -
ddangelov/Top2Vec โญ 2,986
Top2Vec learns jointly embedded topic, document and word vectors. -
jbesomi/texthero โญ 2,897
Text preprocessing, representation and visualization from zero to hero.
๐ texthero.org -
salesforce/CodeT5 โญ 2,876
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
๐ arxiv.org/abs/2305.07922 -
huggingface/neuralcoref โญ 2,865
✨ Fast Coreference Resolution in spaCy with Neural Networks
๐ huggingface.co/coref -
bigscience-workshop/promptsource โญ 2,757
Toolkit for creating, sharing and using natural language prompts. -
bhavnicksm/chonkie โญ 2,388
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
๐ docs.chonkie.ai -
nvidia/nv-ingest โญ 2,383
NVIDIA-Ingest is a scalable, performance-oriented document content and metadata extraction microservice. -
huggingface/setfit โญ 2,335
SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers.
๐ hf.co/docs/setfit -
alibaba/EasyNLP โญ 2,094
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit -
jamesturk/jellyfish โญ 2,086
🪼 a python library for doing approximate and phonetic matching of strings.
๐ jamesturk.github.io/jellyfish -
thudm/P-tuning-v2 โญ 2,002
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks -
marella/ctransformers โญ 1,835
Python bindings for the Transformer models implemented in C/C++ using GGML library. -
featureform/featureform โญ 1,832
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
๐ www.featureform.com -
deepset-ai/FARM โญ 1,750
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
๐ farm.deepset.ai -
urchade/GLiNER โญ 1,717
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
๐ arxiv.org/abs/2311.08526 -
franck-dernoncourt/NeuroNER โญ 1,703
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
๐ neuroner.com -
explosion/spacy-models โญ 1,689
💫 Models for the spaCy Natural Language Processing (NLP) library
๐ spacy.io -
google-research/language โญ 1,642
Shared repository for open-sourced projects from the Google AI Language team.
๐ ai.google/research/teams/language -
plasticityai/magnitude โญ 1,638
A fast, efficient universal vector embedding utility package. -
arxiv-vanity/arxiv-vanity โญ 1,613
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
๐ www.arxiv-vanity.com -
chrismattmann/tika-python โญ 1,539
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. -
nomic-ai/nomic โญ 1,460
Interact, analyze and structure massive text, image, embedding, audio and video datasets
๐ atlas.nomic.ai -
intellabs/fastRAG โญ 1,438
Efficient Retrieval Augmentation and Generation Framework -
dmmiller612/bert-extractive-summarizer โญ 1,417
Easy to use extractive text summarization with BERT -
gunthercox/chatterbot-corpus โญ 1,378
A multilingual dialog corpus
๐ chatterbot-corpus.readthedocs.io -
jonasgeiping/cramming โญ 1,311
Cramming the training of a (BERT-type) language model into limited compute. -
pemistahl/lingua-py โญ 1,225
The most accurate natural language detection library for Python, suitable for short text and mixed-language text -
openai/grade-school-math โญ 1,169
GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems -
answerdotai/ModernBERT โญ 1,139
Bringing BERT into modernity via both architecture changes and scaling
๐ arxiv.org/abs/2412.13663 -
abertsch72/unlimiformer โญ 1,059
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" -
unitaryai/detoxify โญ 997
Toxic Comment Classification with Pytorch Lightning and Transformers
๐ www.unitary.ai -
norskregnesentral/skweak โญ 923
skweak: A software toolkit for weak supervision applied to NLP tasks -
keras-team/keras-hub โญ 834
Pretrained model hub for Keras 3.
๐ keras.io/keras_hub -
explosion/spacy-streamlit โญ 825
spaCy building blocks and visualizers for Streamlit apps
๐ share.streamlit.io/ines/spacy-streamlit-demo/master/app.py -
paddlepaddle/RocketQA โญ 774
RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models. -
webis-de/small-text โญ 605
Small-Text provides state-of-the-art Active Learning for Text Classification. Several pre-implemented Query Strategies, Initialization Strategies, and Stopping Criteria are provided, which can be easily mixed and matched to build active learning experiments or applications.
๐ small-text.readthedocs.io -
babelscape/rebel โญ 511
REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).
Python packaging, dependency management and bundling.
-
pyenv/pyenv โญ 40,558
pyenv lets you easily switch between multiple versions of Python. -
astral-sh/uv โญ 38,223
An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
๐ docs.astral.sh/uv -
python-poetry/poetry โญ 32,487
Python packaging and dependency management made easy
๐ python-poetry.org -
pypa/pipenv โญ 24,970
A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
๐ pipenv.pypa.io -
mitsuhiko/rye โญ 14,000
a Hassle-Free Python Experience
๐ rye.astral.sh -
pyinstaller/pyinstaller โญ 12,108
Freeze (package) Python programs into stand-alone executables
๐ www.pyinstaller.org -
pypa/pipx โญ 11,056
Install and Run Python Applications in Isolated Environments
๐ pipx.pypa.io -
pdm-project/pdm โญ 8,104
A modern Python package and dependency manager supporting the latest PEP standards
๐ pdm-project.org -
jazzband/pip-tools โญ 7,830
A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
๐ pip-tools.rtfd.io -
mamba-org/mamba โญ 7,107
The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
๐ mamba.readthedocs.io -
conda-forge/miniforge โญ 6,985
A conda-forge distribution.
๐ conda-forge.org/download -
conda/conda โญ 6,589
A system-level, binary package and environment manager running on all major operating systems and platforms.
๐ docs.conda.io/projects/conda -
pypa/hatch โญ 6,317
Modern, extensible Python project management
๐ hatch.pypa.io/latest -
indygreg/PyOxidizer โญ 5,620
A modern Python application packaging and distribution tool -
pypa/virtualenv โญ 4,864
A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
๐ virtualenv.pypa.io -
spack/spack โญ 4,529
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
๐ spack.io -
prefix-dev/pixi โญ 3,808
pixi is a cross-platform, multi-language package manager and workflow tool built on the foundation of the conda ecosystem.
๐ pixi.sh -
pantsbuild/pex โญ 3,634
A tool for generating .pex (Python EXecutable) files, lock files and venvs.
๐ docs.pex-tool.org -
beeware/briefcase โญ 2,754
Tools to support converting a Python project into a standalone native application.
๐ briefcase.readthedocs.io -
pypa/flit โญ 2,190
Simplified packaging of Python modules
๐ flit.pypa.io -
linkedin/shiv โญ 1,794
shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included. -
marcelotduarte/cx_Freeze โญ 1,399
cx_Freeze creates standalone executables from Python scripts with the same performance; it is cross-platform and should work on any platform that Python itself works on.
๐ marcelotduarte.github.io/cx_freeze -
ofek/pyapp โญ 1,323
Runtime installer for Python applications
๐ ofek.dev/pyapp -
pypa/gh-action-pypi-publish โญ 989
The blessed GitHub Action, for publishing your 📦 distribution files to PyPI, the tokenless way: https://github.com/marketplace/actions/pypi-publish
๐ packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows -
py2exe/py2exe โญ 899
Create standalone Windows programs from Python code
๐ www.py2exe.org -
prefix-dev/rip โญ 656
RIP is a library that allows the resolving and installing of Python PyPI packages from Rust into a virtual environment. It's based on our experience with building Rattler and aims to provide the same experience but for PyPI instead of Conda.
๐ prefix.dev -
snok/install-poetry โญ 605
Github action for installing and configuring Poetry -
python-poetry/install.python-poetry.org โญ 218
The official Poetry installation script
๐ install.python-poetry.org
Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.
-
pandas-dev/pandas โญ 44,478
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
๐ pandas.pydata.org -
pola-rs/polars โญ 31,696
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
๐ docs.pola.rs -
duckdb/duckdb โญ 26,205
DuckDB is an analytical in-process SQL database management system
๐ www.duckdb.org -
gventuri/pandas-ai โญ 14,106
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
๐ getpanda.ai -
kanaries/pygwalker โญ 14,010
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
๐ kanaries.net/pygwalker -
ydataai/ydata-profiling โญ 12,686
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
๐ docs.profiling.ydata.ai -
rapidsai/cudf โญ 8,641
cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data
๐ docs.rapids.ai/api/cudf/stable -
aws/aws-sdk-pandas โญ 3,974
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
๐ aws-sdk-pandas.readthedocs.io -
nalepae/pandarallel โญ 3,716
A simple and efficient tool to parallelize Pandas operations on all available CPUs
๐ nalepae.github.io/pandarallel -
unionai-oss/pandera โญ 3,597
A light-weight, flexible, and expressive statistical data testing library
๐ www.union.ai/pandera -
adamerose/PandasGUI โญ 3,213
A GUI for Pandas DataFrames -
blaze/blaze โญ 3,191
NumPy and Pandas interface to Big Data
๐ blaze.pydata.org -
pydata/pandas-datareader โญ 3,000
Extract data from a wide range of Internet sources into a pandas DataFrame.
๐ pydata.github.io/pandas-datareader/stable/index.html -
scikit-learn-contrib/sklearn-pandas โญ 2,820
Pandas integration with sklearn -
jmcarpenter2/swifter โญ 2,564
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner -
delta-io/delta-rs โญ 2,538
A native Rust library for Delta Lake, with bindings into Python
๐ delta-io.github.io/delta-rs -
eventual-inc/Daft โญ 2,511
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
๐ getdaft.io -
fugue-project/fugue โญ 2,037
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
๐ fugue-tutorials.readthedocs.io -
pyjanitor-devs/pyjanitor โญ 1,390
Clean APIs for data cleaning. Python implementation of R package Janitor
๐ pyjanitor-devs.github.io/pyjanitor -
machow/siuba โญ 1,167
Python library for using dplyr-like syntax with pandas and SQL
๐ siuba.org -
holoviz/hvplot โญ 1,162
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
๐ hvplot.holoviz.org -
renumics/spotlight โญ 1,145
Interactively explore unstructured datasets from your dataframe.
๐ renumics.com -
tkrabel/bamboolib โญ 942
bamboolib - a GUI for pandas DataFrames
๐ bamboolib.com -
mwouts/itables โญ 824
This package changes how Pandas and Polars DataFrames are rendered in Jupyter Notebooks. With itables you can display your tables as interactive DataTables that you can sort, paginate, scroll or filter.
๐ mwouts.github.io/itables
Performance, parallelisation and low level libraries.
-
celery/celery โญ 25,442
Distributed Task Queue (development branch)
๐ docs.celeryq.dev -
google/flatbuffers โญ 23,718
FlatBuffers: Memory Efficient Serialization Library
๐ flatbuffers.dev -
pybind/pybind11 โญ 16,118
Seamless operability between C++11 and Python
๐ pybind11.readthedocs.io -
exaloop/codon โญ 15,348
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
๐ docs.exaloop.io/codon -
dask/dask โญ 12,905
Parallel computing with task scheduling
๐ dask.org -
numba/numba โญ 10,171
NumPy aware dynamic Python compiler using LLVM
๐ numba.pydata.org -
modin-project/modin โญ 10,002
Modin: Scale your Pandas workflows by changing a single line of code
๐ modin.readthedocs.io -
nebuly-ai/optimate โญ 8,372
A collection of libraries to optimise AI model performances
๐ www.nebuly.com -
vaexio/vaex โญ 8,332
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
๐ vaex.io -
mher/flower โญ 6,580
Real-time monitor and web admin for Celery distributed task queue
๐ flower.readthedocs.io -
python-trio/trio โญ 6,337
Trio: a friendly Python library for async concurrency and I/O
๐ trio.readthedocs.io -
ultrajson/ultrajson โญ 4,364
Ultra fast JSON decoder and encoder written in C with Python bindings
๐ pypi.org/project/ujson -
tlkh/asitop โญ 3,787
Perf monitoring CLI tool for Apple Silicon
๐ tlkh.github.io/asitop -
facebookincubator/cinder โญ 3,561
Cinder is Meta's internal performance-oriented production version of CPython.
๐ trycinder.com -
airtai/faststream โญ 3,419
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
๐ faststream.airt.ai/latest -
ipython/ipyparallel โญ 2,603
IPython Parallel: Interactive Parallel Computing in Python
๐ ipyparallel.readthedocs.io -
intel/intel-extension-for-transformers โญ 2,157
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms ⚡ -
h5py/h5py โญ 2,108
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
๐ www.h5py.org -
agronholm/anyio โญ 1,905
High level asynchronous concurrency and networking framework that works on top of either trio or asyncio -
tiangolo/asyncer โญ 1,787
Asyncer, async and await, focused on developer experience.
๐ asyncer.tiangolo.com -
intel/intel-extension-for-pytorch โญ 1,721
A Python package that extends the official PyTorch to easily improve performance on Intel platforms -
faster-cpython/ideas โญ 1,705
Discussion and work tracker for Faster CPython project. -
dask/distributed โญ 1,595
A distributed task scheduler for Dask
๐ distributed.dask.org -
nschloe/perfplot โญ 1,357
Performance analysis for Python snippets -
intel/scikit-learn-intelex โญ 1,247
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
๐ uxlfoundation.github.io/scikit-learn-intelex -
markshannon/faster-cpython โญ 945
How to make CPython faster. -
zerointensity/pointers.py โญ 920
Bringing the hell of pointers to Python.
๐ pointers.zintensity.dev -
brandtbucher/specialist โญ 647
Visualize CPython's specializing, adaptive interpreter. 🔥
Memory and CPU/GPU profiling tools and libraries.
-
bloomberg/memray โญ 13,638
Memray is a memory profiler for Python
๐ bloomberg.github.io/memray -
benfred/py-spy โญ 13,201
Sampling profiler for Python programs -
plasma-umass/scalene โญ 12,399
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals -
joerick/pyinstrument โญ 6,833
🚴 Call stack profiler for Python. Shows you why your code is slow!
๐ pyinstrument.readthedocs.io -
gaogaotiantian/viztracer โญ 5,926
A debugging and profiling tool that can trace and visualize python code execution
๐ viztracer.readthedocs.io -
pythonprofilers/memory_profiler โญ 4,433
Monitor Memory usage of Python code
๐ pypi.python.org/pypi/memory_profiler -
reloadware/reloadium โญ 2,845
Hot Reloading and Profiling for Python -
pyutils/line_profiler โญ 2,830
Line-by-line profiling for Python -
jiffyclub/snakeviz โญ 2,385
An in-browser Python profile viewer
๐ jiffyclub.github.io/snakeviz -
p403n1x87/austin โญ 1,980
Python frame stack sampler for CPython
๐ pypi.org/project/austin-dist -
pythonspeed/filprofiler โญ 857
A Python memory profiler for data processing and scientific computing applications
๐ pythonspeed.com/products/filmemoryprofiler
Security related libraries: vulnerability discovery, SQL injection, environment auditing.
-
swisskyrepo/PayloadsAllTheThings โญ 62,972
A list of useful payloads and bypasses for Web Application Security and Pentest/CTF
๐ swisskyrepo.github.io/payloadsallthethings -
sqlmapproject/sqlmap โญ 33,211
Automatic SQL injection and database takeover tool
๐ sqlmap.org -
certbot/certbot โญ 31,871
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol. -
aquasecurity/trivy โญ 24,542
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
๐ trivy.dev -
bridgecrewio/checkov โญ 7,339
Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
๐ www.checkov.io -
nccgroup/ScoutSuite โญ 6,910
Multi-Cloud Security Auditing Tool -
stamparm/maltrail โญ 6,723
Malicious traffic detection system -
pycqa/bandit โญ 6,703
Bandit is a tool designed to find common security issues in Python code.
๐ bandit.readthedocs.io -
rhinosecuritylabs/pacu โญ 4,510
The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
๐ rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework -
dashingsoft/pyarmor โญ 3,904
A tool used to obfuscate Python scripts, bind obfuscated scripts to a fixed machine, or expire obfuscated scripts.
๐ pyarmor.dashingsoft.com -
pyupio/safety โญ 1,789
Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
๐ safetycli.com/product/safety-cli -
trailofbits/pip-audit โญ 1,005
Audits Python environments, requirements files and dependency trees for known security vulnerabilities, and can automatically fix them
๐ pypi.org/project/pip-audit -
fadi002/de4py โญ 877
toolkit for python reverse engineering
๐ de4py.000.pe -
thecyb3ralpha/BobTheSmuggler โญ 506
A tool that leverages HTML Smuggling Attack and allows you to create HTML files with embedded 7z/zip archives.
Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover.
-
atsushisakai/PythonRobotics โญ 24,151
Python sample codes and textbook for robotics algorithms.
๐ atsushisakai.github.io/pythonrobotics -
genesis-embodied-ai/Genesis โญ 23,535
Genesis is a physics platform, and generative data engine, designed for general purpose Robotics/Embodied AI/Physical AI applications -
bulletphysics/bullet3 โญ 12,950
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
๐ bulletphysics.org -
isl-org/Open3D โญ 11,861
Open3D: A Modern Library for 3D Data Processing
๐ www.open3d.org -
dlr-rm/stable-baselines3 โญ 9,673
Stable Baselines3 (SB3) is a set of reliable implementations of reinforcement learning algorithms in PyTorch
๐ stable-baselines3.readthedocs.io -
nvidia/Cosmos โญ 7,364
NVIDIA Cosmos is a developer-first world foundation model platform designed to help Physical AI developers build their Physical AI systems better and faster. -
qiskit/qiskit โญ 5,620
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
๐ www.ibm.com/quantum/qiskit -
astropy/astropy โญ 4,568
Astronomy and astrophysics core library
๐ www.astropy.org -
nvidia/warp โญ 4,504
A Python framework for high performance GPU simulation and graphics
๐ nvidia.github.io/warp -
quantumlib/Cirq โญ 4,429
A Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.
๐ quantumai.google/cirq -
chakazul/Lenia โญ 3,573
Lenia is a 2D cellular automaton with continuous space, time and states. It produces a huge variety of interesting mathematical life forms
๐ chakazul.github.io/lenia/javascript/lenia.html -
openai/mujoco-py โญ 2,918
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3. -
rdkit/rdkit โญ 2,772
The official sources for the RDKit library -
projectmesa/mesa โญ 2,762
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
๐ mesa.readthedocs.io -
nvidia-omniverse/IsaacLab โญ 2,712
Unified framework for robot learning built on NVIDIA Isaac Sim
๐ isaac-sim.github.io/isaaclab -
taichi-dev/difftaichi โญ 2,538
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020) -
google/brax โญ 2,507
Massively parallel rigidbody physics simulation on accelerator hardware. -
dlr-rm/rl-baselines3-zoo โญ 2,226
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
๐ rl-baselines3-zoo.readthedocs.io -
nvidia-omniverse/IsaacGymEnvs โญ 2,191
Example RL environments for the NVIDIA Isaac Gym high performance environments -
facebookresearch/habitat-lab โญ 2,099
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
๐ aihabitat.org -
quantecon/QuantEcon.py โญ 2,037
A community based Python library for quantitative economics
๐ quantecon.org/quantecon-py -
microsoft/PromptCraft-Robotics โญ 1,947
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
๐ aka.ms/chatgpt-robotics -
eloialonso/diamond โญ 1,715
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model
๐ diamond-wm.github.io -
deepmodeling/deepmd-kit โญ 1,568
A deep learning package for many-body potential energy representation and molecular dynamics
๐ docs.deepmodeling.com/projects/deepmd -
sail-sg/envpool โญ 1,120
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
๐ envpool.readthedocs.io -
bowang-lab/scGPT โญ 1,109
scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
๐ scgpt.readthedocs.io/en/latest -
a-r-j/graphein โญ 1,058
Protein Graph Library
๐ graphein.ai -
viblo/pymunk โญ 955
Pymunk is an easy-to-use pythonic 2d physics library that can be used whenever you need 2d rigid body physics from Python
๐ www.pymunk.org -
google-deepmind/materials_discovery โญ 931
Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science. -
nvidia-omniverse/OmniIsaacGymEnvs โญ 919
Reinforcement Learning Environments for Omniverse Isaac Gym -
altera-al/project-sid โญ 908
This repository contains our technical report: "Project Sid: Many-agent simulations toward AI civilization" -
google/evojax โญ 867
EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library -
facebookresearch/fairo โญ 860
A modular embodied agent architecture and platform for building embodied agents -
eureka-research/DrEureka โญ 838
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
๐ eureka-research.github.io/dr-eureka -
polymathicai/the_well โญ 738
15TB of Physics Simulations: collection of machine learning datasets containing numerical simulations of a wide variety of spatiotemporal physical systems.
๐ polymathic-ai.org/the_well -
ur-whitelab/chemcrow-public โญ 680
Chemcrow -
ur-whitelab/chemcrow-runs โญ 77
ur-whitelab/chemcrow-runs
Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.
-
thealgorithms/Python โญ 196,978
All Algorithms implemented in Python
๐ thealgorithms.github.io/python -
microsoft/generative-ai-for-beginners โญ 70,017
21 Lessons, Get Started Building with Generative AI https://microsoft.github.io/generative-ai-for-beginners/
๐ microsoft.github.io/generative-ai-for-beginners -
mlabonne/llm-course โญ 45,322
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
๐ mlabonne.github.io/blog -
jakevdp/PythonDataScienceHandbook โญ 43,768
Python Data Science Handbook: full text in Jupyter Notebooks
๐ jakevdp.github.io/pythondatasciencehandbook -
rasbt/LLMs-from-scratch โญ 39,139
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
๐ amzn.to/4fqvn0d -
realpython/python-guide โญ 28,607
Python best practices guidebook, written for humans.
๐ docs.python-guide.org -
d2l-ai/d2l-en โญ 24,823
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
๐ d2l.ai -
christoschristofidis/awesome-deep-learning โญ 24,733
A curated list of awesome Deep Learning tutorials, projects and communities. -
wesm/pydata-book โญ 22,643
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media -
hannibal046/Awesome-LLM โญ 21,185
Awesome-LLM: a curated list of Large Language Models -
microsoft/recommenders โญ 19,700
Best Practices on Recommendation Systems
๐ recommenders-team.github.io/recommenders/intro.html -
fchollet/deep-learning-with-python-notebooks โญ 18,946
Jupyter notebooks for the code samples of the book "Deep Learning with Python" -
graykode/nlp-tutorial โญ 14,401
Natural Language Processing Tutorial for Deep Learning Researchers
๐ www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying -
naklecha/llama3-from-scratch โญ 14,085
llama3 implementation one matrix multiplication at a time -
shangtongzhang/reinforcement-learning-an-introduction โญ 13,790
Python Implementation of Reinforcement Learning: An Introduction -
karpathy/nn-zero-to-hero โญ 13,092
Neural Networks: Zero to Hero -
mrdbourke/pytorch-deep-learning โญ 12,210
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
๐ learnpytorch.io -
eugeneyan/open-llms โญ 11,606
A list of open LLMs available for commercial use. -
karpathy/micrograd โญ 11,061
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API -
rucaibox/LLMSurvey โญ 10,910
The official GitHub page for the survey paper "A Survey of Large Language Models".
๐ arxiv.org/abs/2303.18223 -
openai/spinningup โญ 10,458
An educational resource to help anyone learn deep reinforcement learning.
๐ spinningup.openai.com -
srush/GPU-Puzzles โญ 10,441
Teaching beginner GPU programming in a completely interactive fashion -
zhanymkanov/fastapi-best-practices โญ 10,231
FastAPI Best Practices and Conventions we used at our startup -
nielsrogge/Transformers-Tutorials โญ 9,880
This repository contains demos I made with the Transformers library by HuggingFace. -
mooler0410/LLMsPracticalGuide โญ 9,674
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
๐ arxiv.org/abs/2304.13712v2 -
firmai/industry-machine-learning โญ 7,293
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
๐ www.sov.ai -
udlbook/udlbook โญ 6,961
Understanding Deep Learning - Simon J.D. Prince -
gkamradt/langchain-tutorials โญ 6,895
Overview and tutorial of the LangChain Library -
roboflow/notebooks โญ 6,730
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.
๐ roboflow.com/models -
neetcode-gh/leetcode โญ 5,879
Leetcode solutions for NeetCode.io -
mrdbourke/tensorflow-deep-learning โญ 5,421
All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
๐ dbourke.link/ztmtfcourse -
alirezadir/Machine-Learning-Interviews โญ 5,389
This repo is meant to serve as a guide for Machine Learning/AI technical interviews. -
udacity/deep-learning-v2-pytorch โญ 5,357
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101 -
huggingface/smol-course โญ 5,233
a practical course on aligning language models for your specific use case. It's a handy way to get started with aligning language models, because everything runs on most local machines. -
timofurrer/awesome-asyncio โญ 4,701
A curated list of awesome Python asyncio frameworks, libraries, software and resources -
zotroneneis/machine_learning_basics โญ 4,343
Plain python implementations of basic machine learning algorithms -
handsonllm/Hands-On-Large-Language-Models โญ 4,311
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
๐ www.llm-book.com -
promptslab/Awesome-Prompt-Engineering โญ 4,145
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
๐ discord.gg/m88xfymbk6 -
huggingface/deep-rl-class โญ 4,027
This repo contains the Hugging Face Deep Reinforcement Learning Course. -
rasbt/machine-learning-book โญ 3,906
Code Repository for Machine Learning with PyTorch and Scikit-Learn
๐ sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn -
huggingface/diffusion-models-class โญ 3,855
Materials for the Hugging Face Diffusion Models Course -
cosmicpython/book โญ 3,472
A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
๐ www.cosmicpython.com -
amanchadha/coursera-deep-learning-specialization โญ 3,445
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv... -
fluentpython/example-code-2e โญ 3,411
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
๐ amzn.to/3j48u2j -
mrdbourke/zero-to-mastery-ml โญ 3,074
All course materials for the Zero to Mastery Machine Learning and Data Science course.
๐ dbourke.link/ztmmlcourse -
krzjoa/awesome-python-data-science โญ 2,693
Probably the best curated list of data science software in Python.
๐ krzjoa.github.io/awesome-python-data-science -
chiphuyen/aie-book โญ 2,269
Code for AI Engineering: Building Applications with Foundation Models (Chip Huyen 2025) -
gerdm/prml โญ 2,236
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop -
cgpotts/cs224u โญ 2,133
Code for CS224u: Natural Language Understanding -
cerlymarco/MEDIUM_NoteBook โญ 2,096
Repository containing notebooks of my posts on Medium -
trananhkma/fucking-awesome-python โญ 1,988
awesome-python with ⭐ and 🍴
-
huggingface/cookbook โญ 1,816
Community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models.
๐ huggingface.co/learn/cookbook -
chandlerbang/awesome-self-supervised-gnn โญ 1,636
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN). -
atcold/NYU-DLSP21 โญ 1,593
NYU Deep Learning Spring 2021
๐ atcold.github.io/nyu-dlsp21 -
patrickloeber/MLfromscratch โญ 1,383
Machine Learning algorithm implementations from scratch. -
davidadsp/Generative_Deep_Learning_2nd_Edition โญ 1,198
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
๐ www.oreilly.com/library/view/generative-deep-learning/9781098134174 -
rasbt/LLM-workshop-2024 โญ 859
A 4-hour coding workshop to understand how LLMs are implemented and used -
jackhidary/quantumcomputingbook โญ 822
Companion site for the textbook Quantum Computing: An Applied Approach -
bayesianmodelingandcomputationinpython/BookCode_Edition1 โญ 510
Bayesian Modeling and Computation in Python: open-access version of the text and the code examples in the book
๐ www.bayesiancomputationbook.com -
dylanhogg/awesome-python โญ 348
Hand-picked awesome Python libraries and frameworks, organised by category
๐ www.awesomepython.org
Template tools and libraries: cookiecutter repos, generators, quick-starts.
-
tiangolo/full-stack-fastapi-template โญ 29,474
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more. -
cookiecutter/cookiecutter โญ 22,999
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
๐ pypi.org/project/cookiecutter -
drivendata/cookiecutter-data-science โญ 8,538
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
๐ cookiecutter-data-science.drivendata.org -
buuntu/fastapi-react โญ 2,296
Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker -
pyscaffold/pyscaffold โญ 2,151
Python project template generator with batteries included
๐ pyscaffold.org -
cjolowicz/cookiecutter-hypermodern-python โญ 1,838
Cookiecutter template for a Python package based on the Hypermodern Python article series.
๐ cookiecutter-hypermodern-python.readthedocs.io -
fmind/mlops-python-package โญ 1,103
Best practices designed to support your MLOps initiatives. You can use this package as part of your MLOps toolkit or platform e.g. Model Registry, Experiment Tracking, Realtime Inference
๐ fmind.github.io/mlops-python-package -
tezromach/python-package-template โญ 1,085
Your next Python package needs a bleeding-edge project structure. -
martinheinz/python-project-blueprint โญ 963
Blueprint/Boilerplate For Python Projects -
callmesora/llmops-python-package โญ 810
Best practices designed to support your LLMOps initiatives. You can use this package as part of your LLMOps toolkit or platform e.g. Model Registry, Experiment Tracking, Realtime Inference
Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.
-
willmcgugan/rich โญ 50,556
Rich is a Python library for rich text and beautiful formatting in the terminal.
๐ rich.readthedocs.io/en/latest -
tqdm/tqdm โญ 29,182
⚡ A Fast, Extensible Progress Bar for Python and CLI
๐ tqdm.github.io -
google/python-fire โญ 27,374
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. -
willmcgugan/textual โญ 27,183
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
๐ textual.textualize.io -
tiangolo/typer โญ 16,328
Typer, build great CLIs. Easy to code. Based on Python type hints.
๐ typer.tiangolo.com -
pallets/click โญ 16,022
Python composable command line interface toolkit
๐ click.palletsprojects.com -
saulpw/visidata โญ 8,029
A terminal spreadsheet multitool for discovering and arranging data
๐ visidata.org -
tconbeer/harlequin โญ 4,119
The SQL IDE for Your Terminal.
๐ harlequin.sh -
manrajgrover/halo โญ 2,913
💫 Beautiful spinners for terminal, IPython and Jupyter -
urwid/urwid โญ 2,850
Console user interface library for Python (official repo)
๐ urwid.org -
textualize/trogon โญ 2,556
Easily turn your Click CLI into a powerful terminal application -
darrenburns/elia โญ 1,985
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. -
tmbo/questionary โญ 1,639
Python library to build pretty command line user prompts ✨ Easy to use multi-select lists, confirmations, free text prompts ... -
jazzband/prettytable โญ 1,433
Display tabular data in a visually appealing ASCII table format
๐ pypi.org/project/prettytable -
1j01/textual-paint โญ 983
🎨 MS Paint in your terminal.
๐ pypi.org/project/textual-paint
Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.
-
mitmproxy/mitmproxy โญ 37,773
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
๐ mitmproxy.org -
locustio/locust โญ 25,520
Write scalable load tests in plain Python 🚗💨
๐ locust.cloud -
pytest-dev/pytest โญ 12,380
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
๐ pytest.org -
microsoft/playwright-python โญ 12,288
Python version of the Playwright testing and automation library.
๐ playwright.dev/python -
robotframework/robotframework โญ 10,219
Generic automation framework for acceptance testing and RPA
๐ robotframework.org -
seleniumbase/SeleniumBase โญ 9,253
Python APIs for web automation, testing, and bypassing bot-detection.
๐ seleniumbase.io -
getmoto/moto โญ 7,757
A library that allows you to easily mock out tests based on AWS infrastructure.
๐ docs.getmoto.org/en/latest -
hypothesisworks/hypothesis โญ 7,689
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
๐ hypothesis.works -
newsapps/beeswithmachineguns โญ 6,461
A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
๐ apps.chicagotribune.com -
codium-ai/qodo-cover โญ 4,770
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
๐ qodo.ai -
confident-ai/deepeval โญ 4,657
The LLM Evaluation Framework
๐ docs.confident-ai.com -
spulec/freezegun โญ 4,264
Let your Python tests travel through time -
getsentry/responses โญ 4,204
A utility for mocking out the Python Requests library. -
tox-dev/tox โญ 3,735
Command line driven CI frontend and development task automation tool.
๐ tox.wiki -
behave/behave โญ 3,244
BDD, Python style.
๐ behave.readthedocs.io/en/latest -
nedbat/coveragepy โญ 3,076
The code coverage tool for Python
๐ coverage.readthedocs.io -
kevin1024/vcrpy โญ 2,750
Automatically mock your HTTP interactions to simplify and speed up testing -
cobrateam/splinter โญ 2,732
splinter - Python test framework for web applications
๐ splinter.readthedocs.org/en/stable/index.html -
pytest-dev/pytest-testinfra โญ 2,388
With Testinfra you can write unit tests in Python to test actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
๐ testinfra.readthedocs.io -
pytest-dev/pytest-mock โญ 1,887
Thin wrapper around the mock package for easier use with pytest
๐ pytest-mock.readthedocs.io/en/latest -
pytest-dev/pytest-cov โญ 1,809
Coverage plugin for pytest. -
pytest-dev/pytest-xdist โญ 1,529
pytest plugin for distributed testing and loop-on-failures testing modes.
๐ pytest-xdist.readthedocs.io -
pytest-dev/pytest-asyncio โญ 1,462
Asyncio support for pytest
๐ pytest-asyncio.readthedocs.io -
taverntesting/tavern โญ 1,046
A command-line tool, Python library, and pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
๐ taverntesting.github.io
Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics.
-
facebook/prophet โญ 18,814
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
๐ facebook.github.io/prophet -
blue-yonder/tsfresh โญ 8,555
Automatic extraction of relevant features from time series.
๐ tsfresh.readthedocs.io -
unit8co/darts โญ 8,306
A Python library for user-friendly forecasting and anomaly detection on time series.
๐ unit8co.github.io/darts -
sktime/sktime โญ 8,169
A unified framework for machine learning with time series
๐ www.sktime.net -
facebookresearch/Kats โญ 5,619
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends. -
awslabs/gluonts โญ 4,752
Probabilistic time series modeling in Python
๐ ts.gluon.ai -
google-research/timesfm โญ 4,303
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
๐ research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting -
nixtla/statsforecast โญ 4,128
Lightning ⚡ fast forecasting with statistical and econometric models.
๐ nixtlaverse.nixtla.io/statsforecast -
tdameritrade/stumpy โญ 3,770
STUMPY is a powerful and scalable Python library for modern time series analysis
๐ stumpy.readthedocs.io/en/latest -
salesforce/Merlion โญ 3,517
Merlion: A Machine Learning Framework for Time Series Intelligence -
amazon-science/chronos-forecasting โญ 2,879
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
๐ arxiv.org/abs/2403.07815 -
rjt1990/pyflux โญ 2,115
Open source time series library for Python -
aistream-peelout/flow-forecast โญ 2,106
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
๐ flow-forecast.atlassian.net/wiki/spaces/ff/overview -
uber/orbit โญ 1,936
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
๐ orbit-ml.readthedocs.io/en/stable -
alkaline-ml/pmdarima โญ 1,614
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
๐ www.alkaline-ml.com/pmdarima -
time-series-foundation-models/lag-llama โญ 1,342
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting -
winedarksea/AutoTS โญ 1,185
Automated Time Series Forecasting -
autoviml/Auto_TS โญ 745
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome. -
google/temporian โญ 685
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
๐ temporian.readthedocs.io
Typing libraries: static and run-time type checking, annotations.
-
python/mypy โญ 18,890
Optional static typing for Python
๐ www.mypy-lang.org -
microsoft/pyright โญ 13,781
Static Type Checker for Python -
facebook/pyre-check โญ 6,910
Performant type-checking for Python.
๐ pyre-check.org -
python-attrs/attrs โญ 5,396
Python Classes Without Boilerplate
๐ www.attrs.org -
google/pytype โญ 4,825
A static type analyzer for Python code
๐ google.github.io/pytype -
instagram/MonkeyType โญ 4,825
A Python library that generates static type annotations by collecting runtime types -
python/typeshed โญ 4,478
Collection of library stubs for Python, with static types -
mtshiba/pylyzer โญ 2,639
A fast, feature-rich static code analyzer & language server for Python
๐ mtshiba.github.io/pylyzer -
microsoft/pylance-release โญ 1,739
Fast, feature-rich language support for Python. Documentation and issues for Pylance. -
agronholm/typeguard โญ 1,588
Run-time type checker for Python -
patrick-kidger/torchtyping โญ 1,416
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc. -
robertcraigie/pyright-python โญ 194
Python command line wrapper for pyright, a static type checker
๐ pypi.org/project/pyright
General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.
-
yt-dlp/yt-dlp โญ 99,072
A feature-rich command-line audio/video downloader
๐ discord.gg/h5mncfw63r -
home-assistant/core โญ 76,261
🏡 Open source home automation that puts local control and privacy first.
๐ www.home-assistant.io -
abi/screenshot-to-code โญ 67,970
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
๐ screenshottocode.com -
python/cpython โญ 65,073
The Python programming language
๐ www.python.org -
localstack/localstack โญ 57,515
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
๐ localstack.cloud -
faif/python-patterns โญ 40,871
A collection of design patterns/idioms in Python -
mingrammer/diagrams โญ 40,212
🎨 Diagram as Code for prototyping cloud system architectures
๐ diagrams.mingrammer.com -
ggerganov/whisper.cpp โญ 37,385
Port of OpenAI's Whisper model in C/C++ -
paul-gauthier/aider โญ 26,416
Aider is a command line tool that lets you pair program with LLMs, to edit code stored in your local git repository
๐ aider.chat -
openai/openai-python โญ 24,355
The official Python library for the OpenAI API
๐ pypi.org/project/openai -
keon/algorithms โญ 24,272
Minimal examples of data structures and algorithms in Python -
modularml/mojo โญ 23,641
The Mojo Programming Language
๐ docs.modular.com/mojo/manual -
norvig/pytudes โญ 23,279
Python programs, usually short, of considerable difficulty, to perfect particular skills. -
pydantic/pydantic โญ 22,286
Data validation using Python type hints
๐ docs.pydantic.dev -
squidfunk/mkdocs-material โญ 21,982
Documentation that simply works
๐ squidfunk.github.io/mkdocs-material -
facebookresearch/audiocraft โญ 21,419
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. -
blakeblackshear/frigate โญ 20,867
NVR with realtime local object detection for IP cameras
๐ frigate.video -
chriskiehl/Gooey โญ 20,808
Turn (almost) any Python command line program into a full GUI application with one line -
delgan/loguru โญ 20,673
Python logging made (stupidly) simple -
micropython/micropython โญ 19,819
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
๐ micropython.org -
mkdocs/mkdocs โญ 19,804
Project documentation with Markdown.
๐ www.mkdocs.org -
rustpython/RustPython โญ 19,568
A Python Interpreter written in Rust
๐ rustpython.github.io -
higherorderco/Bend โญ 17,992
A massively parallel, high-level programming language
๐ higherorderco.com -
kivy/kivy โญ 17,985
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
๐ kivy.org -
ipython/ipython โญ 16,388
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
๐ ipython.readthedocs.org -
alievk/avatarify-python โญ 16,358
Avatars for Zoom, Skype and other video-conferencing apps. -
openai/triton โญ 14,261
Development repository for the Triton language and compiler
๐ triton-lang.org -
zulko/moviepy โญ 12,972
Video editing with Python
๐ zulko.github.io/moviepy -
pyo3/pyo3 โญ 12,927
Rust bindings for the Python interpreter
๐ pyo3.rs -
pyodide/pyodide โญ 12,653
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
๐ pyodide.org/en/stable -
pytube/pytube โญ 12,579
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
๐ pytube.io -
caronc/apprise โญ 12,551
Apprise - Push Notifications that work with just about every platform!
๐ hub.docker.com/r/caronc/apprise -
python-pillow/Pillow โญ 12,509
The Python Imaging Library adds image processing capabilities to Python (Pillow is the friendly PIL fork)
๐ python-pillow.github.io -
nuitka/Nuitka โญ 12,454
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
๐ nuitka.net -
dbader/schedule โญ 11,959
Python job scheduling for humans.
๐ schedule.readthedocs.io -
ninja-build/ninja โญ 11,550
Ninja is a small build system with a focus on speed.
๐ ninja-build.org -
secdev/scapy โญ 11,052
Scapy: the Python-based interactive packet manipulation program & library.
๐ scapy.net -
asweigart/pyautogui โญ 10,790
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard. -
magicstack/uvloop โญ 10,621
Ultra fast asyncio event loop. -
pallets/jinja โญ 10,558
A very fast and expressive template engine.
๐ jinja.palletsprojects.com -
aristocratos/bpytop โญ 10,453
Linux/OSX/FreeBSD resource monitor -
cython/cython โญ 9,743
The most widely used Python to C compiler
๐ cython.org -
aws/serverless-application-model โญ 9,402
The AWS Serverless Application Model (AWS SAM) transform is an AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
๐ aws.amazon.com/serverless/sam -
paramiko/paramiko โญ 9,236
The leading native Python SSHv2 protocol library.
๐ paramiko.org -
boto/boto3 โญ 9,175
AWS SDK for Python
๐ aws.amazon.com/sdk-for-python -
facebookresearch/hydra โญ 9,011
Hydra is a framework for elegantly configuring complex applications
๐ hydra.cc -
arrow-py/arrow โญ 8,780
🏹 Better dates & times for Python
๐ arrow.readthedocs.io -
py-pdf/pypdf โญ 8,686
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
๐ pypdf.readthedocs.io/en/latest -
xonsh/xonsh โญ 8,517
Python-powered shell. Full-featured and cross-platform.
๐ xon.sh -
eternnoir/pyTelegramBotAPI โญ 8,252
Python Telegram Bot API. -
jasonppy/VoiceCraft โญ 8,083
Zero-Shot Speech Editing and Text-to-Speech in the Wild -
kellyjonbrazil/jc โญ 8,020
CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts. -
googleapis/google-api-python-client โญ 7,957
The official Python client library for Google's discovery based APIs.
๐ googleapis.github.io/google-api-python-client/docs -
theskumar/python-dotenv โญ 7,859
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
๐ saurabh-kumar.com/python-dotenv -
googlecloudplatform/python-docs-samples โญ 7,569
Code samples used on cloud.google.com -
icloud-photos-downloader/icloud_photos_downloader โญ 7,549
A command-line tool to download photos from iCloud -
google/latexify_py โญ 7,397
A library to generate LaTeX expression from Python code. -
pygithub/PyGithub โญ 7,168
Typed interactions with the GitHub API v3
๐ pygithub.readthedocs.io -
marshmallow-code/marshmallow โญ 7,090
A lightweight library for converting complex objects to and from simple Python datatypes.
๐ marshmallow.readthedocs.io -
jd/tenacity โญ 7,002
Retrying library for Python
๐ tenacity.readthedocs.io -
bndr/pipreqs โญ 7,001
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward. -
hugapi/hug โญ 6,868
Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler. -
pyca/cryptography โญ 6,854
cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
๐ cryptography.io -
sphinx-doc/sphinx โญ 6,789
The Sphinx documentation generator
๐ www.sphinx-doc.org -
gorakhargosh/watchdog โญ 6,734
Python library and shell utilities to monitor filesystem events.
๐ packages.python.org/watchdog -
openai/point-e โญ 6,617
Point cloud diffusion for 3D model synthesis -
timdettmers/bitsandbytes โญ 6,592
Accessible large language models via k-bit quantization for PyTorch.
๐ huggingface.co/docs/bitsandbytes/main/en/index -
ijl/orjson โญ 6,528
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy -
agronholm/apscheduler โญ 6,484
Task scheduling library for Python -
sdispater/pendulum โญ 6,346
Python datetimes made easy
๐ pendulum.eustace.io -
pdfminer/pdfminer.six โญ 6,170
Community maintained fork of pdfminer - we fathom PDF
๐ pdfminersix.readthedocs.io -
scikit-image/scikit-image โญ 6,165
Image processing in Python
๐ scikit-image.org -
wireservice/csvkit โญ 6,077
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
๐ csvkit.readthedocs.io -
pytransitions/transitions โญ 5,908
A lightweight, object-oriented finite state machine implementation in Python with many extensions -
rsalmei/alive-progress โญ 5,681
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations! -
traceloop/openllmetry โญ 5,362
Open-source observability for your LLM application, based on OpenTelemetry
๐ www.traceloop.com/openllmetry -
spotify/pedalboard โญ 5,358
A Python library for audio.
๐ spotify.github.io/pedalboard -
buildbot/buildbot โญ 5,305
Python-based continuous integration testing framework; your pull requests are more than welcome!
๐ www.buildbot.net -
prompt-toolkit/ptpython โญ 5,259
A better Python REPL -
pywinauto/pywinauto โญ 5,128
Windows GUI Automation with Python (based on text properties)
๐ pywinauto.github.io -
pycqa/pycodestyle โญ 5,065
Simple Python style checker in one Python file
๐ pycodestyle.pycqa.org -
tebelorg/RPA-Python โญ 5,057
Python package for doing RPA -
pythonnet/pythonnet โญ 4,892
Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and provides a powerful application scripting tool for .NET developers.
๐ pythonnet.github.io -
jorgebastida/awslogs โญ 4,887
AWS CloudWatch logs for Humans™ -
comet-ml/opik โญ 4,851
Opik is an open-source platform for evaluating, testing and monitoring LLM applications.
๐ www.comet.com/docs/opik -
pytoolz/toolz โญ 4,762
A functional standard library for Python.
๐ toolz.readthedocs.org -
hhatto/autopep8 โญ 4,589
A tool that automatically formats Python code to conform to the PEP 8 style guide.
๐ pypi.org/project/autopep8 -
pyinvoke/invoke โญ 4,464
Pythonic task management & command execution.
๐ pyinvoke.org -
bogdanp/dramatiq โญ 4,459
A fast and reliable background task processing library for Python 3.
๐ dramatiq.io -
ashleve/lightning-hydra-template โญ 4,419
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡ -
blealtan/efficient-kan โญ 4,209
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). -
adafruit/circuitpython โญ 4,191
CircuitPython - a Python implementation for teaching coding with microcontrollers
๐ circuitpython.org -
pyo3/maturin โญ 4,158
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
๐ maturin.rs -
ets-labs/python-dependency-injector โญ 4,131
Dependency injection framework for Python
๐ python-dependency-injector.ets-labs.org -
evhub/coconut โญ 4,130
Coconut (coconut-lang.org) is a variant of Python that adds on top of Python syntax new features for simple, elegant, Pythonic functional programming.
๐ coconut-lang.org -
miguelgrinberg/python-socketio โญ 4,080
Python Socket.IO server and client -
pyinfra-dev/pyinfra โญ 4,051
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast and scales from one server to thousands.
๐ pyinfra.com -
joblib/joblib โญ 3,962
Computing with Python functions.
๐ joblib.readthedocs.org -
python-markdown/markdown โญ 3,877
A Python implementation of John Gruber's Markdown with Extension support.
๐ python-markdown.github.io -
rspeer/python-ftfy โญ 3,848
Fixes mojibake and other glitches in Unicode text, after the fact.
๐ ftfy.readthedocs.org -
zeromq/pyzmq โญ 3,790
PyZMQ: Python bindings for zeromq
๐ zguide.zeromq.org/py:all -
more-itertools/more-itertools โญ 3,785
More routines for operating on iterables, beyond itertools
๐ more-itertools.rtfd.io -
hynek/structlog โญ 3,741
Simple, powerful, and fast logging for Python.
๐ www.structlog.org -
pydata/xarray โญ 3,696
N-D labeled arrays and datasets in Python
๐ xarray.dev -
spotify/basic-pitch โญ 3,657
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
๐ basicpitch.io -
pypi/warehouse โญ 3,651
The Python Package Index
๐ pypi.org -
tartley/colorama โญ 3,605
Simple cross-platform colored terminal text in Python -
jorisschellekens/borb โญ 3,444
borb is a library for reading, creating and manipulating PDF files in Python.
๐ borbpdf.com -
pyserial/pyserial โญ 3,308
Python serial port access library -
camelot-dev/camelot โญ 3,139
A Python library to extract tabular data from PDFs
๐ camelot-py.readthedocs.io -
libaudioflux/audioFlux โญ 2,981
A library for audio and music analysis, feature extraction.
๐ audioflux.top -
legrandin/pycryptodome โญ 2,926
A self-contained cryptographic library for Python
๐ www.pycryptodome.org -
tox-dev/pipdeptree โญ 2,851
A command line utility to display dependency tree of the installed Python packages
๐ pypi.python.org/pypi/pipdeptree -
lxml/lxml โญ 2,754
The lxml XML toolkit for Python
๐ lxml.de -
liiight/notifiers โญ 2,690
The easy way to send notifications
๐ notifiers.readthedocs.io -
whylabs/whylogs โญ 2,685
An open-source data logging library for machine learning models and data pipelines. 📊 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
๐ whylogs.readthedocs.io -
cdgriffith/Box โญ 2,667
Python dictionaries with advanced dot notation access
๐ github.com/cdgriffith/box/wiki -
pexpect/pexpect โญ 2,658
A Python module for controlling interactive programs in a pseudo-terminal
๐ pexpect.readthedocs.io -
pydantic/logfire โญ 2,629
Uncomplicated Observability for Python and beyond! 🪵🔥
๐ logfire.pydantic.dev/docs -
litl/backoff โญ 2,629
Python library providing function decorators for configurable backoff and retry -
yaml/pyyaml โญ 2,625
Canonical source repository for PyYAML -
scrapinghub/dateparser โญ 2,598
Python parser for human-readable dates -
pypa/setuptools โญ 2,586
Official project repository for the Setuptools build system
๐ pypi.org/project/setuptools -
jcrist/msgspec โญ 2,578
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
๐ jcristharif.com/msgspec -
pyston/pyston โญ 2,504
(No longer maintained) A faster and highly-compatible implementation of the Python programming language.
๐ www.pyston.org -
dosisod/refurb โญ 2,493
A tool for refurbishing and modernizing Python codebases -
hgrecco/pint โญ 2,470
Operate and manipulate physical quantities in Python
๐ pint.readthedocs.org -
nschloe/tikzplotlib โญ 2,459
Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX. -
grantjenks/python-diskcache โญ 2,453
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
๐ www.grantjenks.com/docs/diskcache -
dateutil/dateutil โญ 2,416
Useful extensions to the standard Python datetime features -
tkem/cachetools โญ 2,411
Various memoizing collections and decorators, including variants of the Python Standard Library's @lru_cache function decorator -
pndurette/gTTS โญ 2,384
Python library and CLI tool to interface with Google Translate's text-to-speech API
๐ gtts.readthedocs.org -
rhettbull/osxphotos โญ 2,337
Python app to work with pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata. -
abseil/abseil-py โญ 2,320
A collection of Python library code for building Python applications. The code is collected from Google's own Python code base, and has been extensively tested and used in production. -
kiminewt/pyshark โญ 2,308
Python wrapper for tshark, allowing Python packet parsing using Wireshark dissectors -
pyparsing/pyparsing โญ 2,262
Python library for creating PEG parsers -
astanin/python-tabulate โญ 2,244
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
๐ pypi.org/project/tabulate -
nateshmbhat/pyttsx3 โญ 2,222
Offline Text To Speech synthesis for Python -
ianmiell/shutit โญ 2,149
Automation framework for programmers
๐ ianmiell.github.io/shutit -
grahamdumpleton/wrapt โญ 2,093
A Python module for decorators, wrappers and monkey patching. -
seperman/deepdiff โญ 2,085
DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.
๐ zepworks.com -
google/gin-config โญ 2,071
Gin provides a lightweight configuration framework for Python -
omry/omegaconf โญ 2,041
Flexible Python configuration system. The last one you will ever need. -
mitmproxy/pdoc โญ 2,022
API Documentation for Python Projects
๐ pdoc.dev -
pyfilesystem/pyfilesystem2 โญ 2,019
Python's Filesystem abstraction layer
๐ www.pyfilesystem.org -
python-rope/rope โญ 1,995
A Python refactoring library -
julienpalard/Pipe โญ 1,989
A Python library to use infix notation in Python -
numba/llvmlite โญ 1,983
A lightweight LLVM python binding for writing JIT compilers
๐ llvmlite.pydata.org -
landscapeio/prospector โญ 1,970
Inspects Python source files and provides information about type and location of classes, methods etc -
hbldh/bleak โญ 1,928
A cross platform Bluetooth Low Energy Client for Python using asyncio -
carpedm20/emoji โญ 1,924
emoji terminal output for Python -
pydoit/doit โญ 1,908
CLI task management & automation tool
๐ pydoit.org -
chaostoolkit/chaostoolkit โญ 1,903
Chaos Engineering Toolkit & Orchestration for Developers
๐ chaostoolkit.org -
pygments/pygments โญ 1,899
Pygments is a generic syntax highlighter written in Python
๐ pygments.org -
open-telemetry/opentelemetry-python โญ 1,882
OpenTelemetry Python API and SDK
๐ opentelemetry.io -
samuelcolvin/watchfiles โญ 1,875
Simple, modern and fast file watching and code reload in Python.
๐ watchfiles.helpmanual.io -
p0dalirius/Coercer โญ 1,873
A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
๐ podalirius.net -
home-assistant/supervisor ⭐ 1,855
Home Assistant Supervisor
🔗 home-assistant.io/hassio -
joowani/binarytree ⭐ 1,811
Python Library for Studying Binary Trees
🔗 binarytree.readthedocs.io -
konradhalas/dacite ⭐ 1,805
Simple creation of data classes from dictionaries. -
mkdocstrings/mkdocstrings ⭐ 1,797
Automatic documentation from sources, for MkDocs.
🔗 mkdocstrings.github.io -
rubik/radon ⭐ 1,767
Various code metrics for Python code
🔗 radon.readthedocs.org -
kalliope-project/kalliope ⭐ 1,724
Kalliope is a framework that will help you to create your own personal assistant.
🔗 kalliope-project.github.io -
anthropics/anthropic-sdk-python ⭐ 1,669
SDK providing access to Anthropic's safety-first language model APIs -
quodlibet/mutagen ⭐ 1,633
Python module for handling audio metadata
🔗 mutagen.readthedocs.io -
instagram/LibCST ⭐ 1,604
A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
🔗 libcst.readthedocs.io -
facebookincubator/Bowler ⭐ 1,574
Safe code refactoring for modern Python.
🔗 pybowler.io -
imageio/imageio ⭐ 1,547
Python library for reading and writing image data
🔗 imageio.readthedocs.io -
fabiocaccamo/python-benedict ⭐ 1,528
dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities. -
lcompilers/lpython ⭐ 1,519
Python compiler
🔗 lpython.org -
nficano/python-lambda ⭐ 1,501
A toolkit for developing and deploying serverless Python code in AWS Lambda. -
aws-samples/aws-glue-samples ⭐ 1,459
AWS Glue code samples -
lidatong/dataclasses-json ⭐ 1,403
Easily serialize Data Classes to and from JSON -
brandon-rhodes/python-patterns ⭐ 1,366
Source code behind the python-patterns.guide site by Brandon Rhodes -
aio-libs/yarl ⭐ 1,363
Yet another URL library
🔗 yarl.aio-libs.org -
ossf/criticality_score ⭐ 1,356
Gives criticality score for an open source project -
oracle/graalpython ⭐ 1,299
GraalPy - A high-performance embeddable Python 3 runtime for Java
🔗 www.graalvm.org/python -
pypy/pypy ⭐ 1,237
PyPy is a very fast and compliant implementation of the Python language.
🔗 pypy.org -
pyo3/rust-numpy ⭐ 1,179
PyO3-based Rust bindings of the NumPy C-API -
ariebovenberg/whenever ⭐ 1,174
⏰ Modern datetime library for Python
🔗 whenever.rtfd.io -
pyfpdf/fpdf2 ⭐ 1,174
Simple PDF generation for Python
🔗 py-pdf.github.io/fpdf2 -
pdoc3/pdoc ⭐ 1,147
Auto-generate API documentation for Python projects
🔗 pdoc3.github.io/pdoc -
fsspec/filesystem_spec ⭐ 1,101
A specification that python filesystems should adhere to. -
milvus-io/pymilvus ⭐ 1,078
Python SDK for Milvus. -
c4urself/bump2version ⭐ 1,072
Version-bump your software with a single command
🔗 pypi.python.org/pypi/bump2version -
metachris/logzero ⭐ 1,024
Robust and effective logging for Python 2 and 3.
🔗 logzero.readthedocs.io -
extensityai/symbolicai ⭐ 1,005
Compositional Differentiable Programming Library - divide-and-conquer approach to break down a complex problem into smaller, more manageable problems. -
fastai/fastcore ⭐ 997
Python supercharged for the fastai library
🔗 fastcore.fast.ai -
lastmile-ai/aiconfig ⭐ 985
AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
🔗 aiconfig.lastmileai.dev -
juanbindez/pytubefix ⭐ 973
Python3 library for downloading YouTube Videos.
🔗 pytubefix.readthedocs.io -
barracuda-fsh/pyobd ⭐ 914
An OBD-II compliant car diagnostic tool -
qdrant/qdrant-client ⭐ 849
Python client for Qdrant vector search engine
🔗 qdrant.tech -
samuelcolvin/dirty-equals ⭐ 839
Doing dirty (but extremely useful) things with equals.
🔗 dirty-equals.helpmanual.io -
tox-dev/filelock ⭐ 808
A platform independent file lock in Python, which provides a simple way of inter-process communication
🔗 py-filelock.readthedocs.io -
modal-labs/modal-examples ⭐ 774
Examples of programs built using Modal
🔗 modal.com/docs -
open-telemetry/opentelemetry-python-contrib ⭐ 768
OpenTelemetry instrumentation for Python modules
🔗 opentelemetry.io -
pypa/build ⭐ 767
A simple, correct Python build frontend
🔗 build.pypa.io -
gefyrahq/gefyra ⭐ 710
Blazingly-fast, rock-solid, local application development with Kubernetes.
🔗 gefyra.dev -
instagram/Fixit ⭐ 674
Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
🔗 fixit.rtfd.io/en/latest -
argoproj-labs/hera ⭐ 645
Hera makes Python code easy to orchestrate on Argo Workflows through native Python integrations. It lets you construct and submit your Workflows entirely in Python. ⭐ Remember to star!
🔗 hera.rtfd.io -
platformdirs/platformdirs ⭐ 638
A small Python module for determining appropriate platform-specific dirs, e.g. a "user data dir".
🔗 platformdirs.readthedocs.io -
fastai/ghapi ⭐ 636
A delightful and complete interface to GitHub's amazing API
🔗 ghapi.fast.ai -
methexis-inc/terminal-copilot ⭐ 572
A smart terminal assistant that helps you find the right command. -
chrishayuk/mcp-cli ⭐ 542
A protocol-level CLI designed to interact with a Model Context Protocol server. The client allows users to send commands, query data, and interact with various resources provided by the server. -
steamship-core/steamship-langchain ⭐ 510
steamship-langchain -
pypdfium2-team/pypdfium2 ⭐ 504
Python bindings to PDFium
🔗 pypdfium2.readthedocs.io -
neuml/annotateai ⭐ 287
Automatically annotates papers using Large Language Models (LLMs)
Visualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL.
-
apache/superset ⭐ 64,196
Apache Superset is a Data Visualization and Data Exploration Platform
🔗 superset.apache.org -
streamlit/streamlit ⭐ 37,080
Streamlit - A faster way to build and share data apps.
🔗 streamlit.io -
gradio-app/gradio ⭐ 35,732
Build and share delightful machine learning apps, all in Python. Star to support our work!
🔗 www.gradio.app -
plotly/dash ⭐ 21,880
Data Apps & Dashboards for Python. No JavaScript Required.
🔗 plotly.com/dash -
danny-avila/LibreChat ⭐ 21,427
LibreChat is a free, open source AI chat platform. This Web UI offers vast customization, supporting numerous AI providers, services, and integrations.
🔗 librechat.ai -
matplotlib/matplotlib ⭐ 20,660
matplotlib: plotting with Python
🔗 matplotlib.org/stable -
bokeh/bokeh ⭐ 19,566
Interactive Data Visualization in the browser, from Python
🔗 bokeh.org -
plotly/plotly.py ⭐ 16,664
The interactive graphing library for Python ✨ This project now includes Plotly Express!
🔗 plotly.com/python -
mwaskom/seaborn ⭐ 12,814
Statistical data visualization in Python
🔗 seaborn.pydata.org -
visgl/deck.gl ⭐ 12,446
WebGL2 powered visualization framework
🔗 deck.gl -
marceloprates/prettymaps ⭐ 11,489
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries. -
altair-viz/altair ⭐ 9,555
Declarative visualization library for Python
🔗 altair-viz.github.io -
nvidia/TensorRT-LLM ⭐ 9,300
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
🔗 nvidia.github.io/tensorrt-llm -
lux-org/lux ⭐ 5,244
Automatically visualize your pandas dataframe via a single print! -
renpy/renpy ⭐ 5,216
The Ren'Py Visual Novel Engine
🔗 www.renpy.org -
holoviz/panel ⭐ 5,004
Panel: The powerful data exploration & web app framework for Python
🔗 panel.holoviz.org -
man-group/dtale ⭐ 4,836
Visualizer for pandas data structures
🔗 alphatechadmin.pythonanywhere.com -
has2k1/plotnine ⭐ 4,114
A Grammar of Graphics for Python
🔗 plotnine.org -
residentmario/missingno ⭐ 4,017
missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset. -
pyqtgraph/pyqtgraph ⭐ 3,965
Fast data visualization and GUI tools for scientific / engineering applications
🔗 www.pyqtgraph.org -
vispy/vispy ⭐ 3,368
Main repository for Vispy
🔗 vispy.org -
ml-tooling/opyrator ⭐ 3,117
Turns your machine learning code into microservices with web API, interactive GUI, and more.
🔗 opyrator-playground.mltooling.org -
netflix/flamescope ⭐ 3,036
FlameScope is a visualization tool for exploring different time ranges as Flame Graphs. -
pyvista/pyvista ⭐ 2,912
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
🔗 docs.pyvista.org -
facebookresearch/hiplot ⭐ 2,777
HiPlot makes understanding high dimensional data easy
🔗 facebookresearch.github.io/hiplot -
mckinsey/vizro ⭐ 2,775
Vizro is a low-code toolkit for building high-quality data visualization apps.
🔗 vizro.readthedocs.io/en/stable -
holoviz/holoviews ⭐ 2,744
With Holoviews, your data visualizes itself.
🔗 holoviews.org -
kozea/pygal ⭐ 2,688
pygal is a dynamic SVG charting library written in python.
🔗 www.pygal.org -
napari/napari ⭐ 2,268
A fast, interactive, multi-dimensional image viewer for Python. It's designed for browsing, annotating, and analyzing large multi-dimensional images.
🔗 napari.org -
marcomusy/vedo ⭐ 2,081
A python module for scientific analysis of 3D data based on VTK and Numpy
🔗 vedo.embl.es -
datapane/datapane ⭐ 1,389
Build and share data reports in 100% Python
🔗 datapane.com -
facultyai/dash-bootstrap-components ⭐ 1,139
Bootstrap components for Plotly Dash
🔗 dash-bootstrap-components.opensource.faculty.ai -
nomic-ai/deepscatter ⭐ 1,079
Zoomable, animated scatterplots in the browser that scale to over a billion points -
holoviz/holoviz ⭐ 858
High-level tools to simplify visualization in Python.
🔗 holoviz.org -
hazyresearch/meerkat ⭐ 836
Creative interactive views of any dataset. -
anvaka/word2vec-graph ⭐ 707
Exploring word2vec embeddings as a graph of nearest neighbors
🔗 anvaka.github.io/pm/#/galaxy/word2vec-wiki?cx=-4651&cy=4492&cz=-1988&lx=-0.0915&ly=-0.9746&lz=-0.2030&lw=0.0237&ml=300&s=1.75&l=1&v=d50_clean_small
Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.
-
django/django ⭐ 82,157
The Web framework for perfectionists with deadlines.
🔗 www.djangoproject.com -
tiangolo/fastapi ⭐ 80,493
FastAPI framework, high performance, easy to learn, fast to code, ready for production
🔗 fastapi.tiangolo.com -
pallets/flask ⭐ 68,703
The Python micro framework for building web applications.
🔗 flask.palletsprojects.com -
sherlock-project/sherlock ⭐ 62,208
Hunt down social media accounts by username across social networks
🔗 sherlockproject.xyz -
psf/requests ⭐ 52,451
A simple, yet elegant, HTTP library.
🔗 requests.readthedocs.io/en/latest -
tornadoweb/tornado ⭐ 21,824
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
🔗 www.tornadoweb.org -
reflex-dev/reflex ⭐ 21,556
Web apps in pure Python
🔗 reflex.dev -
wagtail/wagtail ⭐ 18,690
A Django content management system focused on flexibility and user experience
🔗 wagtail.org -
huge-success/sanic ⭐ 18,224
Accelerate your web app development | Build fast. Run fast.
🔗 sanic.dev -
pyscript/pyscript ⭐ 18,178
A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
🔗 pyscript.net -
vincigit00/Scrapegraph-ai ⭐ 17,801
ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents
🔗 scrapegraphai.com -
aio-libs/aiohttp ⭐ 15,381
Asynchronous HTTP client/server framework for asyncio and Python
🔗 docs.aiohttp.org -
encode/httpx ⭐ 13,648
A next generation HTTP client for Python.
🔗 www.python-httpx.org -
getpelican/pelican ⭐ 12,711
Static site generator that supports Markdown and reST syntax. Powered by Python.
🔗 getpelican.com -
flet-dev/flet ⭐ 12,265
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
🔗 flet.dev -
zauberzeug/nicegui ⭐ 10,882
Create web-based user interfaces with Python. The nice way.
🔗 nicegui.io -
aws/chalice ⭐ 10,752
Python Serverless Microframework for AWS -
encode/starlette ⭐ 10,555
The little ASGI framework that shines.
🔗 www.starlette.io -
benoitc/gunicorn ⭐ 9,965
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
🔗 www.gunicorn.org -
falconry/falcon ⭐ 9,585
The no-magic web API and microservices framework for Python developers, with an emphasis on reliability and performance at scale.
🔗 falcon.readthedocs.io -
encode/uvicorn ⭐ 8,835
An ASGI web server, for Python.
🔗 www.uvicorn.org -
bottlepy/bottle ⭐ 8,525
bottle.py is a fast and simple micro-framework for python web-applications.
🔗 bottlepy.org -
graphql-python/graphene ⭐ 8,140
GraphQL framework for Python
🔗 graphene-python.org -
reactive-python/reactpy ⭐ 7,937
ReactPy is a library for building user interfaces in Python without Javascript
🔗 reactpy.dev -
vitalik/django-ninja ⭐ 7,684
Fast, Async-ready, OpenAPI, type hints based framework for building APIs
🔗 django-ninja.dev -
pyeve/eve ⭐ 6,712
REST API framework designed for human beings
🔗 python-eve.org -
pallets/werkzeug ⭐ 6,694
The comprehensive WSGI web application library.
🔗 werkzeug.palletsprojects.com -
starlite-api/litestar ⭐ 5,949
Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
🔗 litestar.dev -
webpy/webpy ⭐ 5,899
web.py is a web framework for python that is as simple as it is powerful.
🔗 webpy.org -
fastapi-users/fastapi-users ⭐ 4,855
Ready-to-use and customizable users management for FastAPI
🔗 fastapi-users.github.io/fastapi-users -
stephenmcd/mezzanine ⭐ 4,772
CMS framework for Django
🔗 mezzanine.jupo.org -
nameko/nameko ⭐ 4,729
A microservices framework for Python that lets service developers concentrate on application logic and encourages testability.
🔗 www.nameko.io -
pywebio/PyWebIO ⭐ 4,638
Write interactive web apps in a script-like way.
🔗 pywebio.readthedocs.io -
strawberry-graphql/strawberry ⭐ 4,140
A GraphQL library for Python that leverages type annotations
🔗 strawberry.rocks -
h2oai/wave ⭐ 4,044
H2O Wave is a software stack for building beautiful, low-latency, realtime, browser-based applications and dashboards entirely in Python/R without using HTML, Javascript, or CSS.
🔗 wave.h2o.ai -
pylons/pyramid ⭐ 4,006
Pyramid - A Python web framework
🔗 trypyramid.com -
websocket-client/websocket-client ⭐ 3,616
WebSocket client for Python
🔗 github.com/websocket-client/websocket-client -
unbit/uwsgi ⭐ 3,477
uWSGI application server container
🔗 projects.unbit.it/uwsgi -
pallets/quart ⭐ 3,144
An async Python micro framework for building web applications.
🔗 quart.palletsprojects.com -
fastapi-admin/fastapi-admin ⭐ 3,047
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
🔗 fastapi-admin-docs.long2ice.io -
flipkart-incubator/Astra ⭐ 2,538
Automated Security Testing For REST APIs -
dot-agent/nextpy ⭐ 2,256
Self-Modifying Framework from the Future. World's First AMS
🔗 dotagent.ai -
masoniteframework/masonite ⭐ 2,238
The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
🔗 docs.masoniteproject.com -
python-restx/flask-restx ⭐ 2,181
Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
🔗 flask-restx.readthedocs.io/en/latest -
s3rius/FastAPI-template ⭐ 2,126
Feature rich robust FastAPI template. -
neoteroi/BlackSheep ⭐ 2,084
Fast ASGI web framework for Python
🔗 www.neoteroi.dev/blacksheep -
dmontagu/fastapi-utils ⭐ 2,004
Reusable utilities for FastAPI: a number of utilities to help reduce boilerplate and reuse common functionality across projects
🔗 fastapiutils.github.io/fastapi-utils -
cherrypy/cherrypy ⭐ 1,874
CherryPy is a pythonic, object-oriented HTTP framework. https://cherrypy.dev
🔗 docs.cherrypy.dev -
indico/indico ⭐ 1,821
Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.
🔗 getindico.io -
jordaneremieff/mangum ⭐ 1,794
An adapter for running ASGI applications in AWS Lambda to handle Function URL, API Gateway, ALB, and Lambda@Edge events
🔗 mangum.fastapiexpert.com -
wtforms/wtforms ⭐ 1,525
A flexible forms validation and rendering library for Python.
🔗 wtforms.readthedocs.io -
long2ice/fastapi-cache ⭐ 1,452
fastapi-cache is a tool to cache FastAPI responses and function results, with backend support for Redis and Memcached.
🔗 github.com/long2ice/fastapi-cache -
awtkns/fastapi-crudrouter ⭐ 1,450
A dynamic FastAPI router that automatically creates CRUD routes for your models
🔗 fastapi-crudrouter.awtkns.com -
rstudio/py-shiny ⭐ 1,373
Shiny for Python
🔗 shiny.posit.co/py -
whitphx/stlite ⭐ 1,318
A port of Streamlit to WebAssembly, powered by Pyodide.
🔗 edit.share.stlite.net -
magicstack/httptools ⭐ 1,227
Fast HTTP parser -
koxudaxi/fastapi-code-generator ⭐ 1,120
This code generator creates FastAPI app from an openapi file. -
aeternalis-ingenium/FastAPI-Backend-Template ⭐ 692
A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.
Interactive version: www.awesomepython.org, Hugging Face Dataset: awesome-python
Please raise a new issue to suggest a Python repo that you would like to see added.
1,709 hand-picked awesome Python libraries and frameworks, updated 06 Feb 2025