
langchain-mariadb

The langchain-mariadb package provides implementations of core LangChain abstractions using MariaDB's vector capabilities.

The package is released under the MIT license.

Feel free to use the abstractions as provided or modify/extend them as appropriate for your own application.

Installation

pip install -U langchain-mariadb

Vector store

This is an implementation of a LangChain vectorstore using MariaDB as the backend. Vector support requires MariaDB 11.7.1 or later.

This code provides a MariaDB vectorstore implementation with the following features:

  • Uses MariaDB's native vector similarity search capabilities
  • Supports both cosine and euclidean distance metrics
  • Provides comprehensive metadata filtering
  • Uses connection pooling for better performance
  • Supports custom table and column configurations
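
For intuition on the two supported metrics: cosine distance compares only the direction of two vectors, while euclidean distance also accounts for magnitude. A quick server-free illustration in plain Python (not part of the package API):

```python
import math

def cosine_distance(a, b):
    # 1 - cosine similarity: 0.0 means same direction
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return 1.0 - dot / (norm_a * norm_b)

def euclidean_distance(a, b):
    # straight-line distance: sensitive to vector magnitude
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

v1 = [1.0, 0.0]
v2 = [2.0, 0.0]  # same direction, twice the magnitude

print(cosine_distance(v1, v2))     # 0.0 — identical direction
print(euclidean_distance(v1, v2))  # 1.0 — magnitudes differ
```

Embeddings from the same model usually have similar norms, so cosine is a common default; euclidean can matter when magnitude is meaningful.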

Setup

You can run the following command to spin up a MariaDB container:

docker run --name mariadb-container -e MARIADB_ROOT_PASSWORD=langchain -e MARIADB_DATABASE=langchain -p 3306:3306 -d mariadb:11.7

Install the MariaDB Connector/C libraries and the required Python packages:

# on Ubuntu
sudo apt install libmariadb3 libmariadb-dev
# on CentOS, RHEL, Rocky Linux
sudo yum install MariaDB-shared MariaDB-devel
# Python packages
pip install --quiet -U langchain_openai mariadb langchain_mariadb

Initialize the vectorstore

from langchain_openai import OpenAIEmbeddings
from langchain_mariadb import MariaDBStore
from langchain_core.documents import Document
import mariadb

# Create a connection pool
pool = mariadb.ConnectionPool(
    pool_name="mypool",
    pool_size=3,
    host="127.0.0.1",
    port=3306,
    user="langchain",
    password="langchain",
    database="langchain"
)

# Create a new vector store
vectorstore = MariaDBStore(
    embeddings=OpenAIEmbeddings(),
    embedding_length=1536,
    datasource=pool,
    collection_name="my_docs"
)

Add new data

# adding documents
docs = [
    Document(page_content='there are cats in the pond', metadata={"id": 1, "location": "pond", "topic": "animals"}),
    Document(page_content='ducks are also found in the pond', metadata={"id": 2, "location": "pond", "topic": "animals"}),
    Document(page_content='fresh apples are available at the market', metadata={"id": 3, "location": "market", "topic": "food"}),
    Document(page_content='the market also sells fresh oranges', metadata={"id": 4, "location": "market", "topic": "food"}),
    Document(page_content='the new art exhibit is fascinating', metadata={"id": 5, "location": "museum", "topic": "art"}),
]
vectorstore.add_documents(docs)

# add from text
texts = [
    'a sculpture exhibit is also at the museum',
    'a new coffee shop opened on Main Street',
    'the book club meets at the library',
    'the library hosts a weekly story time for kids',
    'a cooking class for beginners is offered at the community center'
]

# optional metadata
metadatas = [
    {"id": 6, "location": "museum", "topic": "art"},
    {"id": 7, "location": "Main Street", "topic": "food"},
    {"id": 8, "location": "library", "topic": "reading"},
    {"id": 9, "location": "library", "topic": "reading"},
    {"id": 10, "location": "community center", "topic": "classes"}
]

vectorstore.add_texts(texts=texts, metadatas=metadatas)

Similarity search

# Search similar texts
results = vectorstore.similarity_search("Hello", k=2)

# Search with metadata filter
results = vectorstore.similarity_search(
    "Hello",
    filter={"category": "greeting"}
)

Filtering Support

The vectorstore supports a set of filters that can be applied against the metadata fields of the documents.

Operator  Meaning/Category
$eq       Equality (==)
$ne       Inequality (!=)
$lt       Less than (<)
$lte      Less than or equal (<=)
$gt       Greater than (>)
$gte      Greater than or equal (>=)
$in       Special cased (in)
$nin      Special cased (not in)
$like     Text (like)
$nlike    Text (not like)
$and      Logical (and)
$or       Logical (or)
$not      Logical (not)

# Search with simple filter
results = vectorstore.similarity_search('kitty', k=10, filter={
    'id': {'$in': [1, 5, 2, 9]}
})

# Search with multiple conditions (AND)
results = vectorstore.similarity_search('ducks', k=10, filter={
    'id': {'$in': [1, 5, 2, 9]},
    'location': {'$in': ["pond", "market"]}
})
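
Operators can also be nested and combined. The sketch below builds a few filter dictionaries from the operators in the table above, using the field names of the sample metadata; check the package's filter handling for exact shorthand behavior:

```python
# $or: documents whose topic is "food" OR "art"
food_or_art = {"$or": [{"topic": {"$eq": "food"}}, {"topic": {"$eq": "art"}}]}

# $like: locations beginning with "Main" (SQL LIKE pattern syntax)
on_main = {"location": {"$like": "Main%"}}

# $and: pond documents with an id below 5
early_pond = {"$and": [{"location": {"$eq": "pond"}}, {"id": {"$lt": 5}}]}

# Each dictionary is passed as the `filter` argument, e.g.:
# results = vectorstore.similarity_search("ducks", k=10, filter=early_pond)
```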

ChatMessageHistory

The chat message history abstraction persists chat messages in a MariaDB table.

MariaDBChatMessageHistory is parameterized using a table_name and a session_id.

The table_name is the name of the table in the database where the chat messages will be stored.

The session_id is a unique identifier for the chat session; the caller can generate one with uuid.uuid4().

import uuid

from langchain_core.messages import SystemMessage, AIMessage, HumanMessage
from langchain_mariadb import MariaDBChatMessageHistory
import mariadb

# Create a connection pool (reusing the credentials from the Docker example above)
pool = mariadb.ConnectionPool(
    pool_name="chat_pool",
    host="127.0.0.1",
    port=3306,
    user="langchain",
    password="langchain",
    database="langchain"
)

# Create the table schema (only needs to be done once)
table_name = "chat_history"
MariaDBChatMessageHistory.create_tables(pool, table_name)

# Initialize the chat history manager
chat_history = MariaDBChatMessageHistory(
    table_name,
    str(uuid.uuid4()), # session_id
    datasource=pool
)

# Add messages to the chat history
chat_history.add_messages([
    SystemMessage(content="Meow"),
    AIMessage(content="woof"),
    HumanMessage(content="bark"),
])

print(chat_history.messages)
