feat: add Milvus vectorDB #1171
base: main
Conversation
Hi @zc277584121! Thank you for your pull request and welcome to our community.

Action Required: In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process: In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g. your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA. Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!
Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!
@zc277584121 Thank you! @franciscojavierarceo given your prior experience on this, would you like to help review this PR?
yeah of course!
@@ -25,7 +25,7 @@ We are working on adding a few more APIs to complete the application lifecycle.

The goal of Llama Stack is to build an ecosystem where users can easily swap out different implementations for the same API. Examples for these include:
- LLM inference providers (e.g., Fireworks, Together, AWS Bedrock, Groq, Cerebras, SambaNova, etc.),
- Vector databases (e.g., ChromaDB, Weaviate, Qdrant, FAISS, PGVector, etc.),
Please also update the docs section under https://github.com/meta-llama/llama-stack/tree/main/docs/source/providers/vector_io/. See this PR as a reference: #1195
async def get_adapter_impl(config: MilvusVectorIOConfig, deps: Dict[Api, ProviderSpec]):
    from .milvus import MilvusVectorIOAdapter
I think the inline and remote implementations are equivalent. If that is indeed the case, can you follow the pattern used by Chroma? In short, they just import the remote implementation in the __init__.py and don't have an inline chromadb.py file, as in the sketch below.
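For reference, a minimal sketch of that Chroma-style pattern, where the inline `__init__.py` simply delegates to the remote adapter; the exact import paths, constructor arguments, and `initialize()` call are assumptions for illustration, not the actual implementation:

```python
# inline/vector_io/milvus/__init__.py -- sketch of the Chroma-style delegation pattern.
from typing import Dict

from llama_stack.providers.datatypes import Api, ProviderSpec  # assumed import path

from .config import MilvusVectorIOConfig  # assumed config module


async def get_adapter_impl(config: MilvusVectorIOConfig, deps: Dict[Api, ProviderSpec]):
    # Reuse the remote implementation instead of keeping a separate inline milvus.py.
    from llama_stack.providers.remote.vector_io.milvus.milvus import MilvusVectorIOAdapter

    impl = MilvusVectorIOAdapter(config, deps.get(Api.inference))  # assumed constructor signature
    await impl.initialize()
    return impl
```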
data = []
for i, (chunk, embedding) in enumerate(zip(chunks, embeddings, strict=False)):
    chunk_id = f"{chunk.metadata['document_id']}:chunk-{i}"
It's probably better to generate the chunk ID from the text and the document ID, in case the order of insertion changes. See this function: https://github.com/meta-llama/llama-stack/blob/main/llama_stack/providers/inline/vector_io/sqlite_vec/sqlite_vec.py#L234
Maybe that can be moved to a utils folder.
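A deterministic helper along those lines might look like this; a sketch assuming the same hash-of-document-ID-and-text approach as the referenced sqlite_vec function, not a verbatim copy of it:

```python
import hashlib
import uuid


def generate_chunk_id(document_id: str, chunk_text: str) -> str:
    # Derive a stable chunk ID from the document ID and chunk text,
    # so the ID does not depend on insertion order.
    hash_input = f"{document_id}:{chunk_text}".encode("utf-8")
    return str(uuid.UUID(hashlib.md5(hash_input).hexdigest()))
```

Usage would then be `chunk_id = generate_chunk_id(chunk.metadata["document_id"], chunk.content)` instead of the index-based f-string above.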
"chunk_content": chunk.model_dump(), | ||
} | ||
) | ||
self.client.insert( |
Please add some error handling and log the collection name if the insertion fails.
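For example, a minimal sketch of wrapping the insert call, assuming a module-level `logger` is available in the adapter:

```python
try:
    self.client.insert(self.collection_name, data=data)
except Exception as e:
    # Surface which collection failed so the error can be debugged.
    logger.error(f"Error inserting chunks into Milvus collection {self.collection_name}: {e}")
    raise
```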
Thank you for this! Very excited to see Milvus. I left some initial feedback; could you make some changes?
Also, could you update your PR description with the output of your test run?
f"Chunk length {len(chunks)} does not match embedding length {len(embeddings)}" | ||
) | ||
if not self.client.has_collection(self.collection_name): | ||
self.client.create_collection(self.collection_name, dimension=len(embeddings[0]), auto_id=True) |
I know bounded staleness is the OOTB consistency configuration. It may be useful to make that explicit. Alternatively, I could see strong consistency being a useful default.
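For instance, a sketch of making it explicit at collection creation, assuming pymilvus's `MilvusClient.create_collection` accepts a `consistency_level` argument ("Bounded" is Milvus's out-of-the-box default; "Strong" is the stricter alternative):

```python
if not self.client.has_collection(self.collection_name):
    self.client.create_collection(
        self.collection_name,
        dimension=len(embeddings[0]),
        auto_id=True,
        consistency_level="Strong",  # or "Bounded" to make the default explicit
    )
```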
Sure, I will fix them soon.
Signed-off-by: ChengZi <[email protected]>
@franciscojavierarceo I have updated this PR to address all the problems you mentioned; please check again, thanks.
I have updated the PR description and uploaded my end-to-end test logs to it; please check.
What does this PR do?
feat: add Milvus vectorDB
note: I use MilvusClient to implement it instead of AsyncMilvusClient, because when I tested AsyncMilvusClient it raised event-loop issues; I think the AsyncMilvusClient SDK is not yet robust enough to be compatible with the llama_stack framework.
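For context, one common way to use the synchronous client from async adapter code without blocking the event loop is to offload calls to a worker thread; a minimal sketch, where the adapter shape and `insert_rows` method name are assumptions for illustration:

```python
import asyncio

from pymilvus import MilvusClient


class MilvusVectorIOAdapter:
    def __init__(self, uri: str, collection_name: str) -> None:
        self.client = MilvusClient(uri=uri)
        self.collection_name = collection_name

    async def insert_rows(self, data: list[dict]) -> None:
        # Run the blocking pymilvus call in a worker thread so the event loop stays free.
        await asyncio.to_thread(self.client.insert, self.collection_name, data)
```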
Test Plan
Passed the unit tests and end-to-end tests.
Here are my end-to-end test logs, including the client code, client log, and server logs from both inline and remote settings:
test_end2end_logs.zip