
feat: completing text /chat-completion and /completion tests #1223

Merged: 1 commit merged into main from llama_stack_hzhao_pr on Feb 25, 2025

Conversation

LESSuseLESS (Contributor)

What does this PR do?

The goal is to have a fairly complete set of provider and e2e tests for /chat-completion and /completion. Here is the current list (a sketch of what one such test looks like appears after these lists):

grep -oE "def test_[a-zA-Z_+]*" llama_stack/providers/tests/inference/test_text_inference.py | cut -d' ' -f2
  • test_model_list
  • test_text_completion_non_streaming
  • test_text_completion_streaming
  • test_text_completion_logprobs_non_streaming
  • test_text_completion_logprobs_streaming
  • test_text_completion_structured_output
  • test_text_chat_completion_non_streaming
  • test_text_chat_completion_structured_output
  • test_text_chat_completion_streaming
  • test_text_chat_completion_with_tool_calling
  • test_text_chat_completion_with_tool_calling_streaming
grep -oE "def test_[a-zA-Z_+]*" tests/client-sdk/inference/test_text_inference.py | cut -d' ' -f2
  • test_text_completion_non_streaming
  • test_text_completion_streaming
  • test_text_completion_log_probs_non_streaming
  • test_text_completion_log_probs_streaming
  • test_text_completion_structured_output
  • test_text_chat_completion_non_streaming
  • test_text_chat_completion_streaming
  • test_text_chat_completion_with_tool_calling_and_non_streaming
  • test_text_chat_completion_with_tool_calling_and_streaming
  • test_text_chat_completion_with_tool_choice_required
  • test_text_chat_completion_with_tool_choice_none
  • test_text_chat_completion_structured_output
  • test_text_chat_completion_tool_calling_tools_not_in_request
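
For orientation, a minimal sketch of what one of the client-sdk tests above might look like; the fixture names (client_with_models, text_model_id) appear in this PR's diff, but the prompt, call shape, and assertions here are illustrative assumptions rather than the repository's actual code:

def test_text_completion_non_streaming(client_with_models, text_model_id):
    # Non-streaming /completion request against the model configured by the fixtures.
    response = client_with_models.inference.completion(
        model_id=text_model_id,
        content="Complete the sentence using one word: Roses are red, violets are ",
        stream=False,
    )
    # The completion response carries the generated text in `content`.
    assert isinstance(response.content, str)
    assert len(response.content.strip()) > 0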

Test plan

== Set up Ollama local server

OLLAMA_HOST=127.0.0.1:8321 with-proxy ollama serve
OLLAMA_HOST=127.0.0.1:8321 ollama run llama3.2:3b-instruct-fp16 --keepalive 60m

== Run a provider test

conda activate stack
OLLAMA_URL="http://localhost:8321" \
pytest -v -s -k "ollama" --inference-model="llama3.2:3b-instruct-fp16" \
llama_stack/providers/tests/inference/test_text_inference.py::TestInference

== Run an e2e test

conda activate sherpa
with-proxy pip install llama-stack
export INFERENCE_MODEL=llama3.2:3b-instruct-fp16
export LLAMA_STACK_PORT=8322
with-proxy llama stack build --template ollama
with-proxy llama stack run --env OLLAMA_URL=http://localhost:8321 ollama
conda activate stack
LLAMA_STACK_PORT=8322 LLAMA_STACK_BASE_URL="http://localhost:8322" \
pytest -v -s --inference-model="llama3.2:3b-instruct-fp16" \
tests/client-sdk/inference/test_text_inference.py
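
As a rough sketch of how the e2e run connects, the client-sdk tests build a client against LLAMA_STACK_BASE_URL; something along these lines, where the smoke check at the end is an illustrative assumption rather than part of the actual test suite:

import os

from llama_stack_client import LlamaStackClient

# The stack was started on port 8322 above, so point the client there.
base_url = os.environ.get("LLAMA_STACK_BASE_URL", "http://localhost:8322")
client = LlamaStackClient(base_url=base_url)

# Sanity check: the model exported as INFERENCE_MODEL should be registered.
model_ids = [m.identifier for m in client.models.list()]
assert os.environ["INFERENCE_MODEL"] in model_ids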

facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Feb 23, 2025
LESSuseLESS changed the title on Feb 23, 2025, removing a stray leading quotation mark from "feat: completing text /chat-completion and /completion tests"
ashwinb (Contributor) commented on Feb 24, 2025:

lgtm. cc @hardikjshah @ehhuang to take one look given you have been in the test world lately.

hardikjshah (Contributor) left a review:

left a couple of small comments

On a newly added file (+171 lines, beginning with "{"):

hardikjshah (Contributor): Can you please update the MANIFEST.in to include these files and remove the deleted ones? https://github.com/meta-llama/llama-stack/blob/main/MANIFEST.in
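
For illustration, MANIFEST.in uses setuptools template directives such as include and recursive-include, so the requested change would look roughly like the lines below; the paths are hypothetical placeholders, and "removing the deleted ones" means deleting their corresponding include lines rather than adding new directives:

# Hypothetical entries; the real MANIFEST.in lists the repository's actual data files.
include llama_stack/providers/tests/inference/test_cases.json
recursive-include tests/client-sdk *.json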

LESSuseLESS (Contributor, Author): ah, didn't know this before. fixed.

On this snippet from the diff:

):
# TODO: more dynamic lookup on tool_prompt_format for model family
tool_prompt_format = "json" if "3.1" in text_model_id else "python_list"
hardikjshah (Contributor): Let's just drop this, since we recently added (https://github.com/meta-llama/llama-stack/pull/1214/files) defaults that are inferred from the model id at the API level when not provided.
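
Concretely, after #1214 the test could omit tool_prompt_format and let the API infer it from the model id. A minimal sketch, reusing fixture names visible in this diff; the message and assertion are illustrative assumptions rather than the repository's actual test body:

def test_text_chat_completion_with_tool_calling_and_non_streaming(
    client_with_models, text_model_id, get_weather_tool_definition
):
    # No tool_prompt_format argument: the server picks a default for the model family.
    response = client_with_models.inference.chat_completion(
        model_id=text_model_id,
        messages=[{"role": "user", "content": "What's the weather like in San Francisco?"}],
        tools=[get_weather_tool_definition],
        stream=False,
    )
    # With tool calling, the completion message should carry at least one tool call.
    assert response.completion_message.tool_calls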

LESSuseLESS (Contributor, Author): Let me fix this with a new diff, so that I can keep this one an iso-transformation without any coding logic change.

On this signature change in the diff:

def test_text_chat_completion_with_tool_calling_and_non_streaming(
-    client_with_models, text_model_id, get_weather_tool_definition, provider_tool_format
+    client_with_models, text_model_id, provider_tool_format, test_case
hardikjshah (Contributor): provider_tool_format does not seem to be used

LESSuseLESS (Contributor, Author): Same here; there are several places that need some cleanup, and I'll fix them with one more diff instead of this one.
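
That follow-up cleanup would presumably just drop the unused fixture from the signature, roughly:

def test_text_chat_completion_with_tool_calling_and_non_streaming(
    client_with_models, text_model_id, test_case
):
    ...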

LESSuseLESS merged commit 3a31611 into main on Feb 25, 2025
3 checks passed
LESSuseLESS deleted the llama_stack_hzhao_pr branch on February 25, 2025 at 19:37