You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Mar 8, 2025. It is now read-only.
Regardless of the kind of deployment I would expect the same output, but in case of monolithic deployment I get
{"id":"9dcede72-7092-4dd9-ac7f-d9ae99c5224e","choices":[{"message":{"role":"assistant","content":"Okay, so I need to figure out the capital of France. Hmm, I'm not entirely sure, but I think it's one of the major cities in France. Let me try to recall. I remember that Paris is a big city there,"},"index":0,"finish_reason":"length"}],"created":1740322046,"model":"deepseek-ai/DeepSeek-R1-Distill-Llama-8B","object":"chat.completion","usage":null,"system_fingerprint":null}
and for disaggregated deployment:
{"id":"85e5cad2-6495-4dd7-b81b-757e4a564c70___decode_hostname_ipp2-0493___decode_kv_rank_1","choices":[{"message":{"role":"assistant","content":"\n\n</think>\n\nThe capital of France is Paris."},"index":0,"finish_reason":"stop"}],"created":1740322505,"model":"deepseek-ai/DeepSeek-R1-Distill-Llama-8B","object":"chat.completion","usage":null,"system_fingerprint":null}
The text was updated successfully, but these errors were encountered:
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
I'm runing my deployments like this
Then I run the client
In case of dissagregated serving I replace step 2 with
Regardless of the kind of deployment I would expect the same output, but in case of monolithic deployment I get
and for disaggregated deployment:
The text was updated successfully, but these errors were encountered: