-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Reverting the TGI image version for LLAMA multiple GPUs in GKE samples (
#1591) * The current image override the HF_HOME to /tmp from /data. Even after changing the mountpath to /tmp there is some regression in the newer TGI image which results into out of GPU memory on L4 and requires atleast A2 node. Rolling back the image version to get the sample working will investigation happen in the background. * Updating the images to GCR which works for these models. * Update ai-ml/llm-multiple-gpus/falcon-40b/text-generation-inference.yaml Co-authored-by: Alvaro Bartolome <[email protected]> * Update ai-ml/llm-multiple-gpus/llama3-70b/text-generation-inference.yaml Co-authored-by: Alvaro Bartolome <[email protected]> --------- Co-authored-by: Mofi Rahman <[email protected]> Co-authored-by: Alvaro Bartolome <[email protected]>
- Loading branch information
1 parent
48a4009
commit 7683cb2
Showing
4 changed files
with
17 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters