-
Notifications
You must be signed in to change notification settings - Fork 12
Electriclizard triton solution #36
base: main
Are you sure you want to change the base?
Conversation
Triton solution
Triton solution
Triton solution
Hi @rsolovev, finally i've decided to avoid from docker-compose back to a single dockerfile, now it builds in ci and metrics raises again, hope to see a test a results on your environment |
78b0ec2
to
bd49505
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hey @electriclizard, thank you for this iteration, unfortunately this didnt work out:
shm-size
is not configurable in k8s [1]. There are some workarounds with mounting an empty volume, but in k8s 1.24 they removed dockershim support completely [2] so this workarounds kinda go against our infrastructure "standards" (not that it is impossible to configure, just "feels unstable" in this form). Also image size is quite large (25GB) -- it failed to build on github runner and I had to build it locally.
I'm sure that with more time we could make this approach feasible and stable, but right now it is inconvenient to follow
hey @rsolovev, thank you for a try anyway, yes image is quite big, thats true. I just wanted to test it, because i am sure and local tests confirm my theory, that it might be impossible to run models on python with the same productivity |
Hi @rsolovev, i can decrease image size to fix bulding on a github runner, but i cannot change infrastructure, may be can i try to add these lines to helm?
Because i've checked it on 1.24 api k8s version and it works |
that sounds great, lets try reducing the image size first, and then I'll tweak the volume confs backstage |
great, i'll push the commit later |
hey @rsolovev i decreased the image size to 20.1gb, by pulling models before the app start, not while building the image, it takes a few minutes before start now |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@electriclizard impressive! I take my words back, no issues with dshm volume from the first try. An absolute peak rps and total requests record (121 rps, 26850 total) -- full stats
p.s added shm mount as a reference
That's great!!! Thank you for a run, it was my latest commit to a challenge)) |
Hi @rsolovev, i've made a new solution, based on nvidia-triton, metrics raise x2 on local tests, can you please explain me how can i correctly deploy an application using docker-compose? Now it starts with docker-compose up
![telegram-cloud-photo-size-2-5201986564564044257-y](https://private-user-images.githubusercontent.com/35398709/244932528-60ad0a50-c8f7-48bf-a301-cc28d00ca541.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkyNzY4NDgsIm5iZiI6MTczOTI3NjU0OCwicGF0aCI6Ii8zNTM5ODcwOS8yNDQ5MzI1MjgtNjBhZDBhNTAtYzhmNy00OGJmLWEzMDEtY2MyOGQwMGNhNTQxLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjExVDEyMjIyOFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWQ2NmNmODdmNzUyYmQyOTMzOGQ2NmI3NGJjYjgzNDkzYzNiZWI3Zjg4ODNmYzc5ZTNkMjcyYzI2NWM1NjI4OGImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.byxvHOq-qOdBZsnL0Ih6SCHZ57pqDdYy62o9eTQ7JdE)