Skip to content
This repository has been archived by the owner on Feb 15, 2025. It is now read-only.

feat(vllm)!: upgrade vllm backend and refactor deployment#854

Merged
justinthelaw merged 350 commits intomainfrom 835-upgrade-vllm-for-gptq-bfloat16-inferencingOct 3, 2024

Commits

This pull request is big! We're only showing the most recent 250 commits

Commits on Sep 16, 2024

Commits on Sep 17, 2024

Commits on Sep 18, 2024

Commits on Sep 20, 2024

Commits on Sep 23, 2024

Commits on Sep 25, 2024

Commits on Sep 27, 2024

Commits on Oct 1, 2024

Commits on Oct 2, 2024