Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump vllm from 0.6.3.post1 to 0.7.2 in /examples/speed-benchmark #1194

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

dependabot[bot]
Copy link

@dependabot dependabot bot commented on behalf of github Feb 8, 2025

Bumps vllm from 0.6.3.post1 to 0.7.2.

Release notes

Sourced from vllm's releases.

v0.7.2

Highlights

  • Qwen2.5-VL is now supported in vLLM. Please note that it requires a source installation from Hugging Face transformers library at the moment (#12604)
  • Add transformers backend support via --model-impl=transformers. This allows vLLM to be ran with arbitrary Hugging Face text models (#11330, #12785, #12727).
  • Performance enhancement to DeepSeek models.
    • Align KV caches entries to start 256 byte boundaries, yielding 43% throughput enhancement (#12676)
    • Apply torch.compile to fused_moe/grouped_topk, yielding 5% throughput enhancement (#12637)
    • Enable MLA for DeepSeek VL2 (#12729)
    • Enable DeepSeek model on ROCm (#12662)

Core Engine

  • Use VLLM_LOGITS_PROCESSOR_THREADS to speed up structured decoding in high batch size scenarios (#12368)

Security Update

  • Improve hash collision avoidance in prefix caching (#12621)
  • Add SPDX-License-Identifier headers to python source files (#12628)

Other

  • Enable FusedSDPA support for Intel Gaudi (HPU) (#12359)

What's Changed

... (truncated)

Commits
  • 0408efc [Misc] Improve error message for incorrect pynvml (#12809)
  • 449d1bc [Misc] Remove duplicated DeepSeek V2/V3 model definition (#12793)
  • 1a6fcad Improve TransformersModel UX (#12785)
  • 56534cd [Bugfix] Fix the test_ultravox.py's license (#12806)
  • d88506d [Model] LoRA Support for Ultravox model (#11253)
  • 9cdea30 [Misc][Easy] Remove the space from the file name
  • 76abd0c [Bugfix] Better FP8 supported defaults
  • 5b19b93 [ROCm][Kernel] Using the correct warp_size value
  • 75404d0 [VLM] Update compatibility with transformers 4.49
  • bf3b79e [VLM] Qwen2.5-VL
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    You can disable automated security fix PRs for this repo from the Security Alerts page.

Bumps [vllm](https://github.com/vllm-project/vllm) from 0.6.3.post1 to 0.7.2.
- [Release notes](https://github.com/vllm-project/vllm/releases)
- [Commits](vllm-project/vllm@v0.6.3.post1...v0.7.2)

---
updated-dependencies:
- dependency-name: vllm
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added the dependencies Pull requests that update a dependency file label Feb 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dependencies Pull requests that update a dependency file
Projects
None yet
Development

Successfully merging this pull request may close these issues.

0 participants