Upstream merge 25 02 10 #418
Merged
gshtras merged 134 commits into main from upstream_merge_25_02_10 on Feb 12, 2025
+24,970 -6,416
Commits
Commits on Feb 3, 2025
Commits on Feb 4, 2025
[Quant] Fix use_mla TypeError and support loading pure-sparsity Compressed Tensors configs (vllm-project#12711)
Commits on Feb 5, 2025
[Bugfix] Fix 'ModuleNotFoundError: No module named 'intel_extension_for_pytorch'' for --tensor-parallel-size more than 1 (vllm-project#12546)
Commits on Feb 6, 2025
Commits on Feb 7, 2025
[MISC][EASY] Break check file names into entry and args in the pre-commit hooks (vllm-project#12880)
[ROCm] [Feature] [Doc] [Dockerfile] [BugFix] Support Per-Token-Activation Per-Channel-Weight FP8 Quantization Inferencing (vllm-project#12501)
Commits on Feb 8, 2025
Commits on Feb 10, 2025
Commits on Feb 11, 2025
Fix initializing GGUF weights for ColumnParallelLinear when using tensor parallel > 1 (vllm-project#13023)