Skip to content

Actions: ROCm/vllm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
5,337 workflow runs
5,337 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[MFM-2025-02-03] Merge Main to llama fp8; With Faster ROCm Paged Attention
Cleanup PR Body #153: Pull request #399 opened by tjtanaa
February 3, 2025 14:37 15s
February 3, 2025 14:37 15s
Add tuned moe config for qwen1.5_moe_A2.7B
pre-commit #100: Pull request #398 opened by sky0530
February 3, 2025 14:26 4m 34s sky0530:qwen1.5_moe
February 3, 2025 14:26 4m 34s
Add tuned moe config for qwen1.5_moe_A2.7B
Cleanup PR Body #152: Pull request #398 opened by sky0530
February 3, 2025 14:26 22s
February 3, 2025 14:26 22s
Fix quark fp8 format loading.
pre-commit #99: Pull request #395 synchronize by fxmarty-amd
February 3, 2025 08:30 4m 31s fxmarty-amd:fix-quark-fp8
February 3, 2025 08:30 4m 31s
Close inactive issues and PRs
Close inactive issues and PRs #97: Scheduled
February 3, 2025 01:57 16s main
February 3, 2025 01:57 16s
Close inactive issues and PRs
Close inactive issues and PRs #96: Scheduled
February 2, 2025 01:58 18s main
February 2, 2025 01:58 18s
Close inactive issues and PRs
Close inactive issues and PRs #95: Scheduled
February 1, 2025 01:59 16s main
February 1, 2025 01:59 16s
Fp8 header
pre-commit #98: Pull request #396 opened by gshtras
January 31, 2025 17:46 4m 34s fp8_header
January 31, 2025 17:46 4m 34s
Fp8 header
Cleanup PR Body #151: Pull request #396 opened by gshtras
January 31, 2025 17:46 21s
January 31, 2025 17:46 21s
Fix quark fp8 format loading.
pre-commit #97: Pull request #395 opened by fxmarty-amd
January 31, 2025 14:49 4m 35s fxmarty-amd:fix-quark-fp8
January 31, 2025 14:49 4m 35s
Fix quark fp8 format loading.
Cleanup PR Body #150: Pull request #395 opened by fxmarty-amd
January 31, 2025 14:49 20s
January 31, 2025 14:49 20s
Close inactive issues and PRs
Close inactive issues and PRs #94: Scheduled
January 31, 2025 01:56 14s main
January 31, 2025 01:56 14s
Update Dockerfile.rocm
pre-commit #96: Commit 6852819 pushed by gshtras
January 30, 2025 20:53 4m 31s main
January 30, 2025 20:53 4m 31s
Using a more precise profiling on ROCm to properly account for weight…
pre-commit #95: Commit 22141e7 pushed by gshtras
January 30, 2025 19:27 4m 34s main
January 30, 2025 19:27 4m 34s
Improved memory profiling
pre-commit #94: Pull request #394 synchronize by gshtras
January 30, 2025 19:22 4m 50s memory_profiling
January 30, 2025 19:22 4m 50s
Improved memory profiling
pre-commit #93: Pull request #394 opened by gshtras
January 30, 2025 19:22 4m 34s memory_profiling
January 30, 2025 19:22 4m 34s
Improved memory profiling
Cleanup PR Body #149: Pull request #394 opened by gshtras
January 30, 2025 19:22 22s
January 30, 2025 19:22 22s
Faster Custom Paged Attention kernels (#372)
pre-commit #92: Commit 273c949 pushed by gshtras
January 30, 2025 19:21 4m 37s main
January 30, 2025 19:21 4m 37s
Faster Custom Paged Attention kernels
pre-commit #91: Pull request #372 synchronize by gshtras
January 30, 2025 19:16 4m 40s shsanyal_cpa_main_integration
January 30, 2025 19:16 4m 40s
Close inactive issues and PRs
Close inactive issues and PRs #93: Scheduled
January 30, 2025 01:55 16s main
January 30, 2025 01:55 16s
Test queue with 8 gpu
pre-commit #90: Pull request #393 synchronize by dhonnappa-amd
January 29, 2025 20:38 4m 30s test-new-ci-queues
January 29, 2025 20:38 4m 30s
Test queue with 8 gpu
pre-commit #89: Pull request #393 synchronize by dhonnappa-amd
January 29, 2025 19:17 4m 36s test-new-ci-queues
January 29, 2025 19:17 4m 36s
Test queue with 8 gpu
Cleanup PR Body #148: Pull request #393 opened by dhonnappa-amd
January 29, 2025 18:52 24s
January 29, 2025 18:52 24s
Test queue with 8 gpu
pre-commit #88: Pull request #393 opened by dhonnappa-amd
January 29, 2025 18:52 4m 53s test-new-ci-queues
January 29, 2025 18:52 4m 53s
20250127 docs update (#392)
pre-commit #87: Commit 7a292f9 pushed by arakowsk-amd
January 29, 2025 17:20 4m 45s main
January 29, 2025 17:20 4m 45s