[Inference PagedAttention] Integrate initial paged attention implementation into maxengine (2/N) #1686
RunTests.yml
on: pull_request
prelim (7s)
gpu_image / Build and upload image (a100-40gb-4) (43s)
gpu_unit_tests / run (4m 11s)
gpu_integration_tests / run (7m 30s)
tpu_unit_tests / run (20m 42s)
tpu_integration_tests / run (7m 11s)