Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pytorch 2.5 & torchtune 0.3+ #315

Open
wants to merge 38 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
38 commits
Select commit Hold shift + click to select a range
0fa1419
Started instrumeting recipes from newer torchtune for milabench
rkarhila-amd Nov 12, 2024
3d83577
This specific torchtune version requires a roundabout way of importin…
rkarhila-amd Nov 12, 2024
37b35f7
Updated recipes and configs
rkarhila-amd Nov 14, 2024
87b987f
file left out from previous commit + conf typo fix
rkarhila-amd Nov 15, 2024
b6cf6be
Merge branch 'master' of github.com:mila-iqia/milabench into pytorch2.5
Nov 22, 2024
d47751c
Update dockerfile
Jan 14, 2025
5020932
-
Jan 15, 2025
7936166
Delete benchmarks/geo_gnn/bad.txt
Delaunay Jan 15, 2025
40ff390
Update base.yaml
Delaunay Jan 15, 2025
feb9cca
update dependencies to torch 2.5
Jan 15, 2025
35cdcfa
Add shared setup
Jan 15, 2025
2b2bcb2
Merge branch 'docker' of github.com:mila-iqia/milabench into docker
Jan 15, 2025
a0293eb
Update torchtune and pytorch
Jan 16, 2025
1340e16
Merge branch 'docker' of github.com:mila-iqia/milabench into staging
Jan 16, 2025
776e3e1
Update LLM benchmarks
Jan 16, 2025
df7d8a1
use python 3.10
Jan 17, 2025
cf751f7
Add utility to help launch milabench with docker
Jan 17, 2025
d73af7d
Make torchrun use docker in multinode
Jan 17, 2025
40c35bd
Add docker to ForeachNode
Jan 17, 2025
3f860c7
Add documentation for docker + multinode
Jan 17, 2025
68cc940
Disable GPU warden on prepare
Jan 21, 2025
b2e4cc2
Maximise build space
Jan 21, 2025
1c538b0
Add missing dependencies
Jan 21, 2025
f035e5b
Increase root system size
Jan 21, 2025
0710fff
Add to avoid flooding journald
Jan 21, 2025
8cad4a2
Fix dataset path for vjepa
Jan 22, 2025
a784485
Update llm-lora-ddp-gpus
Jan 22, 2025
684e894
Update llm-lora-ddp-gpus
Jan 22, 2025
d02a574
Update llm-lora-ddp-gpus
Jan 22, 2025
2091a16
Fix rlhf-gpus
Jan 22, 2025
f67e5de
Update llava model
Jan 22, 2025
a1a9a06
Fix llm-lora-ddp-gpus
Jan 22, 2025
612a8c8
Fix llm-lora-single
Jan 22, 2025
130a131
Update llm-full-mp-gpus
Jan 22, 2025
5b4fe16
Remove dataset.pack
Jan 22, 2025
ad2f3e3
update batch resizing logic
Jan 23, 2025
316fdfa
Remove the process monitor grom the GPU monitor
Jan 23, 2025
60843ba
Add channel last to resnet50
Jan 23, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 9 additions & 0 deletions .github/workflows/docker.yml
Original file line number Diff line number Diff line change
Expand Up @@ -38,6 +38,15 @@ jobs:
packages: write

steps:
- uses: easimon/maximize-build-space@master
with:
remove-dotnet: 'true'
remove-codeql: 'true'
remove-haskell: 'true'
remove-android: 'true'
build-mount-path: /home/runner/work/milabench/
root-reserve-mb: 30000

- name: Show all images
run: |
docker image ls
Expand Down
338 changes: 0 additions & 338 deletions .pin/constraints-cuda-gnn.txt

This file was deleted.

Loading
Loading