Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flash attention for GPUs like in maxtext #149

Merged
merged 35 commits into from
Feb 21, 2025

Conversation

ksikiric
Copy link
Contributor

Related to #147

Adding FA support for GPUs using TransformerEngine, same as in maxtext. These changes are added on top of #147, which has been rebased on flux_lora as per #148.

Copy link
Collaborator

@entrpn entrpn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ksikiric left some comments. Overall looks good. Last comment is to remove the training part of the code out of this PR and we can work on it on the Flux training PR.

@entrpn
Copy link
Collaborator

entrpn commented Feb 14, 2025

@ksikiric tested this and looks great. Added a few comments, once resolved this should be ready to merge into main.

Please rebase with main and run ruff --fix and bash code_style.sh.

@entrpn
Copy link
Collaborator

entrpn commented Feb 19, 2025

@ksikiric let me know when you can take a look at the latest comments. This is very close to being ready! :)

@ksikiric
Copy link
Contributor Author

@entrpn I can't see any other comments than the ones I have already marked as resolved, are there any other comments that I am missing?

@entrpn
Copy link
Collaborator

entrpn commented Feb 19, 2025

@ksikiric my bad, forgot to click the button. Take a look and let me know if you see them now.

@ksikiric
Copy link
Contributor Author

ksikiric commented Feb 20, 2025

@entrpn I fixed those comments now!

@ksikiric ksikiric marked this pull request as ready for review February 20, 2025 09:09
@entrpn entrpn merged commit 1c9d4c1 into AI-Hypercomputer:main Feb 21, 2025
1 of 2 checks passed
@entrpn
Copy link
Collaborator

entrpn commented Feb 21, 2025

@entrpn I fixed those comments now!

Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants