Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ROCm Sparse Marlin Kernels #1206
base: main
Are you sure you want to change the base?
ROCm Sparse Marlin Kernels #1206
Changes from 25 commits
6d92e40
14b3fce
f1a22cf
0bef6ca
893ae03
a0d3788
e4e654d
3e2c6a1
c86880e
91d3c75
38b7d1c
279f4b3
612ad14
bbf5a72
735570e
253c188
a2f1736
f817edf
c9bc1bc
16feff4
0b21555
f23b194
ecc3927
a80730b
d2c7ce4
15974c7
a4e8c30
c678cb0
08d1cfb
b96196b
aea9d81
f18043d
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we gate on a specific ROCm version like we do for CUDA?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point! What we need is a GPU arch check instead of ROCm version check. I have added a GPU architecture check in the
setup.py
. As a result, the kernel will now only be built for the MI300X architecture.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sounds good, I think
setup.py
was recently updated by #1490, so you may have to pull in the new changes.