-
Notifications
You must be signed in to change notification settings - Fork 157
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reduction, missing indices in dst access #1055
Comments
can you give @AWSNB permission |
Added |
Hi! It is correct to use The BIRCodegenLoop error comes from the following loop:
As the error message suggest, the start partition of tensor copy must be multiply of 32. However, we are iterating through the partition dimension of In general, we should not iterate through the partition dimension. Instead, we should copy/load into the partition dimension in batch. |
thanks @aws-qieqingy , would this be the intended way to perform the copy?
|
It is still not quite right. We should also copy the free dimension in batch. In addition, the partition dimension of |
Thank you so much for your time all @aws-zhehongb @aws-qieqingy ,the code executes correctly when using --simulate now. However, dropping the simulation flag produces different results. I've tried to avoid any inferred dependencies by replacing all loops with nl.sequential_range, though this didn't fix the problem. I would greatly appreciate any pointers! (#1051 is similar so moving there) |
how could we reproduce your error? did you shared your private repo ? |
Updated comment in #1051 |
Hello, just saw on #1052 that the intended way to perform reduction is as such:
However performing that change gives a new error:
I am unsure as to what this error means, and also why it is only generated when performing reduction as above, and not as a[...] += b
private repo link:
https://github.com/andxalex/CS149/blob/main/asst4-trainium/part2/conv2d.py
Changing to a[...] += b generates:
The text was updated successfully, but these errors were encountered: