Upstream LLVM SLP vectorizer change requires the correct triple #5930

Open
pclove1 opened this issue Feb 14, 2025 · 0 comments

pclove1 commented Feb 14, 2025

Describe the issue

We recently encountered a performance regression in one of our Triton kernels and traced it to an upstream change in the LLVM SLP vectorization pass (llvm/llvm-project@88e7b8b / llvm/llvm-project#125725).

I filed llvm/llvm-project#127244, and the author of that change (@alexey-bataev) replied that the correct target triple is now required for the code to be vectorized properly.

Digging through the Triton code suggests that Triton deliberately does not pass the target machine to the SLP vectorization pass.

Is there any chance or plan to address the TODO item in the linked code (i.e., pass the NVPTX target)?

Environment details

GPU: Nvidia A100
