Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

python: ../../../lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp:36: int mlir::triton::gpu::(anonymous namespace)::getMMAVersionSafe(int, DotOp): Assertion `false && "computeCapability not supported"' failed. #6087

Open
ruralharry opened this issue Mar 3, 2025 · 2 comments
Labels

Comments

@ruralharry
Copy link

Describe the bug

我在使用5080的运行Ktransformers的时候,出现这种报错

python: ../../../lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp:36: int mlir::triton::gpu::(anonymous namespace)::getMMAVersionSafe(int, DotOp): Assertion `false && "computeCapability not supported"' failed.

经查验,问题出现在这里
// Get the highest version supported for the hardware and the dot.
static int getMMAVersionSafe(int computeCapability, DotOp op) {
// List supported mma version in order of preference.
SmallVector versionsSupported;
if (computeCapability < 75) {
versionsSupported = {1};
} else if (computeCapability < 90) {
versionsSupported = {2};
} else if (computeCapability < 100) {
versionsSupported = {3, 2};
} else if (computeCapability < 110) {
versionsSupported = {5, 2};
} else {
assert(false && "computeCapability not supported");
}
for (int baseVersion : versionsSupported) {
if (supportMMA(op, baseVersion))
return baseVersion;
if (baseVersion == 3)
op.emitRemark() << "Warning: can't use MMA V3 for the dot op";
}
return 0;
}

能否更改computeCapability < 110
改为computeCapability 支持120以及最新的显卡能力
谢谢!

Environment details

triton版本: 3.2.0
GPU:Nvidia RTX 5080
Python版本:3.11.11
PyTorch版本: 2.7.0.dev20250302+cu128, CUDA可用: True
Ktransformers版本: 0.2.2rc1

@ruralharry ruralharry added the bug label Mar 3, 2025
@Jokeren
Copy link
Contributor

Jokeren commented Mar 3, 2025

能否更改computeCapability < 110
改为computeCapability 支持120以及最新的显卡能力
谢谢!

This is not correct, RTX 5080 doesn't support wgmma and tcgen. The correct implementation would be fall back to versionsSupported = {2}

@Jokeren
Copy link
Contributor

Jokeren commented Mar 3, 2025

Also I would recommend you to translate the issue in English

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants