Skip to content

Commit

Permalink
enable rocm path
Browse files Browse the repository at this point in the history
  • Loading branch information
petrex committed Jan 6, 2025
1 parent 0b21555 commit f23b194
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion torchao/csrc/cuda/sparse_marlin/marlin_kernel_nm.cu
Original file line number Diff line number Diff line change
Expand Up @@ -52,7 +52,7 @@ static constexpr int min_thread_n = 128;
static constexpr int tile_size = 16;
static constexpr int max_par = 64;

#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 800
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 800 && !defined(USE_ROCM)

template <const int num_bits, // weight bits
const int threads, // number of threads in a threadblock
Expand Down

0 comments on commit f23b194

Please sign in to comment.