-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
More flexible matmul test #476
Conversation
// Scalar reference implementation of quantized matmul using the zero-point
// decomposition:
//   out = sum_k (lhs - lhs_zero_offset) * (rhs - rhs_zero_offset)
//       = lhs*rhs
//       - rhs_zero_offset * sum_rows(lhs)
//       - lhs_zero_offset * sum_cols(rhs)
//       + k * lhs_zero_offset * rhs_zero_offset
// `batch_lhs`/`batch_rhs`/`batch_out` are the flat offsets of the current
// batch inside each buffer; `m x k` * `k x n` -> `m x n`, all row-major.

// Step 1: accumulate the raw integer matmul of the quantized values.
for row in 0..m {
    for col in 0..n {
        for middle in 0..k {
            let lhs_index = row * k + middle;
            let rhs_index = middle * n + col;
            let out_index = row * n + col;

            // Widen before multiplying to avoid overflow in the product.
            // NOTE(review): assumes the quantized elements are u8, so the
            // product fits in u16 (255*255 = 65025) — TODO confirm dtype.
            let l = lhs[batch_lhs + lhs_index] as u16;
            let r = rhs[batch_rhs + rhs_index] as u16;
            let prod = l * r;

            out[batch_out + out_index] += prod as i32;
        }
    }
}

// Step 2: subtract rhs_zero_offset * sum_rows(lhs). Each row-sum is
// computed once and reused for every column of that output row.
for row in 0..m {
    let mut sum = 0;
    for col in 0..k {
        sum += lhs[batch_lhs + row * k + col] as i32;
    }
    sum *= rhs_zero_offset;
    for col in 0..n {
        out[batch_out + row * n + col] -= sum;
    }
}

// Step 3: subtract lhs_zero_offset * sum_cols(rhs). Each column-sum is
// computed once and reused for every row of that output column.
for col in 0..n {
    let mut sum = 0;
    for row in 0..k {
        // BUG FIX: this loop reads `rhs`, so it must use the rhs batch
        // offset. The original used `batch_lhs` here, which is wrong
        // whenever the lhs and rhs batch offsets differ.
        sum += rhs[batch_rhs + row * n + col] as i32;
    }
    sum *= lhs_zero_offset;
    for row in 0..m {
        out[batch_out + row * n + col] -= sum;
    }
}

// Step 4: add the constant correction term
// k * lhs_zero_offset * rhs_zero_offset to every output element.
for row in 0..m {
    for col in 0..n {
        out[batch_out + row * n + col] += (k as i32) * lhs_zero_offset * rhs_zero_offset;
    }
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here you can see the flow for the quantized matmul.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another way to test it is to dequantize a tensor, perform the matmul, requantize the result, and check that it matches the output of the quantized matmul.
Prepare the matmul tests to be ready for quantization. However, this currently skips all tests related to quantization, as it is not implemented yet. The strategy will be to partially enable quantization tests during development using this method:
https://github.com/tracel-ai/cubecl/blob/c1c333c56461aeea2bba6aac9f72c897ad725d01/crates/cubecl-linalg/src/matmul/tests/test_utils.rs#L148C1-L152C6