Implement `ConvInteger` operator #566

robertknight · 2025-01-31T09:18:42Z

Implement the ConvInteger operator.

This follows the implementation of Conv for f32, with the addition of zero point inputs and the removal of the bias input.

In the process fix handling of padding elements in int8 im2col packing when conversion from i8 -> u8 is enabled.

TODO:

Implement depthwise convolution
Decide which int8 formats to support and implement necessary conversions. This PR currently implements input=i8, weights=u8 because that conveniently maps to the already supported u8 x i8 -> i32 GEMM support, but this is not a super useful combination. ORT only supports input=u8, weights=u8 and there are issues open to support input=u8, weights=i8.
Benchmark
Decide what to do on x64 systems if ConvInteger weights are not in the safe range (i7/u7). Validate weights at load time? (deferred to Improve handling of weights with a non-reduced range in ConvInteger #574)

Implement the ConvInteger operator for all combinations of int8 input signed-ness, with the caveat that combinations other than (input=int8, weights=uint8) will result in a shift-conversion of the input and/or weights to `input=int8`, `weights=uint8` which maps to the currently supported GEMM kernels. Depthwise convolution for int8 uses a naive conversion of the f32 depthwise kernel and both have significant room for optimization.

robertknight force-pushed the conv-integer branch 10 times, most recently from 4270f88 to 29a7121 Compare February 3, 2025 08:08

robertknight added 3 commits February 3, 2025 08:47

Support converting and deserializing ConvInteger operators

87298df

Add option to quantize Conv operators in ort-quantize.py

ad266bd

robertknight force-pushed the conv-integer branch from 29a7121 to ad266bd Compare February 3, 2025 08:47

robertknight marked this pull request as ready for review February 3, 2025 08:55

robertknight merged commit e48df48 into main Feb 3, 2025
2 checks passed

robertknight deleted the conv-integer branch February 3, 2025 08:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `ConvInteger` operator #566

Implement `ConvInteger` operator #566

robertknight commented Jan 31, 2025 •

edited

Loading

Implement ConvInteger operator #566

Implement ConvInteger operator #566

Conversation

robertknight commented Jan 31, 2025 • edited Loading

Implement `ConvInteger` operator #566

Implement `ConvInteger` operator #566

robertknight commented Jan 31, 2025 •

edited

Loading