Framework | Optimization | API Alias |
---|---|---|
PyTorch | Mixed Precision | pytorch_amp |
PyTorch | Channels Last | pytorch_channels_last |
PyTorch | JIT (Just-In-Time) Script/Trace & optimize_for_inference | pytorch_jit_script , pytorch_jit_trace , pytorch_jit_script_ofi , pytorch_jit_trace_ofi |
PyTorch | JIT with TorchDynamo | pytorch_torchdynamo_jit_script , pytorch_torchdynamo_jit_trace , pytorch_torchdynamo_jit_script_ofi , pytorch_torchdynamo_jit_trace_ofi |
PyTorch | Intel Neural Compressor Mixed Precision | pytorch_inc_bf16 |
PyTorch | Intel Neural Compressor INT8 Static Quantization (FX/IPEX) | pytorch_inc_static_quant_fx , pytorch_inc_static_quant_ipex |
PyTorch | Intel Neural Compressor INT8 Dynamic Quantization | pytorch_inc_dynamic_quant |
PyTorch | Intel Extension for PyTorch (FP32, BF16, INT8 Static/Dynamic Quantization) | pytorch_ipex_fp32 , pytorch_ipex_bf16 , pytorch_ipex_int8_static_quant , pytorch_ipex_int8_dynamic_quant |
PyTorch | Alibaba Blade-DISC | pytorch_aliblade |
PyTorch Lightning | Mixed Precision | pytorch_lightning_bf16_cpu |
TensorFlow | Mixed Precision | tensorflow_amp |
Keras | Mixed Precision | keras_amp |
ONNX Runtime | INC Static Quantization (QLinear) | onnx_inc_static_quant_qlinear |