Speeding up Inference with Quantized Models
Can I work on it? And we'd need to use PyTorch's dynamic quantization, right?
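For context, dynamic quantization in PyTorch converts the weights of supported layers (e.g. `nn.Linear`) to int8 and quantizes activations on the fly, which typically speeds up CPU inference. A minimal sketch, using a stand-in model rather than this project's actual network:

```python
# Minimal sketch of PyTorch dynamic quantization for CPU inference;
# the Sequential model here is a stand-in, not this project's network.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))
model.eval()

# Convert Linear weights to int8; activations are quantized on the fly.
quantized_model = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

with torch.no_grad():
    out = quantized_model(torch.randn(1, 128))
print(out.shape)  # torch.Size([1, 10])
```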
Of course! Thanks a lot! You could submit a PR!
PR created: #21
What's the status of this? I'd like to use a quantized model on Windows.
I'm currently adding a feature so that you can pass -q and run a quantized model for inference.
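A hypothetical sketch of how such a `-q` flag could gate quantized inference; the flag name comes from the comment above, but `load_model()` and the CLI wiring are illustrative assumptions, not the repo's actual code:

```python
# Hypothetical sketch of a -q flag gating quantized inference; load_model()
# and the argument wiring are illustrative, not the repo's actual code.
import argparse

import torch
import torch.nn as nn

def load_model() -> nn.Module:
    # Placeholder for the project's real model loader.
    return nn.Sequential(nn.Linear(128, 10)).eval()

def main() -> None:
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "-q", "--quantize", action="store_true",
        help="run inference with a dynamically quantized model",
    )
    args = parser.parse_args()

    model = load_model()
    if args.quantize:
        # Swap in an int8 dynamically quantized copy for faster CPU inference.
        model = torch.quantization.quantize_dynamic(
            model, {nn.Linear}, dtype=torch.qint8
        )

    with torch.no_grad():
        print(model(torch.randn(1, 128)))

if __name__ == "__main__":
    main()
```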