WARNING utils.py L37: For better inference performance, please install exllamav2 kernel #1

Almenon · 2025-02-06T06:33:45Z

This warning appears during AI initialization when running python main.py. Better inference performance would be nice because inference is pretty slow on old hardware.

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel via pip install git+https://github.com/AutoGPTQ/AutoGPTQ.git@b8b4127

The text was updated successfully, but these errors were encountered:

Almenon · 2025-02-06T06:35:37Z

I looked into installing AutoGPTQ, but found out that it is deprecated. It looks like autoround should be using GPTQModel instead. I raised a issue on their Github: intel/auto-round#428

Almenon changed the title ~~WARNING utils.py L37: For better inference performance, please install exllamav2 kernel via pip install git+https://github.com/AutoGPTQ/AutoGPTQ.git@b8b4127~~ WARNING utils.py L37: For better inference performance, please install exllamav2 kernel Feb 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel #1

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel #1

Almenon commented Feb 6, 2025 •

edited

Loading

Almenon commented Feb 6, 2025

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel #1

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel #1

Comments

Almenon commented Feb 6, 2025 • edited Loading

Almenon commented Feb 6, 2025

Almenon commented Feb 6, 2025 •

edited

Loading