Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel #1

Open
Almenon opened this issue Feb 6, 2025 · 1 comment

Comments

@Almenon
Copy link
Owner

Almenon commented Feb 6, 2025

This warning appears during AI initialization when running python main.py. Better inference performance would be nice because inference is pretty slow on old hardware.

WARNING utils.py L37: For better inference performance, please install exllamav2 kernel via pip install git+https://github.com/AutoGPTQ/AutoGPTQ.git@b8b4127

@Almenon Almenon changed the title WARNING utils.py L37: For better inference performance, please install exllamav2 kernel via pip install git+https://github.com/AutoGPTQ/AutoGPTQ.git@b8b4127 WARNING utils.py L37: For better inference performance, please install exllamav2 kernel Feb 6, 2025
@Almenon
Copy link
Owner Author

Almenon commented Feb 6, 2025

I looked into installing AutoGPTQ, but found out that it is deprecated. It looks like autoround should be using GPTQModel instead. I raised a issue on their Github: intel/auto-round#428

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant