Bug with number of layers for scan_layers=False in deepseek #1301
Comments
Thanks for spotting the issue! Yes, you are right. Could you help draft a PR with the one-line fix? Thank you!
I have submitted a PR (changed file: models.py).
Cool! Just one minor comment: once you fill in the PR description, we should be good to go. For a small change like this, a quick local run would be ideal, but the unit tests and compile tests in the GitHub runners should be sufficient.
We will track this change in #1337.
Hi,
I tried to run a proxy model for the DeepSeek V3 model with scan_layers=False, and it runs only two layers.
I think the layer-construction logic in MaxText/models.py has an error: an inner loop is missing.
I think there should be a second for loop, nested just inside the first, that iterates over "range(num_layers[index])".
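To illustrate what I mean, here is a minimal standalone sketch of the loop structure. It is not the actual MaxText code: the function plan_layers, the layer names, and the layer counts are all hypothetical, chosen only to show why a single loop over layer types yields two layers while a nested loop over "range(num_layers[index])" yields the full count.

```python
def plan_layers(layer_names, num_layers, nested=True):
    """Return the (name, position) pairs that would be instantiated.

    Hypothetical helper for illustration only; not part of MaxText.
    """
    plan = []
    for index, name in enumerate(layer_names):
        if nested:
            # Suggested behavior: one entry per layer of this type.
            for layer_idx in range(num_layers[index]):
                plan.append((name, layer_idx))
        else:
            # Reported behavior: one entry per layer type, regardless of count.
            plan.append((name, 0))
    return plan


layer_names = ["dense", "moe"]  # assumed layer types
num_layers = [1, 3]             # illustrative proxy-model sizes

print(len(plan_layers(layer_names, num_layers, nested=False)))  # 2
print(len(plan_layers(layer_names, num_layers, nested=True)))   # 4
```

Without the inner loop, the number of layers built equals the number of layer types (two here), which matches what I see with the proxy model.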