Training randomly quit working. #3024
Comments
You can try using Cosine or Cosine with restarts instead of Cosine with min LR.
It sometimes happens to me; I just press start training again, and it starts. But my training runs also stop with the same errors in the middle of training, at completely random times.
Somehow that worked. I don't know why it worked, since it worked fine the last time I used it, but somehow it works now. I don't think I updated anything, so I'm at a loss as to what is going on here. Also, when I say random, I mean it worked fine a month ago, and then up and dies on me today, with nothing being changed in between. This program, as useful as it is, is temperamental at best, and seems to only work when it wants to. At least it wasn't something I overlooked like running activate.bat with admin privileges. 🙄
Do you perhaps have an i9-14900K? They are supposedly a bit flaky, at least at high frequencies, as I found out mine was. I have solved MY problem with weird errors and failing installs: I turned off INTEL CPU BOOST in the BIOS. I will see if I can get Windows' own CPU limiter to work, or if MSI's software can help me limit CPU speed. But I have been running training for almost 2 days now, several runs of 4-5 hours at a time, with no errors.
It's been a while, and training up and quit working for some unknown reason, so I tried a fresh install, and nothing changed. To make matters worse, I tried to delete the default_config.yaml file and reconfigure it through setup.bat, and unlike last time, no dice. Sadly I'm a bit stupid when it comes to this programming stuff, but I gave the logs a look, realized I didn't have SD-Scripts in my folder, and adding it didn't fix it either...nice try though. I'm running short on things that have worked before, so after getting everything back to its pre-configured state, here is the latest log.