[Feature]: Add audio out of dataset to audio section in TensorBoard #878
Comments
My alternative is actually already implemented. |
I guess this commit changes the TensorBoard audio from a random sample to the first audio in the dataset for evaluation, but it still compromises the reference, as I said in my comment on the commit page.
A better alternative is to exclude the first sample from the training loader and reserve it exclusively for evaluation. |
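A minimal sketch of that hold-out idea, assuming the dataset is just an ordered list of samples. The function name `split_reference` and the file names are hypothetical and are not taken from Applio's code:

```python
from typing import List, Sequence, Tuple


def split_reference(samples: Sequence[str]) -> Tuple[str, List[str]]:
    """Reserve the first sample for evaluation and train on the rest."""
    if len(samples) < 2:
        raise ValueError("need at least two samples to hold one out")
    reference = samples[0]          # never seen by the training loop
    training = list(samples[1:])    # everything else is used for training
    return reference, training


# Hypothetical file names, for illustration only.
dataset = ["0_0.wav", "0_1.wav", "0_2.wav"]
reference, training = split_reference(dataset)
print(reference, training)
```

Because the reference never reaches the training loader, the audio logged for evaluation is not something the model has already been fitted to.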
Look for these comments in rvc/train/train.py.
Would I have any issues enabling it in Applio 3.2.7? |
How to create your own reference:
|
Many thanks, love it! You can close it if you wish. |
That's exactly what I was trying to do.
Any idea what I might be doing wrong? 🤔 |
Don't train on those small references. Use a wav file, two f0 files, and a feature file as references instead. |
Could you elaborate, please? |
To make reference files, you just need to run preprocess and extract features, then use the files generated by those steps to replace the references in the logs/reference folder. |
Hmm... okay, I kinda expected that. There's some alignment between pitch and phoneme tensors that needs to be made and it is quite annoying for random sample sizes |
Is it possible to fix this issue? Or should I accept that training won't be possible with version 3.2.7? |
You can disable the custom reference and fall back to the original 3.2.6 method of picking a random sample from the training set. Or you can try a reference audio of a different size. What I had included with 3.2.7 was this:
G:\ApplioV3.2.7\logs\reference>python
The feature gets expanded 2x (153 -> 306) so that they match each other in size. |
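A rough sketch of the size matching described above, assuming the expansion is a plain frame repeat along the time axis followed by trimming to a common length. The function name `expand_features`, the feature dimension 768, and the zero-filled arrays are illustrative assumptions, not values read from the actual reference files:

```python
import numpy as np


def expand_features(phone: np.ndarray, pitch: np.ndarray):
    """Repeat each feature frame twice, then trim both arrays to a common length."""
    phone = np.repeat(phone, 2, axis=0)        # (T, D) -> (2T, D)
    n = min(phone.shape[0], pitch.shape[0])    # guard against off-by-a-few mismatches
    return phone[:n], pitch[:n]


phone = np.zeros((153, 768), dtype=np.float32)  # 768 is an assumed feature size
pitch = np.zeros(306, dtype=np.float32)         # one f0 value per expanded frame
phone2, pitch2 = expand_features(phone, pitch)
print(phone2.shape, pitch2.shape)
```

If a custom reference has a pitch length that is far from twice the feature length, the trimming step hides the mismatch rather than fixing it, so it is worth printing both shapes before training.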
On my side, I get this:
|
Why is your reference wav so big? (100258259,) is more than 30 minutes. I said to use a 5-10 second sample at most. |
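A quick way to sanity-check a reference before using it is to convert the sample count into a duration. This is only a sketch: the helper names are hypothetical, and the sample rates below are assumed candidates, since the thread does not state which rate the file uses:

```python
def duration_seconds(num_samples: int, sample_rate: int) -> float:
    """Length of a mono waveform in seconds."""
    return num_samples / sample_rate


def is_reasonable_reference(num_samples: int, sample_rate: int,
                            max_seconds: float = 10.0) -> bool:
    """True if the clip is within the suggested 5-10 second upper bound."""
    return duration_seconds(num_samples, sample_rate) <= max_seconds


num_samples = 100258259  # shape reported in the thread: (100258259,)
for sample_rate in (32000, 40000, 48000):  # assumed candidate rates
    minutes = duration_seconds(num_samples, sample_rate) / 60
    ok = is_reasonable_reference(num_samples, sample_rate)
    print(f"{sample_rate} Hz: {minutes:.1f} min, usable as reference: {ok}")
```

At any of these rates the array is well over 30 minutes long, which is consistent with the comment above.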
File error when replacing.. 😉😂
|
Description
During training, the script picks one audio sample from the dataset to log to TensorBoard each epoch. But using audio with the same features the model is trained on makes it hard to judge whether training is going well. I can still tell from the loss graph when the model starts to overfit, but hearing audio from outside the dataset would help me decide when to stop, especially when I can't train for long and the quality is already acceptable.
Problem
Already covered in the description.
Proposed Solution
Add an option to the CLI script for picking an audio file, something like --tensorboard-audio "/path/to/audio/file". The GUI could simply add a Gradio element for picking the audio.
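A minimal sketch of how the proposed flag could be parsed, using Python's standard argparse. The flag name comes from this proposal; it is not an existing Applio option, and `build_parser` and `resolve_reference` are hypothetical helpers:

```python
import argparse
from pathlib import Path
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Training options (illustrative only)")
    parser.add_argument(
        "--tensorboard-audio",
        type=Path,
        default=None,
        help="Audio file outside the dataset to log to TensorBoard each epoch",
    )
    return parser


def resolve_reference(args: argparse.Namespace) -> Optional[Path]:
    """Return the chosen file, or None to keep the current dataset-based behaviour."""
    path = args.tensorboard_audio
    if path is None:
        return None
    if not path.is_file():
        raise FileNotFoundError(f"reference audio not found: {path}")
    return path


args = build_parser().parse_args([])   # no flag given
print(resolve_reference(args))         # falls back to the existing behaviour
```

Leaving the default as None keeps existing training commands working unchanged when the flag is not passed.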
Alternatives Considered
Not exactly an alternative, but it would be awesome to have an auto-stop that ends training when the values stop changing within a given range, like --auto-stop 10.
That would stop training if the model does not improve over the next 10 epochs; if it does improve, the count resets.
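The auto-stop behaviour described above is essentially a patience counter. A small sketch, assuming "better" means a lower loss value. The class name `AutoStop`, the `min_delta` parameter, and the loss values are made up for illustration:

```python
class AutoStop:
    """Stop when the monitored loss has not improved for `patience` epochs in a row."""

    def __init__(self, patience: int, min_delta: float = 0.0):
        self.patience = patience
        self.min_delta = min_delta      # smallest change that counts as an improvement
        self.best = float("inf")
        self.stale_epochs = 0

    def update(self, loss: float) -> bool:
        """Record one epoch's loss; return True when training should stop."""
        if loss < self.best - self.min_delta:
            self.best = loss
            self.stale_epochs = 0       # improvement: reset the count
        else:
            self.stale_epochs += 1
        return self.stale_epochs >= self.patience


# Made-up loss values, with a patience of 3 instead of 10 to keep the demo short.
stopper = AutoStop(patience=3)
losses = [1.0, 0.8, 0.81, 0.79, 0.80, 0.80, 0.80, 0.70]
stopped_at = None
for epoch, loss in enumerate(losses, start=1):
    if stopper.update(loss):
        stopped_at = epoch
        break
print("stopped at epoch", stopped_at)
```

With `--auto-stop 10`, the parsed value would map onto `patience=10`. Which metric to monitor is left open here, since the proposal does not name one.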