Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiband Architecture #5

Open
Rongjiehuang opened this issue Jul 25, 2021 · 7 comments
Open

Multiband Architecture #5

Rongjiehuang opened this issue Jul 25, 2021 · 7 comments
Labels
help wanted Extra attention is needed

Comments

@Rongjiehuang
Copy link

Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing similar multiband architecture, and I wonder if such phenomenon is the one you mentioned? And do you have some advice or solutions? Thanks.
audio

@xcmyz
Copy link
Owner

xcmyz commented Jul 28, 2021

@xcmyz xcmyz closed this as completed Jul 28, 2021
@Rongjiehuang
Copy link
Author

hi, I try and find that the trick could not solve this problem. Because of the random value of synthesized sound in two synthesis, this minus could be "over". E.g., in some place a clearer segment (0.02, 0.05, 0.06) - a bias (0.05, 0.05, 0.02) = (-0.03, 0, 0.04), which means that the first place gets worse.

@xcmyz
Copy link
Owner

xcmyz commented Jul 30, 2021

hi, I try and find that the trick could not solve this problem. Because of the random value of synthesized sound in two synthesis, this minus could be "over". E.g., in some place a clearer segment (0.02, 0.05, 0.06) - a bias (0.05, 0.05, 0.02) = (-0.03, 0, 0.04), which means that the first place gets worse.

In my case, it can solve the checkerboard artifacts problem. Maybe you can use some low-quality speech to train the model, like aishell3. I combine biaobei data and aishell3 in the training data, this problem can be solved. Besides, you can try u-law algorithm in different band and make normalization in different band to fix the problem.

@xcmyz xcmyz reopened this Jul 30, 2021
@xcmyz xcmyz added the help wanted Extra attention is needed label Jul 30, 2021
@RuqiaoLiu
Copy link

Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing similar multiband architecture, and I wonder if such phenomenon is the one you mentioned? And do you have some advice or solutions? Thanks. audio

Hi, I also have encountered with the straight line at a specific frequency when developing similar multiband architecture.for example multiband Mel-Gan.Do you have the trick to solve now?

@Rongjiehuang
Copy link
Author

Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing similar multiband architecture, and I wonder if such phenomenon is the one you mentioned? And do you have some advice or solutions? Thanks. audio

Hi, I also have encountered with the straight line at a specific frequency when developing similar multiband architecture.for example multiband Mel-Gan.Do you have the trick to solve now?

There are three main general approaches for these constant lines:

  1. train for more steps.
  2. add discriminator (work in GAN based waveform generation)
  3. after PQMF, the full band waveforms pass through an additional conv layer.

@ysujiang
Copy link

Is better than hifigan??

@HaiFengZeng
Copy link

@Rongjiehuang Thanks,the last advice works for me!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

5 participants