Multiband Architecture #5

Rongjiehuang · 2021-07-25T12:57:56Z

Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing similar multiband architecture, and I wonder if such phenomenon is the one you mentioned? And do you have some advice or solutions? Thanks.

xcmyz · 2021-07-28T12:05:10Z

You can refer https://github.com/xcmyz/FastVocoder/blob/main/bin/synthesize.py#L79

Rongjiehuang · 2021-07-28T15:58:47Z

hi, I try and find that the trick could not solve this problem. Because of the random value of synthesized sound in two synthesis, this minus could be "over". E.g., in some place a clearer segment (0.02, 0.05, 0.06) - a bias (0.05, 0.05, 0.02) = (-0.03, 0, 0.04), which means that the first place gets worse.

xcmyz · 2021-07-30T15:53:41Z

hi, I try and find that the trick could not solve this problem. Because of the random value of synthesized sound in two synthesis, this minus could be "over". E.g., in some place a clearer segment (0.02, 0.05, 0.06) - a bias (0.05, 0.05, 0.02) = (-0.03, 0, 0.04), which means that the first place gets worse.

In my case, it can solve the checkerboard artifacts problem. Maybe you can use some low-quality speech to train the model, like aishell3. I combine biaobei data and aishell3 in the training data, this problem can be solved. Besides, you can try u-law algorithm in different band and make normalization in different band to fix the problem.

RuqiaoLiu · 2021-10-15T09:18:11Z

Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing similar multiband architecture, and I wonder if such phenomenon is the one you mentioned? And do you have some advice or solutions? Thanks.

Hi, I also have encountered with the straight line at a specific frequency when developing similar multiband architecture.for example multiband Mel-Gan.Do you have the trick to solve now?

Rongjiehuang · 2021-10-15T11:14:11Z

Hi author, I have found the notes as "the generated audio has interference at a specific frequency" in this repo. I have encountered with the straight line at a specific frequency when developing similar multiband architecture, and I wonder if such phenomenon is the one you mentioned? And do you have some advice or solutions? Thanks.

Hi, I also have encountered with the straight line at a specific frequency when developing similar multiband architecture.for example multiband Mel-Gan.Do you have the trick to solve now?

There are three main general approaches for these constant lines:

train for more steps.
add discriminator (work in GAN based waveform generation)
after PQMF, the full band waveforms pass through an additional conv layer.

ysujiang · 2022-02-22T09:27:53Z

Is better than hifigan??

HaiFengZeng · 2023-02-08T09:58:16Z

@Rongjiehuang Thanks,the last advice works for me!

xcmyz closed this as completed Jul 28, 2021

xcmyz reopened this Jul 30, 2021

xcmyz added the help wanted Extra attention is needed label Jul 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiband Architecture #5

Multiband Architecture #5

Rongjiehuang commented Jul 25, 2021

xcmyz commented Jul 28, 2021

Rongjiehuang commented Jul 28, 2021

xcmyz commented Jul 30, 2021

RuqiaoLiu commented Oct 15, 2021

Rongjiehuang commented Oct 15, 2021

ysujiang commented Feb 22, 2022

HaiFengZeng commented Feb 8, 2023

Multiband Architecture #5

Multiband Architecture #5

Comments

Rongjiehuang commented Jul 25, 2021

xcmyz commented Jul 28, 2021

Rongjiehuang commented Jul 28, 2021

xcmyz commented Jul 30, 2021

RuqiaoLiu commented Oct 15, 2021

Rongjiehuang commented Oct 15, 2021

ysujiang commented Feb 22, 2022

HaiFengZeng commented Feb 8, 2023