Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When infer() with multiple text, all audio are the same file size (当使用多个文本进行 infer() 时,所有音频的文件大小相同_ #813

Open
ericfarng opened this issue Nov 2, 2024 · 1 comment
Labels
algorithm Algorithm improvements & issues documentation Improvements or additions to documentation help wanted Extra attention is needed

Comments

@ericfarng
Copy link

ericfarng commented Nov 2, 2024

When calling infer() with multiple text, even if the text are very different length, then all the audio files are the same length
I am on Windows, using the demo code provided.

google translate:
当使用多个文本调用 infer() 时,即使文本的长度差别很大,所有音频文件的长度也相同
我在 Windows 上使用提供的演示代码。

@ericfarng ericfarng changed the title When infer() with multiple text, all audio are the same file size When infer() with multiple text, all audio are the same file size (当使用多个文本进行 infer() 时,所有音频的文件大小相同_ Nov 2, 2024
@fumiama
Copy link
Member

fumiama commented Nov 3, 2024

The Vocos will decode a batch of audio at the same time, who needs a matrix input, which means that the input length of all audio should match the longest sentence in infer array. User needs to remove the trailing zero values of their output by themself. The program cannot remove it because the stream mode also needs a same output length.

@fumiama fumiama added documentation Improvements or additions to documentation help wanted Extra attention is needed algorithm Algorithm improvements & issues labels Nov 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
algorithm Algorithm improvements & issues documentation Improvements or additions to documentation help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants