[Feature request] Return timestamps for TTS output #257

Open
AlexLavSi opened this issue Jan 14, 2025 · 2 comments
Labels
enhancement New feature or request

Comments

@AlexLavSi

Hey, would you mind adding a new feature? It would be amazing if we could get timestamps for each sentence in the text, maybe as a JSON file or something similar.

eginhard added the enhancement (New feature or request) label Jan 14, 2025
eginhard changed the title from "[Feature request] add timeline" to "[Feature request] Return timestamps for TTS output" Jan 14, 2025
@eginhard
Member

It's not something we're planning to do ourselves in the near term, but I'd merge PRs adding this. If someone wants to work on it, best to submit a rough plan for feedback here first to agree on a common structure that would work with all Coqui models.

@AlexLavSi
Author

AlexLavSi commented Jan 15, 2025

Thank you for your response.
I hope others will find this feature useful and that it gets implemented soon. I would like to propose the following JSON format (based on my needs):

{
  "duration": 500,
  "timestamps": [
    { "text": "bla bla bla 1.", "start_time": 0.0, "end_time": 3.5 },
    ...
    { "text": "bla bla bla end.", "start_time": 480.7, "end_time": 500 }
  ]
}

The input text can be split into sentences, with one timestamp entry per sentence.
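
For discussion, here is a minimal Python sketch of how the JSON layout proposed above could be assembled. It is not based on any existing Coqui API: `split_sentences` and `build_timestamps` are hypothetical helpers, the splitter is a naive regex, and it assumes the duration of each synthesized sentence is already known (for example, because each sentence was synthesized separately) and that times are in seconds.

```python
import json
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def build_timestamps(sentences, durations):
    """Pair each sentence with cumulative start/end times.

    `durations` holds one assumed per-sentence audio length in seconds.
    """
    if len(sentences) != len(durations):
        raise ValueError("need exactly one duration per sentence")
    entries = []
    cursor = 0.0  # running position in the output audio
    for sentence, seconds in zip(sentences, durations):
        start = cursor
        cursor += seconds
        entries.append(
            {
                "text": sentence,
                "start_time": round(start, 3),
                "end_time": round(cursor, 3),
            }
        )
    return {"duration": round(cursor, 3), "timestamps": entries}


# Example with made-up durations for three sentences.
sentences = split_sentences("Hello there. How are you? Fine!")
result = build_timestamps(sentences, [1.5, 2.0, 0.5])
print(json.dumps(result, indent=2))
```

A real implementation would likely need model-specific alignment information rather than per-sentence durations, and a more robust sentence splitter (abbreviations and decimals break this regex).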
