Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Yielding part of float numbers/prices #2

Open
olgakuak opened this issue Apr 15, 2024 · 1 comment
Open

Yielding part of float numbers/prices #2

olgakuak opened this issue Apr 15, 2024 · 1 comment

Comments

@olgakuak
Copy link

Hello! Using the "quick_yield_single_sentence_fragment" parameter entails the separation of non-integer numbers and prices, which significantly affects the quality of the TTS:
["The price for Pure Leaf Sweet Tea is $3.", "5 for small size and $5 for large size."]

@KoljaB
Copy link
Owner

KoljaB commented Apr 15, 2024

Yes, true. I use the NLTK or stanza tokenizers for the standard "splitting into sentence" logic. These tokenizers can't split in comma as of my knowledge. Therefore quick_yield_single_sentence_fragment uses a very simplified splitting logic only based on detection of single characters.

I am open to any suggestions how to enhance this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants