Realtime TTS #1187

EtahReign · 2024-10-23T17:30:21Z

The Koboldcpp app is amazing. The only issue I see is the TTS occurs after the text is finished which takes forever. Is there a way to have the TTS occur as the text is being outputted to reduce the delay information being outputted?

LostRuins · 2024-10-24T14:38:24Z

Unfortunately this is not possible at this time, since the TTS can only work on the completed text. Perhaps if you disable streaming it might feel better?

WesleyFister · 2024-10-24T20:32:52Z

Unfortunately this is not possible at this time, since the TTS can only work on the completed text. Perhaps if you disable streaming it might feel better?

You can break the streamed response into sentences and then run the TTS on each sentence, playing it back to the user. In this case you would only have to wait until the first sentence is created. This is what I do in my speech-to-speech project.

EtahReign · 2024-10-24T21:57:59Z

Thank you both for the advice. How do I break the streamed response intosentences?

WesleyFister · 2024-10-25T02:39:47Z

The pseudocode is

from nltk.tokenize import sent_tokenize

def getSentences():
    tokens = streamed_response_from_LLM
    
    currentSentence = 1
    response = ""
    for token in tokens:
        response = response + token
        sentences = sent_tokenize(response)
        if len(sentences) > sentence:
            currentSentence += 1
            yield sentences[sentence - 1]

    yield sentences[sentence - 1] # Yield the final sentence

You would ideally run this in a separate thread and queue the sentences. In another thread use TTS to generate audio from each sentence and queue that. Finally, in yet another thread play each audio file.

EtahReign · 2024-10-25T16:43:05Z

Thank you

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Realtime TTS #1187

Realtime TTS #1187

EtahReign commented Oct 23, 2024

LostRuins commented Oct 24, 2024

WesleyFister commented Oct 24, 2024

EtahReign commented Oct 24, 2024 •

edited

Loading

WesleyFister commented Oct 25, 2024

EtahReign commented Oct 25, 2024

Realtime TTS #1187

Realtime TTS #1187

Comments

EtahReign commented Oct 23, 2024

LostRuins commented Oct 24, 2024

WesleyFister commented Oct 24, 2024

EtahReign commented Oct 24, 2024 • edited Loading

WesleyFister commented Oct 25, 2024

EtahReign commented Oct 25, 2024

EtahReign commented Oct 24, 2024 •

edited

Loading