Wake up word trigger #10
Comments
Should be quite easy to do for both projects. The easy way is to filter the incoming tokens for emoticons and then trigger the animation. A better, more reliable way would probably be to use a structured output library like instructor and force the LLM to fill out a pydantic field with the desired expression.
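For illustration, here is a minimal sketch of the emoticon-filtering approach. It is not code from either project: the `EMOJI_TO_ANIMATION` mapping, the animation names, and the `on_emotion` callback are all hypothetical placeholders, and it assumes each emoticon is a single Unicode code point that arrives whole inside one token.

```python
from typing import Callable, Iterable, Iterator

# Hypothetical mapping from emoji to animation names; adapt it to your own scenes.
EMOJI_TO_ANIMATION = {
    "\U0001F600": "happy",  # grinning face
    "\U0001F622": "sad",    # crying face
    "\U0001F621": "angry",  # pouting face
}


def filter_emoticons(
    tokens: Iterable[str],
    on_emotion: Callable[[str], None],
) -> Iterator[str]:
    """Yield tokens with known emoji stripped out.

    For every known emoji found, on_emotion is called with the animation name.
    Tokens that become empty after stripping are not yielded.
    """
    for token in tokens:
        kept = []
        for char in token:
            animation = EMOJI_TO_ANIMATION.get(char)
            if animation is not None:
                on_emotion(animation)  # e.g. switch the face scene here
            else:
                kept.append(char)
        text = "".join(kept)
        if text:
            yield text
```

The cleaned tokens can then be passed on to the text-to-speech stage, while `on_emotion` is the place to send whatever command switches the face. Note that this fires as soon as the token arrives from the LLM, which may be earlier than the moment the matching audio is actually played.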
OK, it might be above my skills :) I couldn't find pseudo-code logic that would guarantee the expressive face scene is triggered while the contextual speech is played. Just to be sure I explained myself correctly (re-reading my request, I'm not sure I did):
What I want to do is ask the AI to provide an answer that includes emotion cues, so that we see the face change while it speaks. The difficulty I see is that each cue must trigger the face change at the moment the corresponding audio is played.
Thank you for taking the time to answer. As I understand it, the first would be the gold standard, but I'm afraid it would add latency, since it would be one instruction at a time? Or would it be one big answer containing multiple sentences along with their emotions?
With instructor you could make the LLM send a list of sentence/emotion pairs and stream everything back token by token, so only minimal latency would be added. I've been thinking about an upgrade to my LocalAIVoiceChat project, where I plan to do this with a different voice reference for every emotion.
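A rough sketch of how the sentence/emotion pairs could drive the face in sync with the audio. This is only an assumption about how it might be wired up, not the LocalAIVoiceChat implementation: a plain dataclass stands in for the pydantic model you would hand to instructor, the stream is assumed to deliver one JSON object per line, and `set_face` and `speak` are hypothetical callbacks (`speak` is assumed to block until the sentence has finished playing).

```python
import json
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator


@dataclass
class EmotionalSentence:
    sentence: str
    emotion: str


def parse_pairs(lines: Iterable[str]) -> Iterator[EmotionalSentence]:
    """Parse one JSON object per line as it arrives, skipping blank lines."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        data = json.loads(line)
        yield EmotionalSentence(sentence=data["sentence"], emotion=data["emotion"])


def speak_with_faces(
    pairs: Iterable[EmotionalSentence],
    set_face: Callable[[str], None],
    speak: Callable[[str], None],
) -> None:
    """Switch the face right before each sentence is spoken.

    Because speak() is assumed to block until playback ends, each face
    stays up for exactly as long as its sentence is being played.
    """
    for pair in pairs:
        set_face(pair.emotion)
        speak(pair.sentence)
```

Since `parse_pairs` is a generator, the first sentence can be spoken as soon as its line is complete, without waiting for the rest of the answer. `set_face` is where an OBS scene switch over websockets would go.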
Look here (watch the little clip)
Hi!
One of my goals is to be able to trigger an animation based on a keyword or smiley included in the LLM answer.
That way, I could animate expressive faces in OBS, for instance via websockets, with the script being triggered by the smiley (which is ignored by ElevenLabs).
Not sure if it's the right project - it might be better for linguflex as a module, but is that possible?