You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've built a simple voice agent using the latest Agent SDK and VoicePipeline with the default configurations. However, the voice conversation feels noticeably slow. Can you explain how the speech-to-text (STT), workflow, and text-to-speech (TTS) processes work behind the scenes, and offer suggestions on how to improve the speed or reduce latency in the pipeline?
Our case is via phone line (twilio).
The text was updated successfully, but these errors were encountered:
In the tracing config, the Text to Speech starts after the whole POST request has been made and text has been retrieved, why does it happen if text is streamign and TTS is cofigured to output the partial text?
I've built a simple voice agent using the latest Agent SDK and VoicePipeline with the default configurations. However, the voice conversation feels noticeably slow. Can you explain how the speech-to-text (STT), workflow, and text-to-speech (TTS) processes work behind the scenes, and offer suggestions on how to improve the speed or reduce latency in the pipeline?
Our case is via phone line (twilio).
The text was updated successfully, but these errors were encountered: