Speech to Text to Speech, sends text as OSC messages
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Robust Speech Recognition via Large-Scale Weak Supervision
Real-time voice interactive digital human
TEN, a voice agent framework to create conversational AI.
Speech-to-text, text-to-speech, and speaker recognition
Build voice-based LLM agents. Modular + open source
In-App assistant SDK to build a multimodal conversational UX for websites
The behavior guidance framework for customer-facing LLM agents
Conversational voice AI agents
Map location picker component for Android
Repo of Qwen2-Audio chat & pretrained large audio language model
A free, open source, and extensible speech-to-text application
Assistant SDK to build a multimodal conversational UX for Android
In-App assistant SDK to build a multimodal conversational UX for iOS
Deploy your private Gemini application for free with one click
Build your own AI friend
Bailing is a voice dialogue robot similar to GPT-4o
Transform your voice in real time with Voxal Voice Changer
Amica is an open source interface for interactive communication
Video translation and dubbing tool powered by LLMs
Toolkit for conversational AI
Telegram Web A, GPL v3
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant