BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.
Features
- Portable to Unix-like Systems with the G++ compiler and SWIG
- Both C++ and Python interfaces
- Abundant classes and functions for microphone array processing and speech recognition
- Efficient handling for a block of incoming audio samples that makes BTK suitable for real-time prototypes
- Free software
Follow Distant Speech Recognition
Other Useful Business Software
Auth0 for AI Agents now in GA
Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.