Robust Speech Recognition via Large-Scale Weak Supervision
Data manipulation and transformation for audio signal processing
Industrial-level controllable zero-shot text-to-speech system
A Conversational Speech Generation Model
Implementation of NÜWA, attention network for text to video synthesis
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Facebook AI research's automatic speech recognition toolkit
Open source speech models for Julius in English and other languages.
Beamforming and Speech Recognition Toolkit