Open Source OCR Engine
Face recognition with deep neural networks
Offline speech recognition API for Android, iOS, Raspberry Pi
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
A Lightweight Face Recognition and Facial Attribute Analysis
State-of-the-art 2D and 3D Face Analysis Project
Speech recognition module for Python
Captcha solver extension for humans
Speech-to-text, text-to-speech, and speaker recognition
Port of OpenAI's Whisper model in C/C++
Contexts Optical Compression
kaldi-asr/kaldi is the official location of the Kaldi project
A pure Javascript Multilingual OCR
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OpenVINO™ Toolkit repository
On-device Speech Recognition for Apple Silicon
A PyTorch-based Speech Toolkit
Open Source Computer Vision Library
Multilingual Automatic Speech Recognition with word-level timestamps
Library for OCR-related tasks powered by Deep Learning
Open-Source Python3 tool for recognizing layouts, tables, and math
Interactive video and image annotation tool for computer vision
Build your own AI friend
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition