auzxb: 145 stars written in Python

Robust Speech Recognition via Large-Scale Weak Supervision

Python 93,718 11,688 Updated Dec 15, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,526 6,600 Updated Jan 22, 2026

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,263 9,413 Updated Dec 15, 2025

Deepfakes Software For All

Python 54,924 13,433 Updated Jan 5, 2026

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,216 5,215 Updated Jun 27, 2024

TensorFlow code and pre-trained models for BERT

Python 39,833 9,711 Updated Jul 23, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,388 4,771 Updated Jun 2, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 34,046 4,240 Updated Aug 6, 2024

Let us control diffusion models!

Python 33,592 2,999 Updated Feb 25, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 28,445 2,874 Updated Apr 30, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 28,024 2,595 Updated Jan 26, 2026

Generative Models by Stability AI

Python 26,856 3,027 Updated Dec 16, 2025

Fully open reproduction of DeepSeek-R1

Python 25,842 2,411 Updated Nov 24, 2025

Fast and memory-efficient exact attention

Python 21,862 2,316 Updated Jan 25, 2026

Magenta: Music and Art Generation with Machine Intelligence

Python 19,776 3,802 Updated Jan 6, 2026

PyTorch implementations of Generative Adversarial Networks.

Python 17,415 4,101 Updated Jun 18, 2024

Train transformer language models with reinforcement learning.

Python 17,143 2,449 Updated Jan 26, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,626 3,306 Updated Jan 24, 2026

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,369 1,246 Updated Nov 4, 2025

Official implementation of AnimateDiff.

Python 11,996 1,039 Updated Jul 31, 2024

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,898 878 Updated Jul 18, 2024

An open-source NLP research library, built on PyTorch.

Python 11,888 2,237 Updated Nov 22, 2022

A PyTorch-based Speech Toolkit

Python 11,090 1,632 Updated Jan 25, 2026

TensorFlow-based neural network library

Python 9,898 1,302 Updated Jan 14, 2026

End-to-End Speech Processing Toolkit

Python 9,704 2,374 Updated Jan 24, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,670 792 Updated May 27, 2025

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 9,220 1,192 Updated Apr 2, 2024

ImageBind: One Embedding Space to Bind Them All

Python 8,956 841 Updated Nov 21, 2025

VITS2 backbone with multilingual BERT

Python 8,674 1,261 Updated Jan 19, 2026