First-person shooter, space invader
PyTorch code and models for VJEPA2 self-supervised learning from video
CLIP, Predict the most relevant text snippet given an image
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
PyTorch code and models for V-JEPA self-supervised learning from video
Synchronized Translation for Videos
A collective list of free APIs
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Diffusion Transformer with Fine-Grained Chinese Understanding
Context data platform for building observable, self-learning AI agents
Generate Any 3D Scene in Seconds
Implementation of the Surya Foundation Model for Heliophysics
Comprehensive study guide for coding interviews
Mini website for testing general CS knowledge and enforcing coding
Generate high-definition story short videos with one click using AI
SOTA discrete acoustic codec models with 40/75 tokens per second
Language modeling in a sentence representation space
Large Multimodal Models for Video Understanding and Editing
AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Deletes junk files to free disk space and improve privacy
Generate 3D objects conditioned on text or images
General Mission Analysis Tool
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Software for molecular simulations and trajectory analysis
Code release for "Detecting Twenty-thousand Classes