CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,934 1,918 Updated Jun 21, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,526 607 Updated Jun 21, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,488 18,293 Updated Jun 22, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,862 535 Updated Jun 19, 2026

A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…

Python 2,958 453 Updated Jun 21, 2026

Development repository for the Triton language and compiler

MLIR 19,494 2,952 Updated Jun 21, 2026