vivian-9907

vivian-9907

1 follower · 2 following

Stars

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,934 1,918 Updated Jun 21, 2026

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,526 607 Updated Jun 21, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,488 18,293 Updated Jun 22, 2026

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,862 535 Updated Jun 19, 2026

NVIDIA / Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…

Python 2,958 453 Updated Jun 21, 2026

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 19,494 2,952 Updated Jun 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly