Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
The Torch-MLIR project aims to provide first-class support from the PyTorch ecosystem to the MLIR ecosystem.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal domains, for both inference and training.
IREE's PyTorch Frontend, based on Torch Dynamo.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Llama Chinese community: real-time aggregation of the latest Llama learning materials, building the best open-source ecosystem for Chinese Llama large language models; fully open source and available for commercial use.
A retargetable MLIR-based machine learning compiler and runtime toolkit.
Development repository for the Triton language and compiler
Shared Middle-Layer for Triton Compilation
Development repository for the Triton-Linalg conversion
Fast and memory-efficient exact attention
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
Machine learning compiler based on MLIR for Sophgo TPU.