Skip to content
View huangmaa's full-sized avatar

Block or report huangmaa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tensor library for machine learning

C++ 13,641 1,411 Updated Nov 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 64,269 11,649 Updated Nov 30, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,684 619 Updated Nov 28, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,195 31,275 Updated Nov 29, 2025

IREE's PyTorch Frontend, based on Torch Dynamo.

Python 102 73 Updated Nov 26, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,761 284 Updated Nov 28, 2025

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

Python 14,750 1,302 Updated Apr 6, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,476 802 Updated Nov 30, 2025

Development repository for the Triton language and compiler

MLIR 17,708 2,414 Updated Nov 30, 2025

Shared Middle-Layer for Triton Compilation

MLIR 315 79 Updated Oct 27, 2025

Development repository for the Triton-Linalg conversion

C++ 206 25 Updated Feb 7, 2025

Fast and memory-efficient exact attention

Python 20,807 2,174 Updated Nov 25, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,823 4,026 Updated Nov 30, 2025

An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

C++ 663 219 Updated Nov 28, 2025

Machine learning compiler based on MLIR for Sophgo TPU.

C++ 824 193 Updated Nov 28, 2025