sleepcoo
🤒
On the way
  • beijing
  • 06:34 (UTC +08:00)


A PyTorch-native library for training speculative decoding models.

Python · 32 stars · 3 forks · Updated Mar 11, 2026

Draft-Target Disaggregation LLM Serving System via Parallel Speculative Decoding.

Python · 179 stars · 26 forks · Updated Mar 6, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python · 729 stars · 180 forks · Updated Mar 14, 2026

Train your Agent model via our easy and efficient framework.

Python · 1,718 stars · 161 forks · Updated Dec 5, 2025

My learning notes for ML SYS.

Python · 5,667 stars · 368 forks · Updated Mar 2, 2026

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ · 905 stars · 102 forks · Updated Mar 11, 2026

GPU operators for sparse tensor operations.

Python · 35 stars · 1 fork · Updated Mar 11, 2024

An easy-to-use package for implementing SmoothQuant for LLMs.

Python · 111 stars · 10 forks · Updated Apr 7, 2025