archit-spec

😱

accha

37 followers · 246 following

Starred repositories

Ayanami0730 / deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 625 65 Updated Mar 8, 2026

jawond / bird

Bird is a CLI for Twitter, so your agents can tweet.

TypeScript 41 183 Updated Dec 5, 2025

Gen-Verse / Open-AgentRL

An open-source RL (DemyAgent & RLAnything) for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool use, and distributed training.

Python 370 43 Updated Feb 27, 2026

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 2,112 175 Updated Aug 26, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,156 182 Updated Mar 14, 2026

Infatoshi / MegaQwen

Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)

Cuda 82 5 Updated Feb 10, 2026

harbor-framework / harbor

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 999 762 Updated Mar 15, 2026

NousResearch / torchtitan

Forked from pytorch/torchtitan

A PyTorch native library for large model training

Python 19 9 Updated Mar 13, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 4,772 636 Updated Mar 13, 2026

Coding-Solo / godot-mcp

MCP server for interfacing with Godot game engine. Provides tools for launching the editor, running projects, and capturing debug output.

JavaScript 2,367 266 Updated Jan 30, 2026

habx / lib-py-sketchfab

Sketchfab python library & CLI

Python 4 Updated Jul 26, 2021

davidallendj / gdpm

CLI tool to automate managing Godot game engine assets from the command-line.

C++ 7 1 Updated Oct 26, 2024

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 17,393 1,105 Updated Mar 11, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,708 495 Updated Mar 13, 2026

Noumena-Network / nmoe

MoE training for Me and You and maybe other people

Python 373 31 Updated Feb 7, 2026

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,210 662 Updated Mar 15, 2026

microsoft / InferredBugs

InferredBugs: a metadata-rich dataset of bugs and fixes in Java and C# programming languages extracted with the Infer static analyzer

37 6 Updated Nov 30, 2023

deepseek-ai / LPLB

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 499 34 Updated Nov 19, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 2,232 186 Updated Dec 2, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 12,002 1,167 Updated Mar 13, 2026

shawntan / scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Python 269 27 Updated Oct 3, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 1,254 201 Updated Mar 14, 2026

furlat / Abstractions

A Collection of Pydantic Models to Abstract IRL

Python 38 3 Updated Dec 10, 2025

secemp9 / rubrics

A bunch of rubrics I made in different formats and structures for LLM judges and other use cases

13 Updated Sep 22, 2025

humanlayer / 12-factor-agents

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 18,728 1,424 Updated Sep 21, 2025

epfml / schedules-and-scaling

Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"

Python 92 8 Updated Oct 30, 2024

onlook-dev / onlook

The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI

TypeScript 24,894 1,871 Updated Feb 27, 2026

PrimeIntellect-ai / pccl

PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP

C++ 147 8 Updated Sep 12, 2025

SalesforceAIResearch / MCP-Universe

MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents

Python 567 72 Updated Mar 10, 2026

joey00072 / Multi-Head-Latent-Attention-MLA-

Working implementation of DeepSeek MLA

Python 45 5 Updated Jan 8, 2025
