Skip to content
View parker-ai-fang's full-sized avatar

Block or report parker-ai-fang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
32 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,724 12,337 Updated Jan 3, 2026

The best ChatGPT that $100 can buy.

Python 39,636 5,059 Updated Jan 1, 2026

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 35,136 5,460 Updated Dec 30, 2025

Fully open reproduction of DeepSeek-R1

Python 25,782 2,406 Updated Nov 24, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 24,701 2,234 Updated Dec 23, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,091 3,925 Updated Jan 3, 2026

Best Practices on Recommendation Systems

Python 21,317 3,278 Updated Jan 3, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,143 1,937 Updated Dec 29, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,913 1,359 Updated Oct 6, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,793 1,029 Updated Dec 4, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,854 583 Updated May 3, 2024

Papers on Computational Advertising

Python 4,379 1,192 Updated Feb 9, 2021

A unified, comprehensive and efficient recommendation library

Python 4,187 711 Updated Feb 24, 2025

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,629 534 Updated Oct 16, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 3,113 559 Updated Apr 15, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 3,010 222 Updated Dec 22, 2025

A Deep Learning Recommender System

Python 2,700 867 Updated Jun 2, 2024

Minimalistic large language model 3D-parallelism training

Python 2,402 263 Updated Dec 11, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,227 405 Updated Dec 15, 2025

A standard framework for modelling Deep Learning Models for tabular data

Python 1,617 161 Updated Jan 1, 2026

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,340 219 Updated Jun 16, 2025

Autoregressive Entity Retrieval

Python 797 102 Updated Jul 6, 2023

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…

Python 790 94 Updated Mar 13, 2025

Official implementation of "Continuous Autoregressive Language Models"

Python 681 82 Updated Dec 1, 2025

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 571 77 Updated Aug 26, 2025

Analyzing Hacker News discussions from a decade ago in hindsight with LLMs

Python 528 51 Updated Dec 10, 2025

Agentar-Scale-SQL is a novel framework that leverages scalable computation to significantly improve Text-to-SQL performance.

Python 303 30 Updated Dec 16, 2025

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 147 29 Updated Dec 19, 2025
Next