parker-ai-fang

parker-ai-fang

1 follower · 0 following

Stars

32 stars written in Python

Clear filter

deepseek-ai / DeepSeek-V3

Python 100,939 16,442 Updated Aug 28, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,724 12,337 Updated Jan 3, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 39,636 5,059 Updated Jan 1, 2026

microsoft / qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 35,136 5,460 Updated Dec 30, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,782 2,406 Updated Nov 24, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 24,701 2,234 Updated Dec 23, 2025

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,091 3,925 Updated Jan 3, 2026

recommenders-team / recommenders

Best Practices on Recommendation Systems

Python 21,317 3,278 Updated Jan 3, 2026

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,143 1,937 Updated Dec 29, 2025

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,913 1,359 Updated Oct 6, 2025

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,793 1,029 Updated Dec 4, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,854 583 Updated May 3, 2024

wzhe06 / Ad-papers

Papers on Computational Advertising

Python 4,379 1,192 Updated Feb 9, 2021

RUCAIBox / RecBole

A unified, comprehensive and efficient recommendation library

Python 4,187 711 Updated Feb 24, 2025

princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,629 534 Updated Oct 16, 2024

deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 3,113 559 Updated Apr 15, 2024

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 3,010 222 Updated Dec 22, 2025

wzhe06 / SparrowRecSys

A Deep Learning Recommender System

Python 2,700 867 Updated Jun 2, 2024

safety-research / circuit-tracer

Python 2,518 281 Updated Jan 2, 2026

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 2,402 263 Updated Dec 11, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,227 405 Updated Dec 15, 2025

pytorch-tabular / pytorch_tabular

A standard framework for modelling Deep Learning Models for tabular data

Python 1,617 161 Updated Jan 1, 2026

reczoo / FuxiCTR

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,340 219 Updated Jun 16, 2025

facebookresearch / GENRE

Autoregressive Entity Retrieval

Python 797 102 Updated Jul 6, 2023

ScienceOne-AI / DeepSeek-671B-SFT-Guide

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…

Python 790 94 Updated Mar 13, 2025

shaochenze / calm

Official implementation of "Continuous Autoregressive Language Models"

Python 681 82 Updated Dec 1, 2025

bytedance / HLLM

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 571 77 Updated Aug 26, 2025

karpathy / hn-time-capsule

Analyzing Hacker News discussions from a decade ago in hindsight with LLMs

Python 528 51 Updated Dec 10, 2025

antgroup / Agentar-Scale-SQL

Agentar-Scale-SQL is a novel framework that leverages scalable computation to significantly improve Text-to-SQL performance.

Python 303 30 Updated Dec 16, 2025

yanring / Megatron-MoE-ModelZoo

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 147 29 Updated Dec 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly