-
LeetCUDA Public
Forked from xlite-dev/LeetCUDA📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Cuda GNU General Public License v3.0 UpdatedNov 28, 2025 -
ggml Public
Forked from ggml-org/ggmlTensor library for machine learning
-
stable-diffusion.cpp Public
Forked from leejet/stable-diffusion.cppStable Diffusion in pure C/C++
C++ MIT License UpdatedNov 26, 2025 -
llama.cpp Public
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
C++ MIT License UpdatedNov 19, 2025 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedNov 12, 2025 -
ComfyUI Public
Forked from comfyanonymous/ComfyUIThe most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Python GNU General Public License v3.0 UpdatedNov 12, 2025 -
FBGEMM Public
Forked from pytorch/FBGEMMFB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other UpdatedNov 7, 2025 -
-
ai-performance-engineering Public
Forked from cfregly/ai-performance-engineeringPython Apache License 2.0 UpdatedOct 23, 2025 -
conv_op_optimization Public
Forked from Qwesh157/conv_op_optimizationThis project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.
C++ UpdatedOct 21, 2025 -
-
MatmulTutorial Public
Forked from KnowingNothing/MatmulTutorialA Easy-to-understand TensorOp Matmul Tutorial
C++ Apache License 2.0 UpdatedSep 7, 2025 -
cs_books Public
Forked from AzatAI/cs_booksComputer science books Recommended by AzatAI. (Education ONLY)
Python UpdatedAug 27, 2025 -
-
oneflow Public
Forked from Oneflow-Inc/oneflowOneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
C++ Apache License 2.0 UpdatedAug 20, 2025 -
how-to-optim-algorithm-in-cuda Public
Forked from BBuf/how-to-optim-algorithm-in-cudahow to optimize some algorithm in cuda.
Cuda UpdatedJul 16, 2025 -
openCNN Public
Forked from UDC-GAC/openCNNA Winograd Minimal Filter Implementation in CUDA
Cuda Apache License 2.0 UpdatedJul 15, 2025 -
diffusion-fast Public
Forked from huggingface/diffusion-fastFaster generation with text-to-image diffusion models.
Python Apache License 2.0 UpdatedJun 28, 2025 -
leetcode-doocs Public
Forked from doocs/leetcode😏 LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解
Java Creative Commons Attribution Share Alike 4.0 International UpdatedJun 22, 2025 -
pmpp4 Public
Forked from tugot17/pmppComplete solutions to the Programming Massively Parallel Processors Edition 4
Jupyter Notebook MIT License UpdatedJun 18, 2025 -
twitterxdownload Public
Forked from kevinfsz/twitterxdownloada powerful twitter video downloader and twitter marketing tool.
JavaScript GNU Affero General Public License v3.0 UpdatedJun 13, 2025 -
text-behind-image Public
Forked from RexanWONG/text-behind-imagehttps://textbehindimage.rexanwong.xyz - create text behind image designs easily
TypeScript GNU Affero General Public License v3.0 UpdatedJun 11, 2025 -
good-kernels Public
Forked from ScalingIntelligence/good-kernelsSamples of good AI generated CUDA kernels
Python UpdatedMay 30, 2025 -
HGEMM Public
Forked from xlite-dev/HGEMM⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.
Cuda GNU General Public License v3.0 UpdatedMay 10, 2025 -
WebToEpub Public
Forked from dteviot/WebToEpubA simple Chrome (and Firefox) Extension that converts Web Novels (and other web pages) into an EPUB.
JavaScript Other UpdatedMay 5, 2025 -
JavaScript30 Public
Forked from wesbos/JavaScript3030 Day Vanilla JS Challenge
HTML UpdatedApr 25, 2025 -
-
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License UpdatedMar 13, 2025 -
twitter-web-exporter Public
Forked from prinsss/twitter-web-exporterExport tweets, bookmarks, lists and much more from Twitter(X) web app. (推文/书签/收藏/列表导出工具)
TypeScript MIT License UpdatedMar 12, 2025 -

