Pull requests: ggml-org/llama.cpp

Pull requests list

llama-graph : fix text position for mrope
#13159 opened Apr 28, 2025 by ngxson
CUDA: fix non-cont. inputs for batched mat mul
Labels: ggml, Nvidia GPU
#13155 opened Apr 28, 2025 by JohannesGaessler
musa: enable MMA
Labels: ggml, Nvidia GPU
#13149 opened Apr 28, 2025 by yeahdongcn (Draft)
PowerPC: Enable MMA for BF16 in llamafile_sgemm
Labels: ggml
#13148 opened Apr 28, 2025 by shalinib-ibm
mtmd : add qwen2vl and qwen2.5vl examples
#13141 opened Apr 27, 2025 by ngxson
CUDA: build archs as virtual for GGML_NATIVE=OFF
Labels: ggml, Nvidia GPU
#13135 opened Apr 27, 2025 by JohannesGaessler
convert : improve model arch handling
Labels: python
#13122 opened Apr 26, 2025 by ngxson
sycl : Implemented reorder Q4_K mmvq
Labels: ggml, SYCL
#13109 opened Apr 25, 2025 by sgeor255
1 task
ggml-backend : add load_tensor() to backend API
Labels: Apple Metal, examples, ggml, Kompute, Nvidia GPU, SYCL, Vulkan
#13106 opened Apr 25, 2025 by rgerganov (Draft)
[sync #10544] llama/ggml: add LLM training support
Labels: examples, ggml, testing
#13105 opened Apr 25, 2025 by ggerganov (Draft)
1 task
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE
Labels: Ascend NPU, ggml
#13104 opened Apr 25, 2025 by bachelor-dou
ggml: Implement yield barrier using futex for improved thread scheduling efficiency
Labels: ggml
#13079 opened Apr 23, 2025 by SongXiaoXi
Reduce enum sizes some are used in structs, which allowed them to be optimized.
Labels: build, ggml, SYCL, Vulkan
#13071 opened Apr 22, 2025 by GermanAizek
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
Labels: python
#13058 opened Apr 22, 2025 by glide-the
ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel
Labels: ggml
#13053 opened Apr 21, 2025 by eddnjjn
[CANN]Support OP MUL_MAT_ID
Labels: Ascend NPU, ggml
#13042 opened Apr 21, 2025 by noemotiovon
gguf-py : avoid requiring PySide6 for packaged scripts
Labels: bugfix, devops, nix, python
#13036 opened Apr 20, 2025 by compilade
Bitnet: directly use scale instead of inverting it twice
Labels: python
#13026 opened Apr 19, 2025 by viraatdas
Nix portability improvements
Labels: devops, ggml, nix
#13005 opened Apr 18, 2025 by hacker1024
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling
Labels: examples, ggml
#12995 opened Apr 17, 2025 by max-krasnyansky