-
Notifications
You must be signed in to change notification settings - Fork 11.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
common: Ensure libcommon.so is build if BUILD_SHARED_LIBS=ON (#13156)
#13158
opened Apr 28, 2025 by
kinchahoy
Loading…
CUDA: fix non-cont. inputs for batched mat mul
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13155
opened Apr 28, 2025 by
JohannesGaessler
Loading…
musa: enable MMA
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13149
opened Apr 28, 2025 by
yeahdongcn
•
Draft
PowerPC: Enable MMA for BF16 in llamafile_sgemm
ggml
changes relating to the ggml tensor library for machine learning
#13148
opened Apr 28, 2025 by
shalinib-ibm
Loading…
CUDA: build archs as virtual for GGML_NATIVE=OFF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13135
opened Apr 27, 2025 by
JohannesGaessler
Loading…
convert : improve model arch handling
python
python script changes
#13122
opened Apr 26, 2025 by
ngxson
Loading…
sycl : Implemented reorder Q4_K mmvq
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13109
opened Apr 25, 2025 by
sgeor255
Loading…
1 task
ggml-backend : add load_tensor() to backend API
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#13104
opened Apr 25, 2025 by
bachelor-dou
Loading…
ggml: Implement yield barrier using futex for improved thread scheduling efficiency
ggml
changes relating to the ggml tensor library for machine learning
#13079
opened Apr 23, 2025 by
SongXiaoXi
Loading…
Reduce enum sizes some are used in structs, which allowed them to be optimized.
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#13071
opened Apr 22, 2025 by
GermanAizek
Loading…
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
python
python script changes
#13058
opened Apr 22, 2025 by
glide-the
Loading…
Update README.md for tts example to use afplay on MacOS
examples
#13056
opened Apr 22, 2025 by
maxxam1221
Loading…
ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel
ggml
changes relating to the ggml tensor library for machine learning
#13053
opened Apr 21, 2025 by
eddnjjn
Loading…
[CANN]Support OP MUL_MAT_ID
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#13042
opened Apr 21, 2025 by
noemotiovon
Loading…
gguf-py : avoid requiring PySide6 for packaged scripts
bugfix
fixes an issue or bug
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
#13036
opened Apr 20, 2025 by
compilade
Loading…
quantize: improve pattern matching for allowed tensors
examples
#13033
opened Apr 20, 2025 by
EAddario
Loading…
Bitnet: directly use scale instead of inverting it twice
python
python script changes
#13026
opened Apr 19, 2025 by
viraatdas
Loading…
Nix portability improvements
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#13005
opened Apr 18, 2025 by
hacker1024
Loading…
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling
examples
ggml
changes relating to the ggml tensor library for machine learning
#12995
opened Apr 17, 2025 by
max-krasnyansky
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-04-25.