-
Notifications
You must be signed in to change notification settings - Fork 11.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
musa: enable MMA
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13149
opened Apr 28, 2025 by
yeahdongcn
•
Draft
PowerPC: Enable MMA for BF16 in llamafile_sgemm
ggml
changes relating to the ggml tensor library for machine learning
#13148
opened Apr 28, 2025 by
shalinib-ibm
Loading…
llama : (mrope) allow using normal 1D position for text token
examples
#13138
opened Apr 27, 2025 by
ngxson
Loading…
clip : refactor set input for cgraph + fix qwen2.5vl input
examples
#13136
opened Apr 27, 2025 by
ngxson
Loading…
CUDA: build archs as virtual for GGML_NATIVE=OFF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13135
opened Apr 27, 2025 by
JohannesGaessler
Loading…
convert : improve model arch handling
python
python script changes
#13122
opened Apr 26, 2025 by
ngxson
Loading…
sycl : Implemented reorder Q4_K mmvq
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13109
opened Apr 25, 2025 by
sgeor255
Loading…
1 task
ggml-backend : add load_tensor() to backend API
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#13104
opened Apr 25, 2025 by
bachelor-dou
Loading…
ggml: Implement yield barrier using futex for improved thread scheduling efficiency
ggml
changes relating to the ggml tensor library for machine learning
#13079
opened Apr 23, 2025 by
SongXiaoXi
Loading…
SYCL: Add all missing unary kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13074
opened Apr 23, 2025 by
qnixsynapse
Loading…
Reduce enum sizes some are used in structs, which allowed them to be optimized.
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#13071
opened Apr 22, 2025 by
GermanAizek
Loading…
fix(rpc): Improve input validation and error handling
ggml
changes relating to the ggml tensor library for machine learning
#13069
opened Apr 22, 2025 by
thevilledev
Loading…
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
python
python script changes
#13058
opened Apr 22, 2025 by
glide-the
Loading…
Update README.md for tts example to use afplay on MacOS
examples
#13056
opened Apr 22, 2025 by
maxxam1221
Loading…
ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel
ggml
changes relating to the ggml tensor library for machine learning
#13053
opened Apr 21, 2025 by
eddnjjn
Loading…
[CANN]Support OP MUL_MAT_ID
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#13042
opened Apr 21, 2025 by
noemotiovon
Loading…
gguf-py : avoid requiring PySide6 for packaged scripts
bugfix
fixes an issue or bug
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
#13036
opened Apr 20, 2025 by
compilade
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.