Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Run several single thread operators parellel threading Parallel processing and thread management
#850 opened Apr 8, 2023 by howard0su Loading…
Q4_0 scale selection using RMSE enhancement New feature or request Less than 4 bits Efforts related to viable quantized models using <4 bits research 🔬 Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#835 opened Apr 7, 2023 by sw Draft
Optimize locking behavior threading Parallel processing and thread management
#813 opened Apr 6, 2023 by janekb04 Loading…
ProTip! Filter pull requests by the default branch with base:master.