-
Notifications
You must be signed in to change notification settings - Fork 13.9k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Run several single thread operators parellel
threading
Parallel processing and thread management
#850
opened Apr 8, 2023 by
howard0su
Loading…
Q4_0 scale selection using RMSE
enhancement
New feature or request
Less than 4 bits
Efforts related to viable quantized models using <4 bits
research 🔬
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
Optimize locking behavior
threading
Parallel processing and thread management
#813
opened Apr 6, 2023 by
janekb04
Loading…
Add "-e"/"--eval-threads" to distinguish thread counts for single-token eval and prompt eval
threading
Parallel processing and thread management
#744
opened Apr 3, 2023 by
MagisterLuddite
•
Draft
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.