Project: https://github.com/orgs/ggml-org/projects/6 List: - [guide : running gpt-oss with llama.cpp](https://github.com/ggml-org/llama.cpp/discussions/15396) - [guide : adding new model architectures](https://github.com/ggml-org/llama.cpp/discussions/16770) - [guide : using the new WebUI of llama.cpp](https://github.com/ggml-org/llama.cpp/discussions/16938) - [tutorial : offline agentic coding with llama-server](https://github.com/ggml-org/llama.cpp/discussions/14758) - [tutorial : compute embeddings using llama.cpp](https://github.com/ggml-org/llama.cpp/discussions/7712) - [tutorial : parallel inference using Hugging Face dedicated endpoints](https://github.com/ggml-org/llama.cpp/discussions/9041) - [tutorial : KV cache reuse with llama-server](https://github.com/ggml-org/llama.cpp/discussions/13606) - [tutorial : measuring time to first token (TTFT) and time between tokens (TBT)](https://github.com/ggml-org/llama.cpp/discussions/14115) - [tutorial : reusing multiple prompt prefixes with slots (-np) in llama-server](https://github.com/ggml-org/llama.cpp/discussions/15530) TODO: - [x] https://github.com/ggml-org/llama.cpp/discussions/13488 - [ ] https://github.com/ggml-org/llama.cpp/discussions/13134 - [ ] https://github.com/ggml-org/llama.cpp/discussions/13251 - [ ] https://github.com/ggml-org/llama.cpp/discussions/12742 - [ ] How to get started with webui development (ref: https://github.com/ggml-org/llama.cpp/issues/13523#issuecomment-2879256096) - [ ] etc. Simply search for "How to" in the Discussions: https://github.com/ggml-org/llama.cpp/discussions?discussions_q=is%3Aopen+How+to Contributions for writing tutorials are welcome!
Project: ggml-org : tutorials
List:
TODO:
Simply search for "How to" in the Discussions: https://github.com/ggml-org/llama.cpp/discussions?discussions_q=is%3Aopen+How+to
Contributions for writing tutorials are welcome!