tutorials : list for llama.cpp

Project: https://github.com/orgs/ggml-org/projects/6

List:

- [guide : running gpt-oss with llama.cpp](https://github.com/ggml-org/llama.cpp/discussions/15396)
- [guide : adding new model architectures](https://github.com/ggml-org/llama.cpp/discussions/16770)
- [guide : using the new WebUI of llama.cpp](https://github.com/ggml-org/llama.cpp/discussions/16938)
- [tutorial : offline agentic coding with llama-server](https://github.com/ggml-org/llama.cpp/discussions/14758)
- [tutorial : compute embeddings using llama.cpp](https://github.com/ggml-org/llama.cpp/discussions/7712)
- [tutorial : parallel inference using Hugging Face dedicated endpoints](https://github.com/ggml-org/llama.cpp/discussions/9041)
- [tutorial : KV cache reuse with llama-server](https://github.com/ggml-org/llama.cpp/discussions/13606)
- [tutorial : measuring time to first token (TTFT) and time between tokens (TBT)](https://github.com/ggml-org/llama.cpp/discussions/14115)
- [tutorial : reusing multiple prompt prefixes with slots (-np) in llama-server](https://github.com/ggml-org/llama.cpp/discussions/15530)

TODO:
- [x] https://github.com/ggml-org/llama.cpp/discussions/13488
- [ ] https://github.com/ggml-org/llama.cpp/discussions/13134
- [ ] https://github.com/ggml-org/llama.cpp/discussions/13251
- [ ] https://github.com/ggml-org/llama.cpp/discussions/12742
- [ ] How to get started with webui development (ref: https://github.com/ggml-org/llama.cpp/issues/13523#issuecomment-2879256096)
- [ ] etc.

Simply search for "How to" in the Discussions: https://github.com/ggml-org/llama.cpp/discussions?discussions_q=is%3Aopen+How+to

Contributions for writing tutorials are welcome!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tutorials : list for llama.cpp #13523

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

tutorials : list for llama.cpp #13523

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions