Skip to content

server: enable jinja by default, update docs#17524

Merged
ngxson merged 2 commits into
ggml-org:masterfrom
ngxson:xsn/server_jinja_enabled_default
Nov 27, 2025
Merged

server: enable jinja by default, update docs#17524
ngxson merged 2 commits into
ggml-org:masterfrom
ngxson:xsn/server_jinja_enabled_default

Conversation

@ngxson
Copy link
Copy Markdown
Contributor

@ngxson ngxson commented Nov 26, 2025

On server, we want to enable jinja by default to allow tool calling and default system prompts. More and more models require this, so I think it's finally time to make it enabled by default.

However, we don't want to enable this for other examples (like llama-cli or llama-run), because these examples cannot yet handle rolling back tokens. This can happen when the chat template wants to modify pass tokens, for example, one can delete the reasoning content from the formatted chat.

This PR also update the auto-generated docs via llama-gen-docs command

@ngxson ngxson requested a review from ggerganov as a code owner November 26, 2025 16:15
@CISC
Copy link
Copy Markdown
Member

CISC commented Nov 26, 2025

However, we don't want to enable this for other examples (like llama-cli or llama-run), because these examples cannot yet handle rolling back tokens. This can happen when the chat template wants to modify pass tokens, for example, one can delete the reasoning content from the formatted chat.

Supported in #16603 should probably make jinja default there as well.

@ngxson ngxson merged commit e509411 into ggml-org:master Nov 27, 2025
75 of 76 checks passed
@rankaiyx
Copy link
Copy Markdown
Contributor

"disable jinja template for chat (default: enabled)"

Is there a mistake here?

@rankaiyx
Copy link
Copy Markdown
Contributor

Hmm, that's a bit ambiguous.

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026
* server: enable jinja by default, update docs

* fix tests
blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026
* server: enable jinja by default, update docs

* fix tests
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
* server: enable jinja by default, update docs

* fix tests
ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026
* server: enable jinja by default, update docs

* fix tests
my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026
* server: enable jinja by default, update docs

* fix tests
my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026
* server: enable jinja by default, update docs

* fix tests
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

examples python python script changes server

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants