models : fix LFM2 tensors #17548
Conversation
Oh, oops, cc @tdakhran
tdakhran left a comment:
I overlooked it, thank you for the fix @ggerganov!
{LLM_TENSOR_POS_EMBD,        {LLM_TENSOR_LAYER_INPUT, GGML_OP_GET_ROWS}},
{LLM_TENSOR_TOKEN_EMBD_NORM, {LLM_TENSOR_LAYER_INPUT, GGML_OP_GET_ROWS}},
{LLM_TENSOR_TOKEN_TYPES,     {LLM_TENSOR_LAYER_INPUT, GGML_OP_GET_ROWS}},
{LLM_TENSOR_TOKEN_EMBD_NORM, {LLM_TENSOR_LAYER_INPUT, GGML_OP_MUL}},
I've never fully grasped where this is used and how. I guess this won't have any side effects for other models?
A quick search for tok_norm_b indicates that the token embedding norm is only used by older archs like BLOOM or BERT. I guess that, at the time, they used different (less efficient) training techniques, so the norm was there to make training more stable.

It should not make much of a difference though, because MUL should be available on most backends by now (so it's likely to be supported by whatever backend holds the input layer).
I added a comment to clarify how this information is used: #17550
Thanks, that's quite helpful.
alt #17248
LLM_TENSOR_TOKEN_EMBD_NORM tensor info