Add script to convert old ggml files to newer version #539

thement · 2023-03-26T20:48:52Z

Followup to: #526

anzz1 · 2023-03-26T22:51:14Z

convert-unversioned-ggml-to-ggml.py

+    (magic, vocab_size, dim, multiple_of, n_heads, n_layers, rot, ftype) = header
+
+    if magic != 0x67676d6c:
+        raise Exception('Invalid file magic. Must be an old style ggml file.')


wait im confused, isn't it supposed to be old style ggml file for this?

It expects the unversioned GGML model and produces the versioned one.
#define LLAMA_FILE_MAGIC_UNVERSIONED 0x67676d6c // pre-versioned files

anzz1 · 2023-03-27T05:41:23Z

llama.cpp

@@ -320,7 +320,7 @@ static bool llama_model_load(
        uint32_t magic;
        fin.read((char *) &magic, sizeof(magic));
        if (magic == LLAMA_FILE_MAGIC_UNVERSIONED) {
-            fprintf(stderr, "%s: invalid model file '%s' (too old, regenerate your model files!)\n",
+            fprintf(stderr, "%s: invalid model file '%s' (too old, regenerate your model files or convert them with convert-unversioned-ggml-to-ggml.py!)\n",


this shouldn't be changed, as the conversion script is unsupported.

I think it's user friendly if it gives a hint to user how to resolve the situation with incompatible model.

anzz1 · 2023-03-27T05:43:18Z

I think it would be good to have the script print a disclaimer that the script is unsupported and results are not guaranteed, and to not post any issues regarding models generated with this.

anzz1 · 2023-03-27T14:36:46Z

Or another idea, post it in the discussions as an attachment? So it can be found there when needed, but at the same time it wouldn't promote using old model conversions?

Idk really, I'd like to hear other opinions too 😄

thement · 2023-03-27T19:15:26Z

I think it would be good to have the script print a disclaimer that the script is unsupported and results are not guaranteed, and to not post any issues regarding models generated with this.

The model isn't all that different. The only missing thing is score in vocabulary and I could theoretically fill that in from tokenizer.model.

ggerganov

Let's add this for now, but it will eventually be removed once everyone updates their models

Add script to convert old ggml files to newer version

53a187d

Green-Sky requested a review from eiz March 26, 2023 21:04

anzz1 reviewed Mar 26, 2023

View reviewed changes

anzz1 suggested changes Mar 27, 2023

View reviewed changes

anzz1 added enhancement New feature or request script Script related labels Mar 27, 2023

ggerganov approved these changes Mar 28, 2023

View reviewed changes

ggerganov merged commit d0aaff5 into ggml-org:master Mar 28, 2023

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add script to convert old ggml files to newer version #539

Add script to convert old ggml files to newer version #539

thement commented Mar 26, 2023

anzz1 Mar 26, 2023

thement Mar 27, 2023

anzz1 Mar 27, 2023

thement Mar 27, 2023

anzz1 commented Mar 27, 2023 •

edited

Loading

anzz1 commented Mar 27, 2023

thement commented Mar 27, 2023

ggerganov left a comment

Add script to convert old ggml files to newer version #539

Add script to convert old ggml files to newer version #539

Conversation

thement commented Mar 26, 2023

anzz1 Mar 26, 2023

Choose a reason for hiding this comment

thement Mar 27, 2023

Choose a reason for hiding this comment

anzz1 Mar 27, 2023

Choose a reason for hiding this comment

thement Mar 27, 2023

Choose a reason for hiding this comment

anzz1 commented Mar 27, 2023 • edited Loading

anzz1 commented Mar 27, 2023

thement commented Mar 27, 2023

ggerganov left a comment

Choose a reason for hiding this comment

anzz1 commented Mar 27, 2023 •

edited

Loading