Error loading the latest Chinese LLaMA Alpaca model #1464

Closed
done434 opened this issue May 15, 2023 · 7 comments

done434 commented May 15, 2023

Errors when loading the latest Chinese LLaMA/Alpaca Plus-13B model:

./main -m ../ggml-alpaca13b-q5_1.bin -n 256 --repeat_penalty 1.0 --color -i -r "[Steve]:" -f chat-with-vicuna-v1.txt
main: build = 526 (e6a46b0)
main: seed = 1684135558
llama.cpp: loading model from ../ggml-alpaca13b-q5_1.bin
error loading model: unknown (magic, version) combination: 67676a74, 0000000; is this really a GGML file?
llama_init_from_file: failed to load model
llama_init_from_gpt_params: error: failed to load model '../ggml-alpaca13b-q5_1.bin'
main: error: unable to load model

Here are the steps to combine the Chinese Alpaca model with the original LLaMA model (a rough command sketch follows the list):

  1. Use merge_llama_with_chinese_lora.py from the Chinese-LLaMA-Alpaca project to merge Chinese-LLaMA-Plus-13B and chinese-alpaca-plus-lora-13b with the original LLaMA model; the output is in pth format.
  2. Use this project's convert.py on models/13B/ to convert the combined model to ggml format.
  3. Use this project's quantize to quantize the model with the q5_1 method.
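
For reference, those three steps as shell commands look roughly like this. The paths and directory names are just examples, and the exact flags of merge_llama_with_chinese_lora.py may differ between versions of the Chinese-LLaMA-Alpaca project, so treat this as a sketch rather than the exact invocation:

# 1. Merge the Chinese LLaMA/Alpaca LoRA weights into the original LLaMA weights (pth output)
python merge_llama_with_chinese_lora.py \
  --base_model path/to/original-llama-13b-hf \
  --lora_model path/to/chinese-llama-plus-lora-13b,path/to/chinese-alpaca-plus-lora-13b \
  --output_type pth --output_dir models/13B

# 2. Convert the merged pth model to ggml format with this project's convert.py
python convert.py models/13B/

# 3. Quantize with the q5_1 method (the name of the f16 file produced by convert.py may differ)
./quantize models/13B/ggml-model-f16.bin models/13B/ggml-model-q5_1.bin q5_1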

No errors in the above steps.

But when loading the model with the main program, I get the errors shown above.
It seems that the (magic, version) combination 67676a74, 0000000 is not supported when loading the model.
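
As a side note, the magic 67676a74 is just the ASCII string "ggjt", so the file header can be inspected directly. Assuming xxd is available, for example:

# print the first 8 bytes: the 4-byte magic followed by a 4-byte version field
xxd -l 8 ../ggml-alpaca13b-q5_1.bin
# the bytes 67 67 6a 74 show up as "ggjt" in the ASCII column on the right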

Any solution or suggestion about this? Thanks!

jamesljl commented May 15, 2023

I've encountered the same problem. Same steps as above: no errors while merging and quantizing, but it just hangs when loading the model, before the ">" prompt appears. The model itself seems to load without error.

@jamesljl

Download the previous version: https://github.com/ggerganov/llama.cpp/releases

The previous version doesn't work either; it can't even load the 8-bit quantized model.

done434 commented May 16, 2023

I found the cause of the problem. The original document suggests converting the model with a command like this:
python convert.py zh-models/7B/

I read convert.py carefully and found it has a --vocab-dir parameter:
"--vocab-dir", type=Path, help="directory containing tokenizer.model, if separate from model file"

The document asks you to put the tokenizer.model in the upper-level directory. I guess convert.py can't pick up that tokenizer.model file, and in fact the tokenizer.model shipped with the Chinese Alpaca model is different from the original LLaMA one.

So in the end I added the --vocab-dir parameter to point at the directory containing the Chinese Alpaca's tokenizer.model.
Then everything works.
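
In other words, something like this (the paths are only examples; point --vocab-dir at wherever the Chinese Alpaca's tokenizer.model actually lives):

# use the Chinese Alpaca tokenizer.model instead of the one convert.py would otherwise pick up
python convert.py zh-models/13B/ --vocab-dir path/to/chinese-alpaca-tokenizer-dir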

jamesljl commented May 16, 2023

I just reinstalled the Ubuntu VM, cloned the latest version, and recompiled it. Then it works. That's weird.
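
For anyone who wants to try the same thing, the rebuild is roughly the standard clone and make (the model path below is just the one from the original report):

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make
# then run main against the quantized model as before
./main -m ../ggml-alpaca13b-q5_1.bin -n 256 --repeat_penalty 1.0 --color -i -r "[Steve]:" -f chat-with-vicuna-v1.txt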

MrLiu199 commented May 21, 2023

I pulled the latest release version, re-ran make and ./main, and it worked. The reason in my case was that I had converted and quantized with the newer version but was trying to run with an older version.
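
In other words, make sure main, convert.py and quantize all come from the same checkout. Roughly what I mean (a plain pull and clean rebuild; adjust the model path to your own):

git pull
make clean
make
# re-run with the model that was converted/quantized by this same version
./main -m models/13B/ggml-model-q5_1.bin -n 256 -i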

github-actions bot added the stale label Mar 25, 2024

github-actions bot commented Apr 9, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.
