
llama.cpp acts too dumb while running on phone!! #802


Closed
Shreyas-ITB opened this issue Apr 6, 2023 · 13 comments
Labels
generation quality Quality of model output

Comments

@Shreyas-ITB

I was trying llama.cpp on my phone with Termux installed, but look at this image:
Screenshot_20230406-120404

Specifications
The phone has 8 GB of RAM (about 7 GB free) and an 8-core CPU, so RAM and CPU are not the issue.
Model used: alpaca-7B-lora
llama.cpp version: latest
prompt: chat-with-bob.txt

I really don't know what is causing the issue here. When I ask it a question, it either answers in a very dumb way or just repeats the question back without answering. With the same model, prompt, and llama.cpp version, my PC with 4 GB of RAM works as expected and answers nearly every question correctly (almost 98% accuracy). Can any of you help me out with this, or update llama.cpp and fix the mobile issues, please?

Thank you.

@BarfingLemurs
Contributor

Try playing with the settings, like increasing the temperature.
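
For reference, here is a rough sketch of where those settings go when invoking the main example (the model path is a placeholder; this assumes a build whose main binary accepts these common sampling flags):

```sh
# Hypothetical model path; adjust to your setup.
# --temp raises the sampling temperature, --top_k/--top_p control top-k and
# nucleus sampling, and --repeat_penalty discourages echoing the prompt back.
./main -m ./models/ggml-alpaca-7b-q4.bin \
  -f ./prompts/chat-with-bob.txt \
  --temp 0.8 --top_k 40 --top_p 0.9 \
  --repeat_penalty 1.1 \
  -n 256 -t 8
```

On an 8-core phone, `-t 8` keeps the thread count in line with the hardware; the sampling values above are just starting points to experiment with.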

@Shreyas-ITB
Author

Okay, I'll try it and see, then I'll let you know.

@gjmulder added the generation quality (Quality of model output) label Apr 6, 2023
@Shreyas-ITB
Author

@BarfingLemurs Nope, it doesn't work; it stays the same (acts way dumber and still repeats the question).
@gjmulder Please add a bug or similar label to this issue, please. It needs some more development.

@gjmulder
Collaborator

gjmulder commented Apr 6, 2023

You may want to log an issue with Stanford Alpaca. It is the training set for the Alpaca model you are using.

@Shreyas-ITB changed the title from "llama.cpp acts why dumber on phone!!" to "llama.cpp acts too dumb while running on phone!!" Apr 7, 2023
@Shreyas-ITB
Author

@gjmulder I'm using Alpaca LoRA, maybe that's the issue? I mean, it's not a problem with the Alpaca model though; it worked fine on my laptop and for many users on their PCs, it only malfunctions on mobile.

@FNsi
Contributor

FNsi commented Apr 7, 2023

Maybe it's because of Termux. I don't even know how to use Firefox in Termux to watch YouTube at 720p without it crashing 😅😂
I guess it's the RAM restrictions your phone places on the Termux app.

@Shreyas-ITB
Author

Shreyas-ITB commented Apr 7, 2023

@FNsi lol, it's not due to Termux. The same issue happens with UserLAnd (an app that's intended to run Ubuntu on Android without root).

@FNsi
Contributor

FNsi commented Apr 7, 2023

> @FNsi lol, it's not due to Termux. The same issue happens with UserLAnd (an app that's intended to run Ubuntu on Android without root).

I think it's almost the same, since they're all emulated terminals? I saw an Android fork on the Google Play store, maybe you can try it?

#750

@gjmulder
Collaborator

gjmulder commented Apr 7, 2023

The same model should work similarly with llama.cpp on any platform, if the same temp, top_k, etc. parameters are being passed to llama.cpp. The random number generator is different, so you will never get exactly the same output, but the outputs should be similar in quality. The only other difference I could imagine would be performance.
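
To make that comparison concrete, one option (a sketch, with a placeholder model path) is to run the exact same command on both the PC and the phone with a fixed seed and identical sampling parameters; the outputs still won't match token-for-token across platforms, but their quality should be comparable:

```sh
# Same command on both machines: a fixed seed (-s) plus identical
# temp/top_k/top_p/repeat_penalty values. Differences in RNG and
# floating-point behaviour across platforms can still change the exact
# tokens, but not the overall quality.
./main -m ./models/ggml-alpaca-7b-q4.bin \
  -f ./prompts/chat-with-bob.txt \
  -s 42 --temp 0.7 --top_k 40 --top_p 0.9 \
  --repeat_penalty 1.1 -n 128
```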

@unbounded
Contributor

There are various optimized code paths that are only enabled for certain platforms and feature sets; there could be differences in the implementations of those.

Could you post the initial output with the system_info and model parameters?
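
As a quick way to gather that on the phone (a sketch assuming an AArch64 Android device under Termux; the model path is a placeholder), you can compare the CPU features the kernel reports with the system_info line the binary prints at startup:

```sh
# On AArch64, NEON is reported as "asimd" in the kernel's feature list.
grep -i -m1 'features' /proc/cpuinfo

# llama.cpp prints a system_info line at startup showing which SIMD
# paths (AVX, AVX2, NEON, ...) the build has enabled; capture it here.
./main -m ./models/ggml-alpaca-7b-q4.bin -p "test" -n 1 2>&1 | grep -i system_info
```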

@Shreyas-ITB
Author

@unbounded the output I get is at the start of the issue (there is a screenshot of what the model is saying).
Model parameters are the same as in the chat.sh file in the repository's examples directory.

System Info
ARM Cortex-A53 octa-core processor
8 GB RAM
Android 12, and there are no AVX or AVX2 flags in the CPU as it's an ARM processor.

@unbounded
Contributor

Could be related to #876, which was fixed in 684da25.

@unbounded
Contributor

Closing as assumed fixed by 684da25; feel free to reopen if this still happens with the latest version.
