iPad App #844
Replies: 7 comments 6 replies
-
Please do share! |
Beta Was this translation helpful? Give feedback.
-
I think it's going to get a lot of attention when it comes out |
Beta Was this translation helpful? Give feedback.
-
Thanks for the interest. I'll keep working on it. The largest remaining item I need to resolve within llama itself is this: https://github.com/ggerganov/llama.cpp/blob/master/llama.cpp#L60 The allocations for scratch in a 7B param model are pushing just over the limits of available memory, so the only way this runs locally is if I hack these allocations to be dynamic. I'll see if I can clean up that patch and submit it soon. |
Beta Was this translation helpful? Give feedback.
-
There seems to be a llama.cpp with mmap out there now, which might already
do what you need. Look through the issues on github
…On Sunday, April 9, 2023, Oliver Clark Rickard ***@***.***> wrote:
Thanks for the interest. I'll keep working on it. The largest remaining
item I need to resolve within llama itself is this:
https://github.com/ggerganov/llama.cpp/blob/master/llama.cpp#L60
The allocations for scratch in a 7B param model are pushing just over the
limits of available memory, so the only way this runs locally is if I hack
these allocations to be dynamic. I'll see if I can clean up that patch and
submit it soon.
—
Reply to this email directly, view it on GitHub
<#844 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABWGTOI3LQM7BCWN5VXCBW3XAH5PDANCNFSM6AAAAAAWXFY4Y4>
.
You are receiving this because you commented.Message ID: <ggerganov/llama.
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Any success getting this to work @ocrickard and others? Interested in doing the same thing. |
Beta Was this translation helpful? Give feedback.
-
Hello. I'm doing something similar. I am writing an application in SwiftUI so that you can select different models and inference. LLaMa and GPTNeox are currently supported. OpenLLaMa 3B works well even on iPhone. Unfortunately, 7B and largest models can only be launched on MacOS and the iOS Simulator due to a |
Beta Was this translation helpful? Give feedback.
-
Hi, I want to write an app for iphone to run llama.cpp. Can I add your discord account to ask you how to integrate the model with swift UI? My discord username: shiyu_liu. Thank you very much!!! |
Beta Was this translation helpful? Give feedback.
-
I've been playing with using llama to help me tell stories to my daughter at night. I wrote a simple native iPad app that uses llama.cpp, and provides some nice model / thread management capabilities on top of it. It runs quite well on my M2 iPad after a few tweaks to the memory allocations in llama.
I'm curious if there's enough interest in something like this for me to continue polishing it and share it somewhere.
trimmed.mp4
Beta Was this translation helpful? Give feedback.
All reactions