Replies: 2 comments 1 reply
This repo supports 4-bit quantization. Models released by researchers are usually in full or half precision: those with supercomputers are under no pressure to squeeze models onto a Raspberry Pi, and 4-bit quantization doesn't work well with training anyway. Different priorities, simply put — researchers are less interested in delivering a working product.
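The memory savings from quantization are easy to estimate from parameter count alone. A rough sketch (illustrative arithmetic only; the parameter count is the commonly cited 7B figure, and this ignores the KV cache and runtime overhead):

```python
# Back-of-envelope weight storage for a 7B-parameter model
# at different precisions. Real runtime usage is somewhat higher.

def model_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GiB."""
    return n_params * bits_per_weight / 8 / 2**30

n = 7e9  # 7B parameters, e.g. the smallest LLaMA variant

print(f"fp32:  {model_size_gib(n, 32):.1f} GiB")  # ~26.1 GiB
print(f"fp16:  {model_size_gib(n, 16):.1f} GiB")  # ~13.0 GiB
print(f"4-bit: {model_size_gib(n, 4):.1f} GiB")   # ~3.3 GiB
```

This is why a half-precision model barely fits in 16GB of RAM while the 4-bit version leaves plenty of headroom.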
Hello,
first of all, thank you very much for this authentic piece of GOLD!
I just wanted to understand why the official version from facebook, as well as alpaca and vicuna, requires a lot of RAM to run on the CPU (or a lot of VRAM to run on the GPU), while this version runs decently "fast" on my CPU with only 16GB of RAM.
Thanks