I propose refactoring main.cpp into a library (llama.cpp, compiled to llama.so/llama.a/whatever) and making main.cpp a simple driver program. A simple C API should be exposed to access the model, and then bindings can more easily be written for Python, node.js, or whatever other language. This would partially solve #82 and #162.
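To make the idea concrete, here is a rough sketch of what such a C header could look like. All of the names (`llama_context`, `llama_init_from_file`, `llama_eval`, and so on) are hypothetical and only illustrate the shape of the interface, not an existing API:

```c
/* llama.h -- hypothetical public C API for the proposed library */
#ifndef LLAMA_H
#define LLAMA_H

#ifdef __cplusplus
extern "C" {
#endif

/* Opaque handle to a loaded model and its evaluation state. */
typedef struct llama_context llama_context;

/* Load a ggml model file and return a ready-to-use context, or NULL on error. */
llama_context *llama_init_from_file(const char *path_model, int n_ctx, int n_threads);

/* Convert UTF-8 text into model tokens; returns the number of tokens written. */
int llama_tokenize(llama_context *ctx, const char *text, int *tokens, int max_tokens);

/* Run the model on n_tokens tokens, continuing from n_past previous tokens. */
int llama_eval(llama_context *ctx, const int *tokens, int n_tokens, int n_past);

/* Sample the next token from the logits produced by the last llama_eval call. */
int llama_sample_top_p_top_k(llama_context *ctx, int top_k, float top_p, float temp);

/* Free the context and all associated memory. */
void llama_free(llama_context *ctx);

#ifdef __cplusplus
}
#endif

#endif /* LLAMA_H */
```

With only an opaque pointer and plain C types crossing the boundary, the Python or node.js bindings would reduce to a thin FFI layer (ctypes/cffi, N-API, or similar) over the shared library.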
Edit: on that note, is it possible to do inference from two or more prompts on different threads? If so, serving multiple people would be possible without multiple copies of model weights in RAM.
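One way that could work, sketched as a variation on the header above: split the single opaque handle into shared, read-only weights and per-prompt state, so each thread gets its own small session while the big tensors are loaded once. Again, the `llama_model`/`llama_session` names are hypothetical:

```c
/* Hypothetical split between weights shared across threads and per-request state. */
typedef struct llama_model   llama_model;    /* weights: read-only after loading       */
typedef struct llama_session llama_session;  /* KV cache + logits: one per prompt/user */

llama_model   *llama_load_model(const char *path_model);
llama_session *llama_new_session(const llama_model *model, int n_ctx);
int            llama_session_eval(llama_session *s, const int *tokens, int n_tokens, int n_past);
void           llama_free_session(llama_session *s);
void           llama_free_model(llama_model *m);

/* Usage from a server: load the model once, then give each worker thread its own
 * session. The weights are never written after loading, so no locking is needed on
 * the model itself; only the per-session KV cache and scratch buffers are mutable.
 *
 *   llama_model *model = llama_load_model("model.bin");
 *   // thread A: llama_session *a = llama_new_session(model, 512);  evaluate prompt A
 *   // thread B: llama_session *b = llama_new_session(model, 512);  evaluate prompt B
 */
```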
For anyone wanting to do this, see an initial attempt in #77, and in particular this comment on ggerganov's preferred approach. Should be pretty straightforward I think.