Where is tokenizer.model? #870

Closed
Thresher12 opened this issue Apr 10, 2023 · 3 comments

Thresher12 commented Apr 10, 2023

Hello, sorry if this is a simple question, but I am trying to convert the GPT4All model with the command given in the description:

python3 convert-gpt4all-to-ggml.py models/gpt4all-7B/gpt4all-lora-quantized.bin ./models/tokenizer.model

However, there is no tokenizer.model file in the repo, no hint about where to get it, and even googling turns up nothing. Where are you supposed to get this file? Thanks.
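
For readers hitting the same problem, here is a minimal sketch of a pre-flight check that fails early with a clear message when either input file is missing, instead of letting the converter fail partway through. The helper names `build_convert_command` and `run_conversion` are hypothetical and not part of the repo; the script name and paths are taken from the command above.

```python
import subprocess
import sys
from pathlib import Path


def build_convert_command(model_path, tokenizer_path,
                          script="convert-gpt4all-to-ggml.py"):
    """Return the argv list for the conversion, or raise if an input is missing.

    Hypothetical helper: it only checks that both files exist and then
    assembles the same command shown in the issue.
    """
    for path in (model_path, tokenizer_path):
        if not Path(path).is_file():
            raise FileNotFoundError(f"required input not found: {path}")
    return [sys.executable, script, str(model_path), str(tokenizer_path)]


def run_conversion(model_path, tokenizer_path):
    """Run the converter; raises CalledProcessError if it exits non-zero."""
    subprocess.run(build_convert_command(model_path, tokenizer_path), check=True)
```

Calling `run_conversion("models/gpt4all-7B/gpt4all-lora-quantized.bin", "models/tokenizer.model")` from the repo root would then either run the script or report which file is absent.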

@Thresher12
Author

I happened to find a suitable tokenizer.model among the gpt4-x-alpaca files.

arkilis commented Apr 10, 2023

@Thresher12 I'm facing the same issue. Could you share that file, please? Thank you.

@Thresher12
Author

> @Thresher12 I'm facing the same issue. Could you share that file, please? Thank you.

It's here. It would be nice to have links, or at least some hint, as to where to find all the required dependencies that aren't already included.

https://huggingface.co/chavinlo/gpt4-x-alpaca/tree/main
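
If you would rather script the download than click through the file listing, something like the following sketch might work. It assumes that the file is named `tokenizer.model` and sits at the root of the `chavinlo/gpt4-x-alpaca` repo, that the repo is still publicly available, and that the Hugging Face Hub's `resolve/<revision>/<filename>` direct-download URL pattern applies. The function names are hypothetical.

```python
import urllib.request
from pathlib import Path


def tokenizer_url(repo_id="chavinlo/gpt4-x-alpaca", revision="main",
                  filename="tokenizer.model"):
    """Build a direct-download URL (assumes the Hub's resolve/ URL pattern)."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def download_tokenizer(dest="models/tokenizer.model"):
    """Fetch the tokenizer into dest, creating the parent directory if needed.

    Assumption: the repo and file are still reachable; otherwise urlretrieve
    raises an HTTPError or URLError.
    """
    dest = Path(dest)
    dest.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(tokenizer_url(), str(dest))
    return dest
```

The default destination matches the `./models/tokenizer.model` path passed to the conversion script in the original question.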
