Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

convert to guff and ollama #10

Open
Aniwine opened this issue Sep 22, 2024 · 6 comments
Open

convert to guff and ollama #10

Aniwine opened this issue Sep 22, 2024 · 6 comments

Comments

@Aniwine
Copy link

Aniwine commented Sep 22, 2024

I want to deploy it via ollama, so I firstly convert it to .guff file by llama.cpp's convert_hf_to_guff.py,but I got an error that KeyError "<|user|>",so I found it not in added_tokens_decoder of tokenizer_config.json while it exists in convert_hf_to_guff.py.So I comment this line out, code can finish normally.And ollama create model also successfully completed.But when ollama run model,I got tensor number errors,
image

I wonder what happens after removing the <|user|> keyword and how should I fix this?

@ExtReMLapin
Copy link

Same issue.

@ExtReMLapin
Copy link

Working on it, managed to get a gguf working

@ExtReMLapin
Copy link

@Aniwine Aniwine closed this as completed Sep 30, 2024
@Aniwine
Copy link
Author

Aniwine commented Sep 30, 2024

https://huggingface.co/LPN64/LongCite-llama3.1-8b-GGUF

thank you, i want to know how did you solve this problem please?

@Aniwine Aniwine reopened this Sep 30, 2024
@ExtReMLapin
Copy link

I used the gguf llamacpp conversion tool but it was missing files so I literally copy pasted files from llama 3.1 (non instruct model) and modified the end of sentence token in generation config file

@Aniwine
Copy link
Author

Aniwine commented Sep 30, 2024

I used the gguf llamacpp conversion tool but it was missing files so I literally copy pasted files from llama 3.1 (non instruct model) and modified the end of sentence token in generation config file

thank you~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants