nomic-embed-text-v1.5.Q8_0.gguf claims to support a context length of 8192, but AutoGGUFEmbeddings loaded with this model still has a context length of only 512, even though I called setNCtx(8191) and the model's nCtx attribute reports 8191. Any advice on how to configure the model to increase the context length would be appreciated. Thanks.
Answered by DevinTDHa, Mar 8, 2025
Hi @pwyang123,
I was able to reproduce it and I'm working on a fix. There are some issues with the error handling: it shouldn't fail silently. I'll update this discussion when the fix is ready. Thanks for reporting!
In the meantime, can you try the following:
Explanation (for reference see this discussion):
llama.cpp allows for setting of the 1. logical and 2. phys…
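As a rough illustration of why raising nCtx alone may not be enough, here is a minimal sketch. It assumes the two settings in question are llama.cpp's logical batch size (`n_batch`) and physical batch size (`n_ubatch`), that an embedding input has to fit into a single physical batch, and that the defaults are 2048 and 512. The function name `effective_embedding_limit` is hypothetical and not part of Spark NLP or llama.cpp.

```python
def effective_embedding_limit(n_ctx: int, n_batch: int, n_ubatch: int) -> int:
    """Largest input (in tokens) that can be embedded in one pass.

    Assumption: the input must fit in the context window, the logical
    batch, and the physical batch at the same time, so the smallest of
    the three values is the real limit.
    """
    return min(n_ctx, n_batch, n_ubatch)


# Context raised to 8192, batch sizes left at the assumed defaults.
print(effective_embedding_limit(n_ctx=8192, n_batch=2048, n_ubatch=512))   # 512

# All three values raised together.
print(effective_embedding_limit(n_ctx=8192, n_batch=8192, n_ubatch=8192))  # 8192
```

Under these assumptions, the 512 limit in the question would come from the physical batch size rather than from nCtx, so the batch sizes would need to be raised together with the context length.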