Jared Van Bortel committed on
Commit a4ff1a1
Parent(s): d7e1f72
README: add note about llama.cpp version
README.md
CHANGED
@@ -25,6 +25,8 @@ This repo contains llama.cpp-compatible files for [nomic-embed-text-v1](https://
 llama.cpp will default to 2048 tokens of context with these files. To use the full 8192 tokens that Nomic Embed is benchmarked on, you will have to choose a context extension method. The original model uses Dynamic NTK-Aware RoPE scaling, but that is not currently available in llama.cpp. A combination of YaRN and linear scaling is an acceptable substitute.

+These files were converted and quantized with llama.cpp commit [6c00a0669](https://github.com/ggerganov/llama.cpp/commit/6c00a066928b0475b865a2e3e709e2166e02d548).
+
 ## Example `llama.cpp` Command

 Compute a single embedding:
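The YaRN-plus-linear-scaling combination described in the diff above can be passed to llama.cpp on the command line. The sketch below is illustrative, not taken from this commit: it assumes llama.cpp's `embedding` example has been built at a version where the `--rope-scaling` and `--rope-freq-scale` options exist, and the model file name, scale factor, and prompt are placeholders (Nomic Embed expects a task prefix such as `search_query:` on its inputs).

```shell
# Hypothetical invocation of llama.cpp's embedding example.
# -c/-b 8192          : use the full 8192-token context Nomic Embed is benchmarked on
# --rope-scaling yarn : YaRN context extension
# --rope-freq-scale   : linear RoPE frequency scaling (0.75 is an illustrative value)
./embedding -m nomic-embed-text-v1.f16.gguf -c 8192 -b 8192 \
    --rope-scaling yarn --rope-freq-scale 0.75 \
    -p 'search_query: What is TSNE?'
```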