michaelfeil
commited on
Commit
•
6f4ed08
1
Parent(s):
3c5e6ea
Update README.md
Browse files
README.md
CHANGED
@@ -54,8 +54,8 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
|
|
54 |
- [GGUF by Crusoe](https://huggingface.co/crusoeai/Llama-3-8B-Instruct-1048k-GGUF). Note that you need to add 128009 as [special token with llama.cpp](https://huggingface.co/gradientai/Llama-3-8B-Instruct-262k/discussions/13).
|
55 |
- [MLX-4bit](https://huggingface.co/mlx-community/Llama-3-8B-Instruct-1048k-4bit)
|
56 |
- [Ollama](https://ollama.com/library/llama3-gradient)
|
57 |
-
- vLLM docker image, recommended to load via `--max-model-len
|
58 |
-
|
59 |
|
60 |
## The Gradient AI Team
|
61 |
|
|
|
54 |
- [GGUF by Crusoe](https://huggingface.co/crusoeai/Llama-3-8B-Instruct-1048k-GGUF). Note that you need to add 128009 as [special token with llama.cpp](https://huggingface.co/gradientai/Llama-3-8B-Instruct-262k/discussions/13).
|
55 |
- [MLX-4bit](https://huggingface.co/mlx-community/Llama-3-8B-Instruct-1048k-4bit)
|
56 |
- [Ollama](https://ollama.com/library/llama3-gradient)
|
57 |
+
- vLLM docker image, recommended to load via `--max-model-len 32768`
|
58 |
+
- If you are interested in a hosted version, drop us a mail below.
|
59 |
|
60 |
## The Gradient AI Team
|
61 |
|