michaelfeil commited on
Commit
6f4ed08
1 Parent(s): 3c5e6ea

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -54,8 +54,8 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
54
  - [GGUF by Crusoe](https://huggingface.co/crusoeai/Llama-3-8B-Instruct-1048k-GGUF). Note that you need to add 128009 as [special token with llama.cpp](https://huggingface.co/gradientai/Llama-3-8B-Instruct-262k/discussions/13).
55
  - [MLX-4bit](https://huggingface.co/mlx-community/Llama-3-8B-Instruct-1048k-4bit)
56
  - [Ollama](https://ollama.com/library/llama3-gradient)
57
- - vLLM docker image, recommended to load via `--max-model-len 65536`
58
-
59
 
60
  ## The Gradient AI Team
61
 
 
54
  - [GGUF by Crusoe](https://huggingface.co/crusoeai/Llama-3-8B-Instruct-1048k-GGUF). Note that you need to add 128009 as [special token with llama.cpp](https://huggingface.co/gradientai/Llama-3-8B-Instruct-262k/discussions/13).
55
  - [MLX-4bit](https://huggingface.co/mlx-community/Llama-3-8B-Instruct-1048k-4bit)
56
  - [Ollama](https://ollama.com/library/llama3-gradient)
57
+ - vLLM docker image, recommended to load via `--max-model-len 32768`
58
+ - If you are interested in a hosted version, drop us a mail below.
59
 
60
  ## The Gradient AI Team
61