gradientai
/

Llama-3-8B-Instruct-262k

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

michaelfeil commited on Apr 25

Commit

5b4d67e

•

1 Parent(s): e637c65

ADD GGUF (#8)

- ADD GGUF (1f7460f994d73ff45efcba6404d7a3336f31f645)

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -24,6 +24,10 @@ This model extends LLama-3 8B's context length from 8k to > 160K, developed by G
 We build on top of the EasyContext Blockwise RingAttention library [3] to scalably and efficiently train on contexts up to 262144 tokens on [Crusoe Energy](https://huggingface.co/crusoeai) high performance L40S cluster.
 **Data:**
 For training data, we generate long contexts by augmenting [SlimPajama](https://huggingface.co/datasets/cerebras/SlimPajama-627B).

 We build on top of the EasyContext Blockwise RingAttention library [3] to scalably and efficiently train on contexts up to 262144 tokens on [Crusoe Energy](https://huggingface.co/crusoeai) high performance L40S cluster.
+**Quantized versions and GGUF**
+GGUF is available on on Crusoe's huggingface account. Check it out here: [crusoeai/Llama-3-8B-Instruct-262k-GGUF](https://huggingface.co/crusoeai/Llama-3-8B-Instruct-262k-GGUF)
 **Data:**
 For training data, we generate long contexts by augmenting [SlimPajama](https://huggingface.co/datasets/cerebras/SlimPajama-627B).