turboderp
/

Llama-3.1-70B-Instruct-exl2

Model card Files Files and versions Community

turboderp commited on Jul 24, 2024

Commit

08704f1

•

1 Parent(s): acfb5c2

Create README.md

Files changed (1) hide show

README.md +16 -0

README.md ADDED Viewed

	@@ -0,0 +1,16 @@

+---
+license: llama3.1
+---
+EXL2 quants of [Llama-3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct/tree/main)
+_**This model requires the dev branch of ExLlamaV2 for now. New release coming soon with the necessary changes.**_
+[2.50 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/2.5bpw)
+[3.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/3.0bpw)
+[3.50 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/3.5bpw)
+[4.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/4.0bpw)
+[4.50 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/4.5bpw)
+[5.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/5.0bpw)
+[6.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/6.0bpw)
+[measurement.json](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/blob/main/measurement.json)