Update README.md
---

GGUF of [Replete-AI Llama 3 11.5B Instruct V2](https://huggingface.co/Replete-AI/Llama-3-11.5B-Instruct-v2)
Quantized with llama.cpp commit <s>[b2710](https://github.com/ggerganov/llama.cpp/releases/tag/b2710)</s> <s>[b2780](https://github.com/ggerganov/llama.cpp/releases/tag/b2780)</s> [b2876](https://github.com/ggerganov/llama.cpp/releases/tag/b2876); verified with no warnings in llama.cpp.
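For reference, the quantization step with llama.cpp's `quantize` tool (its name at commit b2876) looks roughly like the sketch below. The filenames are illustrative placeholders, not the exact files used for this repo.

```shell
# Hypothetical sketch: convert an f16 GGUF to Q6_K with llama.cpp's quantize tool.
# Input/output filenames are placeholders, not the actual files in this repo.
./quantize Llama-3-11.5B-Instruct-v2-f16.gguf Llama-3-11.5B-Instruct-v2-Q6_K.gguf Q6_K
```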

Simple PPL comparison<br>
<code>
<i>perplexity.exe -m [MODEL] -f wiki.test.raw -b 512 -ngl 99</i>

SFR-Iterative-DPO-LLaMA-3-11.5B-R-Q6_K.gguf = Final estimate: <b>PPL = 8.4438 +/- 0.06271</b><br>
Meta-Llama-3-8B-Instruct-Q6_K.gguf = Final estimate: <b>PPL = 8.4727 +/- 0.06308</b>
</code>
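A quick back-of-the-envelope check on those numbers, assuming the reported `+/-` values are independent standard errors: the gap between the two models is well under one combined standard error, so this run does not show a statistically meaningful PPL difference.

```python
import math

# PPL estimates copied from the comparison above (mean +/- standard error)
ppl_11b, err_11b = 8.4438, 0.06271  # SFR-Iterative-DPO-LLaMA-3-11.5B-R-Q6_K.gguf
ppl_8b, err_8b = 8.4727, 0.06308    # Meta-Llama-3-8B-Instruct-Q6_K.gguf

diff = ppl_8b - ppl_11b                     # 0.0289 in favor of the 11.5B model
combined_err = math.hypot(err_11b, err_8b)  # ~0.089, errors added in quadrature

print(f"difference: {diff:.4f}, combined stderr: {combined_err:.4f}")
# The difference is only ~0.3 combined standard errors, i.e. within noise.
```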

Original model description below<hr>