IHaveNoClueAndIMustPost committed
Commit b8d6a13 • 1 parent: 3d513f8
Update README.md
README.md CHANGED
@@ -8,7 +8,15 @@ tags:
 
 ---
 
 A zero-training self-merge test of [Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R](https://huggingface.co/Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R) using the settings described on [mistral-11b-slimorca](https://huggingface.co/chargoddard/mistral-11b-slimorca)<br>
+It's... not dumber, I guess 🤷♀️
 
+Simple PPL comparison<br>
+<code>
+<i>perplexity.exe -m [MODEL] -f wiki.test.raw -b 512 -ngl 99</i><br>
+<i>SFR-Iterative-DPO-LLaMA-3-8B-R-F16.gguf</i> - Final estimate: <b>PPL = 7.0279 +/- 0.04493</b><br>
+<i>SFR-Iterative-DPO-LLaMA-3-11.5B-R-Q6_K.gguf</i> - Final estimate: <b>PPL = 7.0500 +/- 0.04516</b><br>
+<i>Meta-Llama-3-8B-Instruct-Q6_K</i> - Final estimate: <b>PPL = 8.4727 +/- 0.06308</b>
+</code>
 
 <b><u>Tools used / version</u></b><br>
 <b>Mergekit:</b> <i>c93c9bb</i><br>
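
Reading the comparison: the merged model's 7.0500 is only 0.0221 above the base model's 7.0279, well inside the roughly ±0.045 standard error reported for either run, and the merge was measured at Q6_K quantization while the base ran at F16. In other words, stacking the extra layers left wiki.test perplexity essentially unchanged, while both models stay well below Meta-Llama-3-8B-Instruct's 8.4727.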
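
The mistral-11b-slimorca card the README points to for its settings describes a zero-training passthrough merge that stacks two overlapping layer ranges of the same model. A minimal sketch of what the equivalent mergekit YAML could look like here, assuming the same [0, 24] / [8, 32] split carries over (Llama-3-8B also has 32 layers, and the resulting 48-layer stack lands near the 11.5B size in the filename; the actual config is not published in this commit):

# Hypothetical config, not taken from this repo: a passthrough self-merge
# of SFR-Iterative-DPO-LLaMA-3-8B-R following the mistral-11b-slimorca recipe.
slices:
  - sources:
      - model: Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R
        layer_range: [0, 24]
  - sources:
      - model: Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R
        layer_range: [8, 32]
merge_method: passthrough  # layers are copied verbatim; no weights are blended or trained
dtype: bfloat16

A config like this would be run with mergekit's standard entry point, e.g. <i>mergekit-yaml config.yml ./SFR-Iterative-DPO-LLaMA-3-11.5B-R</i>, which writes the merged checkpoint to the output directory.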