InferenceIllusionist committed
Commit 2cfd468
1 Parent(s): 1b808ad

Update README.md

Files changed (1): README.md +12 -0

README.md CHANGED
@@ -17,6 +17,18 @@ license: cc-by-nc-4.0
 Quantized from fp16.
 * Weighted quantizations were created using fp16 GGUF and groups_merged.txt in 88 chunks and n_ctx=512

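The importance-matrix step described in the bullet above can be sketched as a llama.cpp command. This is a hedged dry-run sketch: the `llama-imatrix` binary and its `-m`/`-f`/`-o`/`-c`/`--chunks` flags are assumptions based on llama.cpp's imatrix tool, and the model path is a hypothetical placeholder. The command is echoed rather than executed, since the fp16 model is large:

```shell
# Hedged sketch of the imatrix step. groups_merged.txt, 88 chunks, and
# n_ctx=512 come from the bullet above; the binary name, flags, and model
# path are assumptions. Echo the command as a dry run instead of running it.
MODEL=model-fp16.gguf        # hypothetical path to the fp16 GGUF
CALIB=groups_merged.txt      # calibration text named above
echo ./llama-imatrix -m "$MODEL" -f "$CALIB" -o imatrix.dat -c 512 --chunks 88
```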
+ ## Recommended Sampler Settings (From Original Model Card)
+ ```
+ Temperature - 1.17
+ min_p - 0.075
+ Repetition Penalty - 1.10
+ ```
+
+ **SillyTavern Instruct Settings**:
+ <br>Context Template: Llama-3-Instruct-Names
+ <br>Instruct Presets: [Euryale-v2.1-Llama-3-Instruct](https://huggingface.co/Sao10K/L3-70B-Euryale-v2.1/blob/main/Euryale-v2.1-Llama-3-Instruct.json)
+
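For loaders driven from code rather than SillyTavern, the recommended settings above could be passed along these lines. This is a minimal sketch assuming llama-cpp-python, whose completion API accepts `temperature`, `min_p`, and `repeat_penalty` keyword arguments; the model path in the usage comment is a hypothetical placeholder:

```python
# Hedged sketch: the recommended sampler settings above, expressed as
# keyword arguments for llama-cpp-python (library and parameter names are
# assumptions; check them against your loader of choice).
sampler_settings = {
    "temperature": 1.17,    # Temperature from the model card
    "min_p": 0.075,         # min_p from the model card
    "repeat_penalty": 1.10, # Repetition Penalty from the model card
}

# Usage (hypothetical model path, requires llama-cpp-python):
# from llama_cpp import Llama
# llm = Llama(model_path="model.Q4_K_M.gguf")
# out = llm.create_completion("Hello", **sampler_settings)
```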
 For a brief rundown of iMatrix quant performance, please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)

 <i>All quants are verified working prior to uploading to repo for your safety and convenience.</i>