woofwolfy committed
Commit cb96cc1
Parent(s): 30a3fc9

Update README.md

Files changed (1):
  1. README.md +67 -31
README.md CHANGED
@@ -116,46 +116,82 @@ model-index:
  name: Open LLM Leaderboard
  ---
 
- # woofwolfy/WestIceLemonTeaRP-32k-7b-Q5_K_M-GGUF
  This model was converted to GGUF format from [`icefog72/WestIceLemonTeaRP-32k-7b`](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b) for more details on the model.
 
- ## Use with llama.cpp
- Install llama.cpp through brew (works on Mac and Linux):
 
- ```bash
- brew install llama.cpp
- ```
- Invoke the llama.cpp server or the CLI.
 
- ### CLI:
- ```bash
- llama-cli --hf-repo woofwolfy/WestIceLemonTeaRP-32k-7b-Q5_K_M-GGUF --hf-file westicelemontearp-32k-7b-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
- ```
 
- ### Server:
- ```bash
- llama-server --hf-repo woofwolfy/WestIceLemonTeaRP-32k-7b-Q5_K_M-GGUF --hf-file westicelemontearp-32k-7b-q5_k_m-imat.gguf -c 2048
- ```
 
- Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the llama.cpp repo.
 
- Step 1: Clone llama.cpp from GitHub.
- ```
- git clone https://github.com/ggerganov/llama.cpp
- ```
 
- Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag along with any hardware-specific flags (for example, `LLAMA_CUDA=1` for NVIDIA GPUs on Linux).
- ```
- cd llama.cpp && LLAMA_CURL=1 make
- ```
 
- Step 3: Run inference through the main binary.
- ```
- ./llama-cli --hf-repo woofwolfy/WestIceLemonTeaRP-32k-7b-Q5_K_M-GGUF --hf-file westicelemontearp-32k-7b-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
- ```
- or
- ```
- ./llama-server --hf-repo woofwolfy/WestIceLemonTeaRP-32k-7b-Q5_K_M-GGUF --hf-file westicelemontearp-32k-7b-q5_k_m-imat.gguf -c 2048
- ```
  name: Open LLM Leaderboard
  ---
 
+ # woofwolfy/WestIceLemonTeaRP-32k-7b-Q5_K_M-GGUF-Imatrix
  This model was converted to GGUF format from [`icefog72/WestIceLemonTeaRP-32k-7b`](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b) for more details on the model.
 
+ # WestIceLemonTeaRP-32k-7b
 
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63407b719dbfe0d48b2d763b/RxJ8WbYsu_OAd8sICmddp.png)
 
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
+ ## Merge Details
+
+ Prompt template: Alpaca, maybe ChatML.
+
+ * measurement.json for exl2 quantization included.
+
+ - [4.2bpw-exl2](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b-4.2bpw-exl2)
+ - [6.5bpw-exl2](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b-6.5bpw-exl2)
+ - [8bpw-exl2](https://huggingface.co/icefog72/WestIceLemonTeaRP-32k-7b-8bpw-exl2)
+
+ Thanks to mradermacher and SilverFan for:
+ * [mradermacher/WestIceLemonTeaRP-32k-GGUF](https://huggingface.co/mradermacher/WestIceLemonTeaRP-32k-GGUF)
+ * [SilverFan/WestIceLemonTeaRP-7b-32k-GGUF](https://huggingface.co/SilverFan/WestIceLemonTeaRP-7b-32k-GGUF)
+
+ ### Merge Method
+
+ This model was merged using the SLERP merge method.
+
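For intuition, SLERP blends two models along the great-circle arc between their weight vectors rather than along the straight line, preserving the angular relationship between the tensors. A minimal NumPy sketch of the idea (illustrative only; mergekit's actual implementation handles per-tensor dtype and edge cases differently):

```python
import numpy as np

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation from v0 (t=0) to v1 (t=1)."""
    # Work with normalized copies to measure the angle between the vectors.
    v0_n = v0 / (np.linalg.norm(v0) + eps)
    v1_n = v1 / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.dot(v0_n, v1_n), -1.0, 1.0)
    theta = np.arccos(dot)
    if theta < eps:
        # Nearly parallel vectors: fall back to plain linear interpolation.
        return (1 - t) * v0 + t * v1
    sin_theta = np.sin(theta)
    # Weight each endpoint by how far along the arc t sits.
    return (np.sin((1 - t) * theta) / sin_theta) * v0 + \
           (np.sin(t * theta) / sin_theta) * v1

a = np.array([1.0, 0.0])
b = np.array([0.0, 1.0])
mid = slerp(0.5, a, b)  # halfway along the arc between orthogonal vectors
```

At `t = 0` the result is the first model's tensor and at `t = 1` the second's; intermediate values trace the arc between them.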
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [IceLemonTeaRP-32k-7b](https://huggingface.co/icefog72/IceLemonTeaRP-32k-7b)
+ * WestWizardIceLemonTeaRP
+   * [SeverusWestLake-7B-DPO](https://huggingface.co/s3nh/SeverusWestLake-7B-DPO)
+   * WizardIceLemonTeaRP
+     * [Not-WizardLM-2-7B](https://huggingface.co/amazingvince/Not-WizardLM-2-7B)
+     * [IceLemonTeaRP-32k-7b](https://huggingface.co/icefog72/IceLemonTeaRP-32k-7b)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ slices:
+   - sources:
+       - model: IceLemonTeaRP-32k-7b
+         layer_range: [0, 32]
+       - model: WestWizardIceLemonTeaRP
+         layer_range: [0, 32]
+ merge_method: slerp
+ base_model: IceLemonTeaRP-32k-7b
+ parameters:
+   t:
+     - filter: self_attn
+       value: [0, 0.5, 0.3, 0.7, 1]
+     - filter: mlp
+       value: [1, 0.5, 0.7, 0.3, 0]
+     - value: 0.5
+ dtype: float16
+ ```
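The `value` lists under `t` act as anchor points stretched across the 32 layers: one schedule for self-attention tensors, one for MLP tensors, and a default of 0.5 for everything else. A rough sketch of how such an anchor list could expand into per-layer `t` values (an assumption about the interpolation behavior, not mergekit's exact code):

```python
import numpy as np

def expand_anchors(anchors, n_layers):
    # Place the anchor values at evenly spaced positions over [0, 1] and
    # linearly interpolate a t value for each layer index.
    anchor_pos = np.linspace(0.0, 1.0, len(anchors))
    layer_pos = np.linspace(0.0, 1.0, n_layers)
    return np.interp(layer_pos, anchor_pos, anchors)

# Schedules from the configuration above.
self_attn_t = expand_anchors([0, 0.5, 0.3, 0.7, 1], 32)
mlp_t = expand_anchors([1, 0.5, 0.7, 0.3, 0], 32)
```

Under this reading, later-layer self-attention tensors take more from the second model while MLP tensors shift the opposite way.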
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63407b719dbfe0d48b2d763b/GX-kV-H8_zAJz5hHL8A7G.png)
+
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_icefog72__WestIceLemonTeaRP-32k-7b).
+
+ | Metric                            | Value |
+ |-----------------------------------|------:|
+ | Avg.                              | 71.27 |
+ | AI2 Reasoning Challenge (25-Shot) | 68.77 |
+ | HellaSwag (10-Shot)               | 86.89 |
+ | MMLU (5-Shot)                     | 64.28 |
+ | TruthfulQA (0-shot)               | 62.47 |
+ | Winogrande (5-shot)               | 80.98 |
+ | GSM8k (5-shot)                    | 64.22 |