froggeric
/

WestLake-10.7B-v2

@@ -11,20 +11,13 @@ language:
 ---
 # WestLake-10.7B-v2: Role-Play & Text Generation Specialist Model
-* Base: WestLake-7B-v2 based of Mistral-7B-v0.1
-* Context size: **8192** (even though Mistral-7B is 32k, WestLake was trained with 8k, and using a larger context is likely to cause problems)
-* Prompt format: in general, Mistral based models are able to understand many prompt formats, but the following ones produce the best results, and are recommended
-  - **ChatML** (used during WestLake training)
-  - **Zephyr** (variant of ChatML which sometimes produces better results)
-  - **Alpaca** (reported by senseable as working better than ChatML)
-  - **Mistral Instruct** (original format from Mistral-7B)
-This is my first viable self-merge of the fantastic WestLake-7B-v2 model, obtained after 12 rounds of testing with different
-merge settings. In my [LLM Creativity Benchmark](https://huggingface.co/datasets/froggeric/creativity), it greatly improves over the original 7B model, and ranks between miqu-1-120b
-and goliath-120b! I would describe the improvements as a better writing style, with more details. It does have
-a small negative point, which is it has a bit more difficulties following instruction, but not by much.
-It is also the first model I have tested to obtain a perfect score with the following test!
 ```
 Write a sequence of nominal groups that flow into one another, using the following rules:
 - each nominal group is made of exactly 3 words
@@ -38,21 +31,23 @@ Present your solution as a list numbered with roman numerals.
 Finally, explain why you chose your specific theme.
 ```
-## Merge Details
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-### Merge Method
-This model was merged using the passthrough merge method.
-### Models Merged
 The following models were included in the merge:
 * [senseable/WestLake-7B-v2](https://huggingface.co/senseable/WestLake-7B-v2)
-### Configuration
 The following YAML configuration was used to produce this model:
 ```yaml
@@ -78,13 +73,11 @@ slices:
 ---
-# Original model card
 **Update Notes:**
 *Version 2 trained 1 additional epoch cycle for 3 total*
-# Westlake-7Bv2: Role-Play & Text Generation Specialist Model
 Welcome to the documentation of Westlake-7B, a cutting-edge language model designed for exceptional role-play and text generation tasks. This README file aims to provide an overview of our capabilities, usage guidelines, and potential applications.
 ## About Westlake-7Bv2

 ---
 # WestLake-10.7B-v2: Role-Play & Text Generation Specialist Model
+[GGUF version available here](https://huggingface.co/froggeric/WestLake-10.7B-v2-GGUF)
+This is my first viable self-merge of the fantastic WestLake-7B-v2 model, obtained after more than 12 rounds of testing different
+merge configurations. In my [LLM Creativity Benchmark](https://huggingface.co/datasets/froggeric/creativity), it greatly improves over the original 7B model, and ranks between miqu-1-120b
+and goliath-120b! I would describe the improvements as a better writing style, with more details. It has a bit more difficulties following instructions, but not by much.
+It is also the first model I have tested to obtain a perfect score with the following test:
 ```
 Write a sequence of nominal groups that flow into one another, using the following rules:
 - each nominal group is made of exactly 3 words
 Finally, explain why you chose your specific theme.
 ```
+## Usage
+* Base model: senseable/WestLake-7B-v2 based of Mistral-7B-v0.1
+* Context size: **8192** (even though Mistral-7B is 32k, WestLake was trained with 8k, and using a larger context is likely to cause problems)
+* Prompt format: in general, Mistral based models are able to understand many prompt formats, but the following produce the best results, and are recommended
+  - **ChatML** (used during WestLake training)
+  - **Zephyr** (variant of ChatML which I have found to sometimes produce better results)
+  - **Alpaca** (reported by senseable as working better than ChatML)
+  - **Mistral Instruct** (original format from Mistral-7B)
+## Merge Details
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).\
+This model was merged using the passthrough merge method.\
 The following models were included in the merge:
 * [senseable/WestLake-7B-v2](https://huggingface.co/senseable/WestLake-7B-v2)
 The following YAML configuration was used to produce this model:
 ```yaml
 ---
+# Original model card: Westlake-7Bv2: Role-Play & Text Generation Specialist Model
 **Update Notes:**
 *Version 2 trained 1 additional epoch cycle for 3 total*
 Welcome to the documentation of Westlake-7B, a cutting-edge language model designed for exceptional role-play and text generation tasks. This README file aims to provide an overview of our capabilities, usage guidelines, and potential applications.
 ## About Westlake-7Bv2