Doctor-Shotgun commited on
Commit
de252bb
1 Parent(s): 5a55fcb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -2
README.md CHANGED
@@ -16,14 +16,17 @@ This is a Llama 2-based model consisting of a gradient merge between:
16
  - [Chronos 13b v2](https://huggingface.co/elinas/chronos-13b-v2)
17
  - [Nous Hermes Llama2 13b](https://huggingface.co/NousResearch/Nous-Hermes-Llama2-13b)
18
 
 
 
 
 
 
19
  The merge was performed using [BlockMerge_Gradient](https://github.com/Gryphe/BlockMerge_Gradient) by Gryphe
20
 
21
  The intended objective was to combine NH2's superior instruction following capabilities with the creativity and response length of Chronos v2. Merge ratios used are identical to those used in [Chronoboros Grad](https://huggingface.co/kingbri/chronoboros-grad-l2-13B), with NH2 starting with a weight of 0.9 at the 1st layer and phasing out by the 25th layer. The method is illustrated in the image below, with green representing NH2 and blue representing Chronos v2:
22
 
23
  ![hermeboros-illustration](https://files.catbox.moe/18sjej.png)
24
 
25
- added_tokens.json was padded with dummy tokens to reach 32 added tokens in order to allow GGML conversion in llama.cpp without error due to vocab size mismatch.
26
-
27
  ## Usage:
28
 
29
  Intended to be prompted with the Alpaca instruction format of the base models:
 
16
  - [Chronos 13b v2](https://huggingface.co/elinas/chronos-13b-v2)
17
  - [Nous Hermes Llama2 13b](https://huggingface.co/NousResearch/Nous-Hermes-Llama2-13b)
18
 
19
+ Quantized Models Provided by TheBloke (Thanks!):
20
+ - [GGML](https://huggingface.co/TheBloke/Chronohermes-Grad-L2-13B-GGML)
21
+ - [GPTQ](https://huggingface.co/TheBloke/Chronohermes-Grad-L2-13B-GPTQ)
22
+
23
+
24
  The merge was performed using [BlockMerge_Gradient](https://github.com/Gryphe/BlockMerge_Gradient) by Gryphe
25
 
26
  The intended objective was to combine NH2's superior instruction following capabilities with the creativity and response length of Chronos v2. Merge ratios used are identical to those used in [Chronoboros Grad](https://huggingface.co/kingbri/chronoboros-grad-l2-13B), with NH2 starting with a weight of 0.9 at the 1st layer and phasing out by the 25th layer. The method is illustrated in the image below, with green representing NH2 and blue representing Chronos v2:
27
 
28
  ![hermeboros-illustration](https://files.catbox.moe/18sjej.png)
29
 
 
 
30
  ## Usage:
31
 
32
  Intended to be prompted with the Alpaca instruction format of the base models: