cookinai commited on
Commit
012f744
·
1 Parent(s): 514b83a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md CHANGED
@@ -1,3 +1,48 @@
1
  ---
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+ Slerp merge of mindy-labs/mindy-7b-v2 with jondurbin/bagel-dpo-7b-v0.1. This model was then slerp merged with rishiraj/CatPPT.
5
+
6
+ Heard some talk of jondurbin/bagel-dpo-7b-v0.1 in the community and it sounds intresting. Merged it with two high preforming models to get cookinai/Valkyrie-V1
7
+
8
+ Slerp 1:
9
+
10
+ ```.yaml:
11
+ slices:
12
+ - sources:
13
+ - model: jondurbin/bagel-dpo-7b-v0.1
14
+ layer_range: [0, 32]
15
+ - model: mindy-labs/mindy-7b-v2
16
+ layer_range: [0, 32]
17
+ merge_method: slerp
18
+ base_model: mindy-labs/mindy-7b-v2
19
+ parameters:
20
+ t:
21
+ - filter: self_attn
22
+ value: [0, 0.5, 0.3, 0.7, 1]
23
+ - filter: mlp
24
+ value: [1, 0.5, 0.7, 0.3, 0]
25
+ - value: 0.5 # fallback for rest of tensors
26
+ dtype: bfloat16
27
+ ```
28
+
29
+ Slerp 2:
30
+
31
+ ```.yaml:
32
+ slices:
33
+ - sources:
34
+ - model: previous/model/path
35
+ layer_range: [0, 32]
36
+ - model: rishiraj/CatPPT
37
+ layer_range: [0, 32]
38
+ merge_method: slerp
39
+ base_model: previous/model/path
40
+ parameters:
41
+ t:
42
+ - filter: self_attn
43
+ value: [0, 0.5, 0.3, 0.7, 1]
44
+ - filter: mlp
45
+ value: [1, 0.5, 0.7, 0.3, 0]
46
+ - value: 0.5 # fallback for rest of tensors
47
+ dtype: bfloat16
48
+ ```