Nabushika commited on
Commit
b16a789
1 Parent(s): c51f542

Upload folder using huggingface_hub

Browse files
README.md ADDED
@@ -0,0 +1,89 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ ---
4
+ # Join our Discord! https://discord.gg/Nbv9pQ88Xb
5
+ ## Nearly 2500 members strong 💪
6
+ ### Now with more channels! A hub for creatives and makers alike!
7
+ ---
8
+
9
+ [BeaverAI](https://huggingface.co/BeaverAI) proudly presents...
10
+
11
+ # Behemoth 123B v2.2 🦣
12
+
13
+ > Nothing in the void is foreign to us. The place we go is the place we belong.
14
+
15
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/fLdJM1oTjLpEKJsbl1BB7.png)
16
+
17
+ ## Links
18
+ - Original: https://huggingface.co/TheDrummer/Behemoth-123B-v2.2
19
+ - GGUF: https://huggingface.co/TheDrummer/Behemoth-123B-v2.2-GGUF
20
+ - iMatrix: https://huggingface.co/bartowski/Behemoth-123B-v2.2-GGUF (recommended for smaller quants)
21
+
22
+ ## Description
23
+
24
+ Behemoth v2.x is a finetune of the new Largestral 2411 with system prompt support. Testers have noted that **everything** felt improved.
25
+
26
+ ### Usage
27
+ Testers say this frankenformat maximizes the model's potential: **Metharme** with Mistral's new system tokens
28
+ - `[SYSTEM_PROMPT] <|system|>{{system_message}}[/SYSTEM_PROMPT]<|user|>{{user_message}}<|model|>{{assistant_message}}`
29
+ - `<|system|>[SYSTEM_PROMPT] {{system_message}}[/SYSTEM_PROMPT]<|user|>{{user_message}}<|model|>{{assistant_message}}`
30
+
31
+ *Take note that the opening system tag SHOULD ALWAYS have a leading whitespace after it.*
32
+
33
+ Complete SillyTavern Settings in BeaverAI Club: https://discord.com/channels/1238219753324281886/1309968730301792370/1309968730301792370
34
+
35
+ ### Versions
36
+ - [v2.0](https://huggingface.co/TheDrummer/Behemoth-123B-v2) is equivalent to Behemoth v1.0 (Classic)
37
+ - [v2.1](https://huggingface.co/TheDrummer/Behemoth-123B-v2.1) is equivalent to Behemoth v1.1 (Creative Boost)
38
+ - [v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2) is an improvement of Behemoth v2.1 (Creative++)
39
+
40
+ ## Special Thanks
41
+
42
+ Thank you to each and everyone who donated/subscribed in [Ko-Fi](https://ko-fi.com/thedrummer) 🙇 I hope to never disappoint!
43
+
44
+ ```
45
+ Toasty Pigeon
46
+ theguywhogamesalot
47
+ Grozi
48
+ F
49
+ Marinara
50
+ Ko-fi Supporter
51
+ Grozi
52
+ Phaelon
53
+ ONTHEREDTEAM
54
+ EvarinSharath'fe(USM-Valor)
55
+ Silva
56
+ Dakkidaze
57
+ AlexTheVP
58
+ Pseudo
59
+ Kistara
60
+ Dr. Fjut
61
+ Grozi 🥈
62
+ KinjiHakari777
63
+ dustywintr
64
+ Syd
65
+ HumbleConsumer
66
+ Syd
67
+ Ko-fi Supporter
68
+ Arkamist
69
+ joe 🥇
70
+ Toad
71
+ Lied
72
+ Konnect
73
+ Kistara
74
+ Grozi 🥉
75
+ SleepDeprived3
76
+ Luigi
77
+ Nestor
78
+ ```
79
+
80
+ https://ko-fi.com/thedrummer/leaderboard
81
+
82
+ ```
83
+ Finetuned by yours truly,
84
+ Drummer
85
+ ```
86
+
87
+ Thank you Gargy for the GPUs!
88
+
89
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/KvyYIIA1zkxQNEdGro007.png)
config.json ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "merged/BEHEMOTH-SLERP",
3
+ "architectures": [
4
+ "MistralForCausalLM"
5
+ ],
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 1,
8
+ "eos_token_id": 2,
9
+ "head_dim": 128,
10
+ "hidden_act": "silu",
11
+ "hidden_size": 12288,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 28672,
14
+ "max_position_embeddings": 131072,
15
+ "model_type": "mistral",
16
+ "num_attention_heads": 96,
17
+ "num_hidden_layers": 88,
18
+ "num_key_value_heads": 8,
19
+ "rms_norm_eps": 1e-05,
20
+ "rope_theta": 1000000.0,
21
+ "sliding_window": null,
22
+ "tie_word_embeddings": false,
23
+ "torch_dtype": "bfloat16",
24
+ "transformers_version": "4.46.3",
25
+ "use_cache": true,
26
+ "vocab_size": 32768
27
+ }
measurement.json ADDED
The diff for this file is too large to render. See raw diff
 
output-00001-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d639195a8b3cdf6592e9e12dd1d795554b5963ab73f545aaf17dab12cd490a26
3
+ size 2109838320
output-00002-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3d14184222741b64e596388ce7e3e7c8a1cd575bd18f897b7439c1ed730ecc4e
3
+ size 2126500970
output-00003-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bdf43551d13c940740ba5bcb7f50bbc280c2e100874ea7bb88aad43abd876480
3
+ size 2123410632
output-00004-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:335f5b69cd99937b30d317316f456f96d5ec535ff8632c008bf1942e391f32cb
3
+ size 2144467348
output-00005-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0075b64d8bc630d20aa9d3f64b34e672d49ef12190f9c192c488b5e1c5e536fb
3
+ size 2144475054
output-00006-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f6be7fb65f563432a7339b5d257d9229127c33b6620166d84f3983ec1939ab2
3
+ size 2102349726
output-00007-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6ae4ec30f4ead41e8846ce7a632aee978cdddf68cdda71d2aa5cee30a2286876
3
+ size 2089066830
output-00008-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d507a4f1298c5a440c645eaa206cdec0d7b45c9416fc8d9cef40c0a6ed5ef2e6
3
+ size 2118310388
output-00009-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b8154b11f0f6d8401e6580108b5073039b4dcd9fe5c3cc99d07bc43fd6b7dec4
3
+ size 2107770792
output-00010-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d629d5dd7e138e2d593cc47895b44d929cc07850827e51152b67179bdedce458
3
+ size 2127456360
output-00011-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aa638fcd7c43bafc11dc27a2567304cbabb0b61356c6c3f793285e0e454d2278
3
+ size 2138774076
output-00012-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4fb1e4741cedc057170c617c0946159455b05236ffd76aa03b88ff13a88c1cea
3
+ size 2126373324
output-00013-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:54d771b610520ab5e87d3e4ad1b3ec5674c646a62a0e9d9c31701d6bf4c9c72e
3
+ size 2035079784
output-00014-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2ea6f5e37bf2923aff2bb9651cb6ba9a98c2e8aee3f69ff0a7d3c43cd4ca2230
3
+ size 2126570812
output-00015-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2bb4a384b02511b7f96c03a899c5b08191867e4f6cf494125ab1a5f8c224b82c
3
+ size 2142441552
output-00016-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f50ab3234ee9e2049349dc234fe0ea88e28756ee4db4cd40213600ff1b8e3ba4
3
+ size 2147042344
output-00017-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c309cce07878d48786ec4eafd589214ddf3dc68c25dc52cd1e090743ccc1e1a4
3
+ size 2146541404
output-00018-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c7637be8b20deb85f16b3bf984ad1f6681b812ab3987080cae73082b00a1039e
3
+ size 2105639140
output-00019-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:657ef972a75d51adbfb0b2299b3d4caba966b6dd4c4a9c51b3813ae63fa0b65c
3
+ size 2145232940
output-00020-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b72468feef948ed335aef1c5e17dfe8e4d785678f91b86ba493ac0721211f3b8
3
+ size 2101482416
output-00021-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5287fa404fd1542394606fe5cdd9b6910ebedb3af7811d62d516b894bb843897
3
+ size 2116395696
output-00022-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3cb1af2437abaa4a6ec4eefa2561bf3627dfcae602c0e2592384e222b1eea150
3
+ size 1961312536
output-00023-of-00023.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19608ea3e8597c99f6482d70630b1df85549cbca89f19485a7019f130085ee7b
3
+ size 317718624
special_tokens_map.json ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "eos_token": {
10
+ "content": "</s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "unk_token": {
17
+ "content": "<unk>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ }
23
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1b968b8dc352f42192367337c78ccc61e1eaddc6d641a579372d4f20694beb7a
3
+ size 587562
tokenizer_config.json ADDED
The diff for this file is too large to render. See raw diff