Upload folder using huggingface_hub
- README.md +89 -0
- config.json +27 -0
- measurement.json +0 -0
- output-00001-of-00023.safetensors +3 -0
- output-00002-of-00023.safetensors +3 -0
- output-00003-of-00023.safetensors +3 -0
- output-00004-of-00023.safetensors +3 -0
- output-00005-of-00023.safetensors +3 -0
- output-00006-of-00023.safetensors +3 -0
- output-00007-of-00023.safetensors +3 -0
- output-00008-of-00023.safetensors +3 -0
- output-00009-of-00023.safetensors +3 -0
- output-00010-of-00023.safetensors +3 -0
- output-00011-of-00023.safetensors +3 -0
- output-00012-of-00023.safetensors +3 -0
- output-00013-of-00023.safetensors +3 -0
- output-00014-of-00023.safetensors +3 -0
- output-00015-of-00023.safetensors +3 -0
- output-00016-of-00023.safetensors +3 -0
- output-00017-of-00023.safetensors +3 -0
- output-00018-of-00023.safetensors +3 -0
- output-00019-of-00023.safetensors +3 -0
- output-00020-of-00023.safetensors +3 -0
- output-00021-of-00023.safetensors +3 -0
- output-00022-of-00023.safetensors +3 -0
- output-00023-of-00023.safetensors +3 -0
- special_tokens_map.json +23 -0
- tokenizer.json +0 -0
- tokenizer.model +3 -0
- tokenizer_config.json +0 -0
README.md
ADDED
@@ -0,0 +1,89 @@
---
license: other
---
# Join our Discord! https://discord.gg/Nbv9pQ88Xb
## Nearly 2500 members strong 💪
### Now with more channels! A hub for creatives and makers alike!
---

[BeaverAI](https://huggingface.co/BeaverAI) proudly presents...

# Behemoth 123B v2.2 🦣

> Nothing in the void is foreign to us. The place we go is the place we belong.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/fLdJM1oTjLpEKJsbl1BB7.png)

## Links
- Original: https://huggingface.co/TheDrummer/Behemoth-123B-v2.2
- GGUF: https://huggingface.co/TheDrummer/Behemoth-123B-v2.2-GGUF
- iMatrix: https://huggingface.co/bartowski/Behemoth-123B-v2.2-GGUF (recommended for smaller quants)

## Description

Behemoth v2.x is a finetune of the new Largestral 2411 with system prompt support. Testers have noted that **everything** felt improved.

### Usage
Testers say this frankenformat maximizes the model's potential: **Metharme** with Mistral's new system tokens
- `[SYSTEM_PROMPT] <|system|>{{system_message}}[/SYSTEM_PROMPT]<|user|>{{user_message}}<|model|>{{assistant_message}}`
- `<|system|>[SYSTEM_PROMPT] {{system_message}}[/SYSTEM_PROMPT]<|user|>{{user_message}}<|model|>{{assistant_message}}`

*Note that the opening `[SYSTEM_PROMPT]` tag should ALWAYS be followed by a space.*

Complete SillyTavern Settings in BeaverAI Club: https://discord.com/channels/1238219753324281886/1309968730301792370/1309968730301792370
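
As a minimal sketch of the two templates listed under Usage, a single-turn prompt can be assembled with plain string formatting. The names `TEMPLATES`, `build_prompt`, and `variant` are hypothetical and not part of any library; the only details taken from this README are the tag layout and the space after `[SYSTEM_PROMPT]`.

```python
# Hypothetical helper, not an official API: assembles a single-turn prompt
# in the Metharme + Mistral system-token layout described in the README.

TEMPLATES = {
    # Variant 1: the Mistral system tag wraps the Metharme system tag.
    "outer": (
        "[SYSTEM_PROMPT] <|system|>{system}[/SYSTEM_PROMPT]"
        "<|user|>{user}<|model|>{assistant}"
    ),
    # Variant 2: the Metharme system tag precedes the Mistral system tag.
    "inner": (
        "<|system|>[SYSTEM_PROMPT] {system}[/SYSTEM_PROMPT]"
        "<|user|>{user}<|model|>{assistant}"
    ),
}


def build_prompt(system: str, user: str, assistant: str = "", variant: str = "outer") -> str:
    """Fill one template; leave `assistant` empty so the model continues from <|model|>."""
    return TEMPLATES[variant].format(system=system, user=user, assistant=assistant)


if __name__ == "__main__":
    print(build_prompt("You are a narrator.", "Describe the void."))
```

Keeping both layouts in one table makes it easy to compare them on the same input; which one works better is left to testing.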

### Versions
- [v2.0](https://huggingface.co/TheDrummer/Behemoth-123B-v2) is equivalent to Behemoth v1.0 (Classic)
- [v2.1](https://huggingface.co/TheDrummer/Behemoth-123B-v2.1) is equivalent to Behemoth v1.1 (Creative Boost)
- [v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2) is an improvement over Behemoth v2.1 (Creative++)

## Special Thanks

Thank you to each and every one of you who donated or subscribed on [Ko-Fi](https://ko-fi.com/thedrummer) 🙇 I hope to never disappoint!

```
Toasty Pigeon
theguywhogamesalot
Grozi
F
Marinara
Ko-fi Supporter
Grozi
Phaelon
ONTHEREDTEAM
EvarinSharath'fe(USM-Valor)
Silva
Dakkidaze
AlexTheVP
Pseudo
Kistara
Dr. Fjut
Grozi 🥈
KinjiHakari777
dustywintr
Syd
HumbleConsumer
Syd
Ko-fi Supporter
Arkamist
joe 🥇
Toad
Lied
Konnect
Kistara
Grozi 🥉
SleepDeprived3
Luigi
Nestor
```

https://ko-fi.com/thedrummer/leaderboard

```
Finetuned by yours truly,
Drummer
```

Thank you Gargy for the GPUs!

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/KvyYIIA1zkxQNEdGro007.png)
config.json
ADDED
@@ -0,0 +1,27 @@
{
  "_name_or_path": "merged/BEHEMOTH-SLERP",
  "architectures": [
    "MistralForCausalLM"
  ],
  "attention_dropout": 0.0,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "head_dim": 128,
  "hidden_act": "silu",
  "hidden_size": 12288,
  "initializer_range": 0.02,
  "intermediate_size": 28672,
  "max_position_embeddings": 131072,
  "model_type": "mistral",
  "num_attention_heads": 96,
  "num_hidden_layers": 88,
  "num_key_value_heads": 8,
  "rms_norm_eps": 1e-05,
  "rope_theta": 1000000.0,
  "sliding_window": null,
  "tie_word_embeddings": false,
  "torch_dtype": "bfloat16",
  "transformers_version": "4.46.3",
  "use_cache": true,
  "vocab_size": 32768
}
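
The values in this config are enough for some back-of-the-envelope sizing. The sketch below assumes the usual Mistral decoder layout: bias-free q/k/v/o projections, a three-matrix gated MLP, two RMSNorm weight vectors per layer, and untied input and output embeddings. The config implies this through `MistralForCausalLM` and `tie_word_embeddings: false` but does not spell it out. It also assumes an unquantized 16-bit KV cache. The function names are hypothetical.

```python
# Values copied from config.json in this commit.
CONFIG = {
    "head_dim": 128,
    "hidden_size": 12288,
    "intermediate_size": 28672,
    "max_position_embeddings": 131072,
    "num_attention_heads": 96,
    "num_hidden_layers": 88,
    "num_key_value_heads": 8,
    "vocab_size": 32768,
}


def count_parameters(cfg: dict) -> int:
    """Estimate the parameter count, assuming a bias-free Mistral-style decoder."""
    h = cfg["hidden_size"]
    q_width = cfg["num_attention_heads"] * cfg["head_dim"]
    kv_width = cfg["num_key_value_heads"] * cfg["head_dim"]
    attention = 2 * h * q_width + 2 * h * kv_width  # q_proj + o_proj, then k_proj + v_proj
    mlp = 3 * h * cfg["intermediate_size"]  # gate, up and down projections
    norms = 2 * h  # two RMSNorm weight vectors per layer
    per_layer = attention + mlp + norms
    embeddings = 2 * cfg["vocab_size"] * h  # input embedding + untied output head
    return cfg["num_hidden_layers"] * per_layer + embeddings + h  # + final norm


def kv_cache_bytes_per_token(cfg: dict, bytes_per_element: int = 2) -> int:
    """Bytes of K and V stored per token across all layers (16-bit cache assumed)."""
    kv_width = cfg["num_key_value_heads"] * cfg["head_dim"]
    return 2 * cfg["num_hidden_layers"] * kv_width * bytes_per_element


if __name__ == "__main__":
    params = count_parameters(CONFIG)
    per_token = kv_cache_bytes_per_token(CONFIG)
    full_context = per_token * CONFIG["max_position_embeddings"]
    heads_per_kv = CONFIG["num_attention_heads"] // CONFIG["num_key_value_heads"]
    print(f"parameters: {params / 1e9:.1f}B")
    print(f"query heads per KV head: {heads_per_kv}")
    print(f"KV cache: {per_token} bytes/token, {full_context / 2**30:.0f} GiB at full context")
```

Under these assumptions the estimate comes to about 122.6B parameters, consistent with the "123B" in the model name, with 12 query heads sharing each KV head and roughly 44 GiB of KV cache at the full 131072-token context.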
measurement.json
ADDED
The diff for this file is too large to render.
output-00001-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d639195a8b3cdf6592e9e12dd1d795554b5963ab73f545aaf17dab12cd490a26
size 2109838320
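
The entry above is a Git LFS pointer, not the weights themselves: three `key value` lines giving the spec version, a SHA-256 object id, and the size in bytes. As a sketch, the pointer can be parsed and a downloaded shard checked against it with the standard library alone. `parse_lfs_pointer` and `verify_file` are hypothetical helper names, and the local path in the demo is a placeholder.

```python
import hashlib
from pathlib import Path

# Pointer text for the first shard, copied from this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d639195a8b3cdf6592e9e12dd1d795554b5963ab73f545aaf17dab12cd490a26
size 2109838320
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and SHA-256 digest match the pointer."""
    if pointer["algorithm"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algorithm']}")
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a truncated download
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER_TEXT)
    print(pointer["algorithm"], pointer["size"])
    shard = Path("output-00001-of-00023.safetensors")  # placeholder local path
    if shard.exists():
        print("match:", verify_file(shard, pointer))
```

Hashing in fixed-size chunks keeps memory use flat even for multi-gigabyte shards.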
output-00002-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3d14184222741b64e596388ce7e3e7c8a1cd575bd18f897b7439c1ed730ecc4e
size 2126500970
output-00003-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bdf43551d13c940740ba5bcb7f50bbc280c2e100874ea7bb88aad43abd876480
size 2123410632
output-00004-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:335f5b69cd99937b30d317316f456f96d5ec535ff8632c008bf1942e391f32cb
size 2144467348
output-00005-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0075b64d8bc630d20aa9d3f64b34e672d49ef12190f9c192c488b5e1c5e536fb
size 2144475054
output-00006-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8f6be7fb65f563432a7339b5d257d9229127c33b6620166d84f3983ec1939ab2
size 2102349726
output-00007-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6ae4ec30f4ead41e8846ce7a632aee978cdddf68cdda71d2aa5cee30a2286876
size 2089066830
output-00008-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d507a4f1298c5a440c645eaa206cdec0d7b45c9416fc8d9cef40c0a6ed5ef2e6
size 2118310388
output-00009-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b8154b11f0f6d8401e6580108b5073039b4dcd9fe5c3cc99d07bc43fd6b7dec4
size 2107770792
output-00010-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d629d5dd7e138e2d593cc47895b44d929cc07850827e51152b67179bdedce458
size 2127456360
output-00011-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:aa638fcd7c43bafc11dc27a2567304cbabb0b61356c6c3f793285e0e454d2278
size 2138774076
output-00012-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4fb1e4741cedc057170c617c0946159455b05236ffd76aa03b88ff13a88c1cea
size 2126373324
output-00013-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:54d771b610520ab5e87d3e4ad1b3ec5674c646a62a0e9d9c31701d6bf4c9c72e
size 2035079784
output-00014-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2ea6f5e37bf2923aff2bb9651cb6ba9a98c2e8aee3f69ff0a7d3c43cd4ca2230
size 2126570812
output-00015-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2bb4a384b02511b7f96c03a899c5b08191867e4f6cf494125ab1a5f8c224b82c
size 2142441552
output-00016-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f50ab3234ee9e2049349dc234fe0ea88e28756ee4db4cd40213600ff1b8e3ba4
size 2147042344
output-00017-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c309cce07878d48786ec4eafd589214ddf3dc68c25dc52cd1e090743ccc1e1a4
size 2146541404
output-00018-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c7637be8b20deb85f16b3bf984ad1f6681b812ab3987080cae73082b00a1039e
size 2105639140
output-00019-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:657ef972a75d51adbfb0b2299b3d4caba966b6dd4c4a9c51b3813ae63fa0b65c
size 2145232940
output-00020-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b72468feef948ed335aef1c5e17dfe8e4d785678f91b86ba493ac0721211f3b8
size 2101482416
output-00021-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5287fa404fd1542394606fe5cdd9b6910ebedb3af7811d62d516b894bb843897
size 2116395696
output-00022-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3cb1af2437abaa4a6ec4eefa2561bf3627dfcae602c0e2592384e222b1eea150
size 1961312536
output-00023-of-00023.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:19608ea3e8597c99f6482d70630b1df85549cbca89f19485a7019f130085ee7b
size 317718624
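
Adding up the `size` fields of the 23 shard pointers gives the total download size. The sketch below also derives a rough bits-per-weight figure. That figure rests on an assumption this commit does not state: that the model has about 123e9 weights, taken only from the "123B" in its name. It also ignores any non-weight data stored in the shards. `ASSUMED_PARAMETERS` and the function names are hypothetical.

```python
# Sizes in bytes of output-00001 through output-00023, copied from the pointers
# in this commit.
SHARD_SIZES = [
    2109838320, 2126500970, 2123410632, 2144467348, 2144475054, 2102349726,
    2089066830, 2118310388, 2107770792, 2127456360, 2138774076, 2126373324,
    2035079784, 2126570812, 2142441552, 2147042344, 2146541404, 2105639140,
    2145232940, 2101482416, 2116395696, 1961312536, 317718624,
]

# Assumption, not stated in the commit: parameter count taken from the "123B" name.
ASSUMED_PARAMETERS = 123e9


def total_bytes(sizes: list[int]) -> int:
    """Total size of all shards in bytes."""
    return sum(sizes)


def bits_per_weight(size_in_bytes: int, parameters: float) -> float:
    """Average stored bits per weight, ignoring any non-weight data in the files."""
    return size_in_bytes * 8 / parameters


if __name__ == "__main__":
    total = total_bytes(SHARD_SIZES)
    print(f"shards: {len(SHARD_SIZES)}")
    print(f"total: {total} bytes ({total / 1e9:.1f} GB)")
    print(f"approx. bits per weight: {bits_per_weight(total, ASSUMED_PARAMETERS):.2f}")
```

The total is about 46.8 GB, or roughly 3 bits per weight under the assumption above. That suggests the shards hold a quantized export rather than the bfloat16 weights named in `config.json`, though this is an inference from the numbers, not something the commit states.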
special_tokens_map.json
ADDED
@@ -0,0 +1,23 @@
{
  "bos_token": {
    "content": "<s>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "</s>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "unk_token": {
    "content": "<unk>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
}
tokenizer.json
ADDED
The diff for this file is too large to render.
tokenizer.model
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1b968b8dc352f42192367337c78ccc61e1eaddc6d641a579372d4f20694beb7a
size 587562
tokenizer_config.json
ADDED
The diff for this file is too large to render.