---
language:
- en
library_name: transformers
license: llama2
quantized_by: mradermacher
tags:
- moe
- moerge
---
## About

Weighted quants of https://huggingface.co/ibivibiv/giant-hydra-moe-240b

<!-- provided-files -->
Static quants are available at https://huggingface.co/mradermacher/giant-hydra-moe-240b-GGUF

## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including how to concatenate multi-part files.
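
As a rough illustration of the concatenation step, here is a minimal Python sketch. It assumes the parts are plain byte-wise splits that only need to be joined in order; the function name `concatenate_parts` is made up for this example and is not part of any tool.

```python
import shutil
from pathlib import Path


def concatenate_parts(parts, output):
    """Join the downloaded parts, in the order given, into one GGUF file.

    Assumption: the parts are plain byte-wise splits, so joining them in
    order reproduces the original file.
    """
    output = Path(output)
    with output.open("wb") as joined:
        for part in parts:
            with Path(part).open("rb") as chunk:
                # Stream in blocks so multi-gigabyte parts never sit in memory.
                shutil.copyfileobj(chunk, joined, length=16 * 1024 * 1024)
    return output
```

For the two-part IQ2_S quant, for example, the call would be `concatenate_parts(["giant-hydra-moe-240b.i1-IQ2_S.gguf.part1of2", "giant-hydra-moe-240b.i1-IQ2_S.gguf.part2of2"], "giant-hydra-moe-240b.i1-IQ2_S.gguf")`. The order of the list matters, so pass the parts in ascending order.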

## Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar-sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ1_S.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ1_S.gguf.split-ab) | i1-IQ1_S | 49.5 | for the desperate |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_XXS.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_XXS.gguf.split-ab) | i1-IQ2_XXS | 63.4 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_XS.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_XS.gguf.split-ab) | i1-IQ2_XS | 70.5 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_S.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_S.gguf.part2of2) | i1-IQ2_S | 72.2 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_M.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ2_M.gguf.part2of2) | i1-IQ2_M | 79.2 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q2_K.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q2_K.gguf.split-ab) | i1-Q2_K | 87.6 | IQ3_XXS probably better |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_XXS.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_XXS.gguf.split-ab) | i1-IQ3_XXS | 92.7 | fast, lower quality |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_XS.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_XS.gguf.split-ab) | i1-Q3_K_XS | 96.5 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_XS.gguf.part1of3) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_XS.gguf.part2of3) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_XS.gguf.part3of3) | i1-IQ3_XS | 97.4 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_S.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_S.gguf.split-ab) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_S.gguf.split-ac) | i1-Q3_K_S | 103.5 | IQ3_XS probably better |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_S.gguf.part1of3) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_S.gguf.part2of3) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_S.gguf.part3of3) | i1-IQ3_S | 103.5 | fast, beats Q3_K* |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_M.gguf.part1of3) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_M.gguf.part2of3) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-IQ3_M.gguf.part3of3) | i1-IQ3_M | 105.5 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_M.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_M.gguf.split-ab) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_M.gguf.split-ac) | i1-Q3_K_M | 114.8 | IQ3_S probably better |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_L.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_L.gguf.split-ab) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q3_K_L.gguf.split-ac) | i1-Q3_K_L | 124.3 | IQ3_M probably better |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q4_K_S.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q4_K_S.gguf.split-ab) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q4_K_S.gguf.split-ac) | i1-Q4_K_S | 136.2 | optimal size/speed/quality |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q4_K_M.gguf.split-aa) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q4_K_M.gguf.split-ab) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q4_K_M.gguf.split-ac) | i1-Q4_K_M | 144.8 | fast, medium quality |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_S.gguf.part1of4) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_S.gguf.part2of4) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_S.gguf.part3of4) [PART 4](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_S.gguf.part4of4) | i1-Q5_K_S | 164.7 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_M.gguf.part1of4) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_M.gguf.part2of4) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_M.gguf.part3of4) [PART 4](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q5_K_M.gguf.part4of4) | i1-Q5_K_M | 169.7 |  |
| [PART 1](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q6_K.gguf.part1of5) [PART 2](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q6_K.gguf.part2of5) [PART 3](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q6_K.gguf.part3of5) [PART 4](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q6_K.gguf.part4of5) [PART 5](https://huggingface.co/mradermacher/giant-hydra-moe-240b-i1-GGUF/resolve/main/giant-hydra-moe-240b.i1-Q6_K.gguf.part5of5) | i1-Q6_K | 196.3 | practically like static Q6_K |


Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
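
As a rough aid for choosing among the files listed under Provided Quants, the sketch below picks the largest quant whose file size fits a given budget. The sizes are copied from the Size/GB column; the helper name `largest_quant_within` and the idea of a single size budget are assumptions made for this example. File size is only a lower bound on what you need to run a model, and size alone does not decide quality (see the Notes column).

```python
# (type, size in GB) pairs copied from the Size/GB column of the table.
QUANT_SIZES_GB = [
    ("i1-IQ1_S", 49.5), ("i1-IQ2_XXS", 63.4), ("i1-IQ2_XS", 70.5),
    ("i1-IQ2_S", 72.2), ("i1-IQ2_M", 79.2), ("i1-Q2_K", 87.6),
    ("i1-IQ3_XXS", 92.7), ("i1-Q3_K_XS", 96.5), ("i1-IQ3_XS", 97.4),
    ("i1-Q3_K_S", 103.5), ("i1-IQ3_S", 103.5), ("i1-IQ3_M", 105.5),
    ("i1-Q3_K_M", 114.8), ("i1-Q3_K_L", 124.3), ("i1-Q4_K_S", 136.2),
    ("i1-Q4_K_M", 144.8), ("i1-Q5_K_S", 164.7), ("i1-Q5_K_M", 169.7),
    ("i1-Q6_K", 196.3),
]


def largest_quant_within(budget_gb, quants=QUANT_SIZES_GB):
    """Return the (type, size) pair with the largest size <= budget_gb.

    Returns None when no listed quant fits. When two quants have the same
    size, the one listed first is returned.
    """
    fitting = [quant for quant in quants if quant[1] <= budget_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda quant: quant[1])
```

With a budget of 100 GB this returns `("i1-IQ3_XS", 97.4)`. Leave headroom in the budget for context and runtime overhead, which this sketch does not model.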

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time.

<!-- end -->