dranger003 nanoflooder commited on
Commit
b7e6fa6
1 Parent(s): a628832

Add model sizes (#5)

Browse files

- Add model sizes (1312b4aba1a7d4cc42b47b7b19aaa1d567826b07)


Co-authored-by: nanoflooder <nanoflooder@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -16,6 +16,24 @@ The PR has been approved, we should expect it to be merged shortly into the main
16
  * How do I use imatrix quants? Just like any other GGUF, the `.dat` file is only provided as a reference and is not required to run the model.
17
  * If your last resort is to use an IQ1 quant then go for IQ1_M.
18
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  > C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities, this includes Retrieval Augmented Generation (RAG) and tool use to automate sophisticated tasks. The tool use in this model generation enables multi-step tool use which allows the model to combine multiple tools over multiple steps to accomplish difficult tasks. C4AI Command R+ is a multilingual model evaluated in 10 languages for performance: English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Arabic, and Simplified Chinese. Command R+ is optimized for a variety of use cases including reasoning, summarization, and question answering.
20
 
21
  | Layers | Context | [Template](https://huggingface.co/CohereForAI/c4ai-command-r-plus#tool-use--multihop-capabilities) |
 
16
  * How do I use imatrix quants? Just like any other GGUF, the `.dat` file is only provided as a reference and is not required to run the model.
17
  * If your last resort is to use an IQ1 quant then go for IQ1_M.
18
 
19
+ | Quant | Size |
20
+ |--------|--------|
21
+ |iq1\_s | 23.2 |
22
+ |iq1\_m | 25.2 |
23
+ |iq2\_xxs| 28.6 |
24
+ |iq2\_xs | 31.6 |
25
+ |iq2\_s | 33.3 |
26
+ |iq2\_m | 36.0 |
27
+ |iq3\_xxs| 40.7 |
28
+ |iq3\_xs | 43.6 |
29
+ |iq3\_s | 46.0 |
30
+ |iq3\_m | 47.7 |
31
+ |iq4\_xs | 56.3 |
32
+ |q5\_k\_s| 71.8 |
33
+ |q6\_k | 85.1 |
34
+ |q8\_0 | 110.3 |
35
+ |fp16 | 207.8 |
36
+
37
  > C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities, this includes Retrieval Augmented Generation (RAG) and tool use to automate sophisticated tasks. The tool use in this model generation enables multi-step tool use which allows the model to combine multiple tools over multiple steps to accomplish difficult tasks. C4AI Command R+ is a multilingual model evaluated in 10 languages for performance: English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Arabic, and Simplified Chinese. Command R+ is optimized for a variety of use cases including reasoning, summarization, and question answering.
38
 
39
  | Layers | Context | [Template](https://huggingface.co/CohereForAI/c4ai-command-r-plus#tool-use--multihop-capabilities) |