README.md · dranger003/c4ai-command-r-plus-iMat.GGUF at 6079f57c5af789c0a43b13bc580b304aa72c31f3

metadata

license: cc-by-nc-4.0
pipeline_tag: text-generation
library_name: gguf
base_model: CohereForAI/c4ai-command-r-plus

2024-04-05: Support for this model is still being worked on - PR#6491.
For now, you can test the model using this fork: https://github.com/dranger003/llama.cpp/tree/Noeda/commandr-plus

GGUF importance matrix (imatrix) quants for https://huggingface.co/CohereForAI/c4ai-command-r-plus
The importance matrix was trained for ~100K tokens (200 batches of 512 tokens) using wiki.train.raw.
Which GGUF is right for me? (from Artefact2)
The imatrix is being used on the K-quants as well (only for < Q6_K).
You can merge GGUFs with gguf-split --merge <first-chunk> <output-file> although this is not required since f482bb2e.

C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities, this includes Retrieval Augmented Generation (RAG) and tool use to automate sophisticated tasks. The tool use in this model generation enables multi-step tool use which allows the model to combine multiple tools over multiple steps to accomplish difficult tasks. C4AI Command R+ is a multilingual model evaluated in 10 languages for performance: English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Arabic, and Simplified Chinese. Command R+ is optimized for a variety of use cases including reasoning, summarization, and question answering.

Layers	Context	Template
64	131072	<BOS_TOKEN><\|START_OF_TURN_TOKEN\|><\|USER_TOKEN\|>{prompt}<\|END_OF_TURN_TOKEN\|><\|START_OF_TURN_TOKEN\|><\|CHATBOT_TOKEN\|>{response}