MatrixC7
/

karakuri-lm-70b-chat-v0.1-exl2

Text Generation

Model card Files Files and versions Community

Edit model card

This repository contains the exl2 quants of karakuri-ai/karakuri-lm-70b-chat-v0.1, calibrated using the default dataset built by exllamav2.

Compatible with exllamav2 version 0.0.11 and later. For optimal model loading, it is recommended to use tabbyAPI.

The measurement file is attached in the branch main and all quants are stored in their respective branches.

The chart below presents the calibration perplexity and wikitext-103-v1 test perplexity for all provided quants.

Quant	Calibration Perplexity	wikitext-103-v1 Test Perplexity
2.4bpw-h6	8.3509	7.1054
2.65bpw-h6	8.0491	6.4045
3.0bpw-h6	7.7330	6.0661
4.0bpw-h6	7.4540	5.6886
4.65bpw-h6	7.4089	5.5980
5.0bpw-h6	7.3924
6.0bpw-h6	7.3674
8.0bpw-h8	7.3603

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Examples

Text Generation

Unable to determine this model's library. Check the docs .

Model tree for MatrixC7/karakuri-lm-70b-chat-v0.1-exl2

Base model

meta-llama/Llama-2-70b-hf

Finetuned

(28)

this model

Datasets used to train MatrixC7/karakuri-lm-70b-chat-v0.1-exl2