wolfram committed on
Commit
75fa698
1 Parent(s): c31949f

Update README.md


Removed EXL2 extra info from HF model card.

Files changed (1)
  1. README.md +0 -4
README.md CHANGED

@@ -20,10 +20,6 @@ tags:
  - HF: wolfram/miquliz-120b-v2.0
  - GGUF: [Q2_K | IQ3_XXS | Q4_K_M | Q5_K_M](https://huggingface.co/wolfram/miquliz-120b-v2.0-GGUF)
  - EXL2: [2.4bpw](https://huggingface.co/wolfram/miquliz-120b-v2.0-2.4bpw-h6-exl2) | [2.65bpw](https://huggingface.co/wolfram/miquliz-120b-v2.0-2.65bpw-h6-exl2) | [3.0bpw](https://huggingface.co/wolfram/miquliz-120b-v2.0-3.0bpw-h6-exl2) | [3.5bpw](https://huggingface.co/wolfram/miquliz-120b-v2.0-3.5bpw-h6-exl2) | [4.0bpw](https://huggingface.co/wolfram/miquliz-120b-v2.0-4.0bpw-h6-exl2) | [5.0bpw](https://huggingface.co/wolfram/miquliz-120b-v2.0-5.0bpw-h6-exl2)
- - **Max Context w/ 48 GB VRAM:** (24 GB VRAM is not enough, even for 2.4bpw, use [GGUF](https://huggingface.co/wolfram/miquliz-120b-v2.0-GGUF) instead!)
- - **2.4bpw:** 32K (32768 tokens) w/ 8-bit cache, 21K (21504 tokens) w/o 8-bit cache
- - **2.65bpw:** 30K (30720 tokens) w/ 8-bit cache, 15K (15360 tokens) w/o 8-bit cache
- - **3.0bpw:** 12K (12288 tokens) w/ 8-bit cache, 6K (6144 tokens) w/o 8-bit cache

  This is v2.0 of a 120b frankenmerge created by interleaving layers of [miqu-1-70b-sf](https://huggingface.co/152334H/miqu-1-70b-sf) with [lzlv_70b_fp16_hf](https://huggingface.co/lizpreciatior/lzlv_70b_fp16_hf) using [mergekit](https://github.com/cg123/mergekit). Better than v1.0 thanks to the improved recipe adapted from [TheProfessor-155b](https://huggingface.co/abacusai/TheProfessor-155b) by [Eric Hartford](https://erichartford.com/), it is now achieving top rank with double perfect scores in [my LLM comparisons/tests](https://www.reddit.com/r/LocalLLaMA/search?q=author%3AWolframRavenwolf+Comparison%2FTest&sort=new&t=all).
 
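For context on the "interleaving layers" wording in the model description above, here is a minimal sketch of what a mergekit `passthrough` merge configuration looks like, written from Python. The layer ranges and file names are illustrative assumptions only; they are not the actual miquliz-120b-v2.0 recipe, which is documented in the model card itself.

```python
# Illustrative sketch of a mergekit "passthrough" config that interleaves
# layer slices from two 70b source models into one taller frankenmerge.
# Layer ranges below are made up for illustration; NOT the real recipe.
import yaml  # pip install pyyaml

config = {
    "merge_method": "passthrough",  # stack layer slices rather than averaging weights
    "dtype": "float16",
    "slices": [
        {"sources": [{"model": "152334H/miqu-1-70b-sf", "layer_range": [0, 20]}]},
        {"sources": [{"model": "lizpreciatior/lzlv_70b_fp16_hf", "layer_range": [10, 30]}]},
        {"sources": [{"model": "152334H/miqu-1-70b-sf", "layer_range": [20, 40]}]},
        # ... further alternating slices up to the final layers of the source models ...
    ],
}

with open("interleave-sketch.yml", "w") as f:
    yaml.safe_dump(config, f, sort_keys=False)

# The merge itself would then be run with mergekit's CLI, e.g.:
#   mergekit-yaml interleave-sketch.yml ./merged-model
```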