turboderp/Qwen2-VL-72B-Instruct-exl2


turboderp committed on Nov 19, 2024

Commit c1d4c8f · verified · 1 Parent(s): eb07e25

Update README.md

Files changed (1)

README.md +2 -4


@@ -3,13 +3,11 @@ license: apache-2.0
 ---
 EXL2 quants of [Qwen2-VL-72B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-72B-Instruct)
-[2.30 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/2.3bpw)
-[2.50 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/2.5bpw)
-[3.00 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/3.0bpw)
-[3.50 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/3.5bpw)
 [4.00 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/4.0bpw)
 [4.50 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/4.5bpw)
 [5.00 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/5.0bpw)
 [6.00 bits per weight](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/tree/6.0bpw)
+(2.3bpw to 3.5bpw revisions are also in this repo, but they are unstable. Working on it.)
 [measurement.json](https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2/blob/main/measurement.json)
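
Each quant in this README lives on its own branch (revision), named after its bits-per-weight value. A minimal sketch of how that naming maps to the links above, and how one branch might be fetched. The helper names (`tree_url`, `is_listed_stable`, `download_quant`) are hypothetical and not part of the repo. The download step assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function with the `revision` argument.

```python
REPO_ID = "turboderp/Qwen2-VL-72B-Instruct-exl2"

# Branches still linked in the README after this commit.
LISTED_BRANCHES = ("4.0bpw", "4.5bpw", "5.0bpw", "6.0bpw")
# Branches the commit delinks and describes as unstable.
UNSTABLE_BRANCHES = ("2.3bpw", "2.5bpw", "3.0bpw", "3.5bpw")


def tree_url(branch: str) -> str:
    """Build the browser URL for one quant branch (hypothetical helper)."""
    return f"https://huggingface.co/{REPO_ID}/tree/{branch}"


def is_listed_stable(branch: str) -> bool:
    """True if the README still links this branch (hypothetical helper)."""
    return branch in LISTED_BRANCHES


def download_quant(branch: str, local_dir: str) -> str:
    """Fetch one branch; assumes huggingface_hub is installed."""
    # Imported lazily so the helpers above work without the package.
    from huggingface_hub import snapshot_download

    if not is_listed_stable(branch):
        raise ValueError(f"{branch} is not one of the listed quants")
    return snapshot_download(repo_id=REPO_ID, revision=branch, local_dir=local_dir)
```

For example, `download_quant("4.0bpw", "./qwen2-vl-4.0bpw")` would pull only the 4.00 bits per weight revision rather than the `main` branch.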