turboderp's picture
Update README.md
c1d4c8f verified
metadata
license: apache-2.0

EXL2 quants of Qwen2-VL-72B-Instruct

4.00 bits per weight
4.50 bits per weight
5.00 bits per weight
6.00 bits per weight

(2.3bpw to 3.5bpw revisions are in also this repo, but they are unstable. Working on it.)

measurement.json