Higher end quants to preserve model quality. Built for TabbyAPI and other exllamav2 supporting inference engines. Ready to deploy.
-
bigstorm/dolphin-2.9.2-qwen2-72b-6.0bpw-exl2
Text Generation • Updated • 10 • 1 -
bigstorm/dolphin-2.9.1-llama-3-70b-6.0bpw-exl2
Text Generation • Updated • 5 • 3 -
bigstorm/Hermes-2-Theta-Llama-3-8B-8.0bpw-8hb-exl2
Text Generation • Updated • 9 -
bigstorm/Yi-1.5-34B-Chat-16K-8.0bpw-8hb-exl2
Text Generation • Updated • 3