Quant request
#1
by
OrangeApples
- opened
Hi
@zaq-hack
! I'm thankful that you're still making these rpcal models. Will you be making exl2 qaunts of the newly released:
xxx777xxxASD/L3-ChaoticSoliloquy-v1.5-4x8B
6.5bpw would probably be ideal for 24GB, 8k conext, and Q4 cache, but after testing this 6bpw it works great as well.
Nevermind. Upon testing, v1.5 seems to be quite unhinged (and imo worse) compared to the first version.
OrangeApples
changed discussion status to
closed