Please open mouth kiss the homies.

by snombler - opened 12 days ago

12 days ago

This could be us but you playin'.

(Making exl2s before ggufs is a crime.)

12 days ago

•

llama.cpp's tokenization handling in the past two months is perhaps equally criminal

12 days ago

Not wrong! But until someone else wants to support split loading, it's all we've really got, sadly. Also, thanks for all your contributions.

12 days ago

tbh exl2 simply produces better outputs.

12 days ago

I am graciously willing to accept 3090s to run exl2s for anyone who has them to spare. I'll need enough to run at least 64k context.

11 days ago

Only see 2 bit exl but 4KM gguf. We got different definitions of "before"

11 days ago

It's just proof that bullying works.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment