Can you please make a 3bpw branch?

#2
by CTXEE - opened

I want use tabbyAPI with draft_model to speed up, and leave some vram for other applications or more ctx..

Sign up or log in to comment