Which model should I use with my single 3090 and 32 GB of RAM?
I would run GPTQ Q4 or GGUF Q5.
With the Q5_K_L GGUF I can fit all 59 layers in 24 GB of VRAM with a 16k context size on a 4090.
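For reference, here's roughly what that setup looks like with llama-cpp-python (a minimal sketch: the model path and prompt are placeholders, and it assumes the package was built with CUDA support):

```python
# pip install llama-cpp-python (built with CUDA for GPU offload)
from llama_cpp import Llama

llm = Llama(
    model_path="./model-Q5_K_L.gguf",  # placeholder: your local GGUF file
    n_gpu_layers=-1,  # -1 offloads every layer to the GPU (all 59 here)
    n_ctx=16384,      # 16k context, as reported above
)

# Quick smoke test
out = llm("Q: What is 2 + 2? A:", max_tokens=16)
print(out["choices"][0]["text"])
```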
Nice! Thanks for the report.