RAM requirements for running Llama-3.3-70B-Instruct-Q5_K_M.gguf

#4
by hyadav22 - opened

I have a server with 250 GB of RAM but no GPU. I attempted to run the model Llama-3.3-70B-Instruct-Q5_K_M.gguf, but it failed to load. I’d like to know the memory requirements for running the other Unsloth quantized models, such as:

Llama-3.3-70B-Instruct-Q2_K.gguf
Llama-3.3-70B-Instruct-Q3_K_M.gguf
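As a rough back-of-the-envelope check, the weight memory for a GGUF quant is approximately the parameter count times the bits per weight. The bits-per-weight figures below (~2.6 for Q2_K, ~3.9 for Q3_K_M, ~5.5 for Q5_K_M) and the flat overhead for KV cache and buffers are my assumptions, not official numbers; treat this as a sketch only:

```python
def est_ram_gb(n_params_billion: float, bits_per_weight: float, overhead_gb: float = 2.0) -> float:
    """Rough RAM estimate: weight bytes plus a small flat overhead
    for KV cache and runtime buffers (overhead is an assumption)."""
    weight_gb = n_params_billion * 1e9 * bits_per_weight / 8 / 1e9
    return weight_gb + overhead_gb

# Approximate bits-per-weight for llama.cpp k-quants (assumed values)
for name, bpw in [("Q2_K", 2.6), ("Q3_K_M", 3.9), ("Q5_K_M", 5.5)]:
    print(f"{name}: ~{est_ram_gb(70, bpw):.0f} GB")
```

By this estimate, Q5_K_M of a 70B model needs around 50 GB of RAM just for the weights, well under 250 GB, so raw capacity alone should not be the blocker.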

It should definitely work; I'm not sure why it's failing. Did you try the other GGUFs as well to see if they load?

Also, did you enable offloading?
