llama.cpp support?
#6
by ct-2 · opened
Is there a way to run this from RAM, or with disk offload, using Transformers in 4-bit? Thanks!
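Something along these lines is what I have in mind; just a rough sketch assuming bitsandbytes 4-bit plus accelerate offload actually works for this architecture (the model ID is a placeholder):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "org/model"  # placeholder for this repo's model ID

# 4-bit NF4 quantization via bitsandbytes; whatever doesn't fit on the GPU
# would need to be offloaded to CPU RAM and then disk by accelerate.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    llm_int8_enable_fp32_cpu_offload=True,  # offloaded modules stay in fp32
)

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",         # let accelerate split across GPU/CPU/disk
    offload_folder="offload",  # spill-over location on disk
    trust_remote_code=True,    # if the repo ships custom modeling code
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
```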
Support is being worked on in llama.cpp; follow the issue at https://github.com/ggerganov/llama.cpp/issues/6877. That requires not only model support landing, but also someone actually producing quantizations, which will take a very long time given the size of the model (and will be wildly impractical for most users).
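Once support lands and someone does publish GGUF quantizations, running from system RAM could look roughly like this with llama-cpp-python; the filename is hypothetical since no quants exist yet:

```python
from llama_cpp import Llama

# Hypothetical GGUF filename; no quantizations have been published yet.
llm = Llama(
    model_path="model-Q4_K_M.gguf",
    n_ctx=4096,
    n_gpu_layers=0,  # 0 = run entirely from system RAM on the CPU
)

print(llm("Hello, world!", max_tokens=32)["choices"][0]["text"])
```

Even then, expect CPU-only inference on a model this size to be extremely slow.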