4-bit GGML version?
#4
by jbollenbacher - opened
Hi team! Love your work on OpenAssistant!
Can we get these models ported to a 4-bit GGML version? Especially the Pythia-based models, but the LLaMA ones would be nice too. This would make them much more portable and easier to experiment with.
Thanks!
Please see my repositories here and on GitHub. GPT-NeoX-based models (the OpenAssistant StableLM and Pythia models) will not run in llama.cpp or ggml at the moment, but my fork of llama.cpp will. Best of luck!
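
For the LLaMA-based checkpoints, here is a minimal sketch of the usual llama.cpp convert-then-quantize flow. The model directory name is hypothetical, and the exact script name and quantization flags vary by llama.cpp version (older versions use convert-pth-to-ggml.py and a numeric quantization type), so treat this as illustrative rather than exact:

```
# Convert the PyTorch/Hugging Face checkpoint to an f16 GGML file.
# convert.py ships with llama.cpp; "models/oasst-llama-7b" is a placeholder path.
python3 convert.py models/oasst-llama-7b/

# Quantize the f16 file down to 4 bits (q4_0).
./quantize models/oasst-llama-7b/ggml-model-f16.bin \
           models/oasst-llama-7b/ggml-model-q4_0.bin q4_0

# Run the quantized model with the llama.cpp example binary.
./main -m models/oasst-llama-7b/ggml-model-q4_0.bin -p "Hello" -n 128
```

The GPT-NeoX-based models (StableLM, Pythia) use a different architecture, so this flow only works for them in a llama.cpp fork with GPT-NeoX support, as noted above.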