Model Fails to run on featherless.ai

#1
by Darok-recursal - opened

I don't know if it's the same for your benchmarks, but I suspect the model could be quantized without that being specified. Has anyone found out why?

I'm so sorry for this late response! This was an earlier version of my model, which is still being updated. I always release 4-bit medium and small GGUF quants for each model release, as these are the quants I use personally!

Thanks for your response. Having an fp8 version sounds like standard practice to me. If the model is quantized, that should be specified in the model name to avoid any confusion. Could you provide an fp8 version? People on featherless want to try your model :D
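As an aside on the confusion above: one quick way to check whether a repo declares its weights as quantized is to look for a `quantization_config` entry in the model's `config.json`. This is a minimal sketch (the helper name is hypothetical; it assumes the standard Transformers config layout and won't cover GGUF files, which carry quantization info in their own header):

```python
import json

def describe_quantization(config_path: str) -> str:
    """Report whether a Transformers-style config.json declares quantization.

    Hypothetical helper: checks for the `quantization_config` key that
    libraries like Transformers write for quantized checkpoints, and falls
    back to reporting the declared `torch_dtype` otherwise.
    """
    with open(config_path) as f:
        cfg = json.load(f)

    qc = cfg.get("quantization_config")
    if qc is not None:
        # quant_method is present for common schemes (e.g. gptq, awq, bitsandbytes)
        return f"quantized: {qc.get('quant_method', 'unknown method')}"
    return f"no quantization declared; torch_dtype = {cfg.get('torch_dtype', 'unspecified')}"
```

If the config has no `quantization_config` but the serving endpoint still behaves like a quantized model, the quantization likely happened on the provider's side, which is exactly why spelling it out in the model name helps.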
