Please add quants for Firefly
Please could you add https://huggingface.co/invisietch/MiS-Firefly-v0.1-22B to your queue?
It may need the tokenizer.model from the Mistral Small repo (it did for me locally).
Great, I hate hacks like that :(
It's added, and I will replace the file when needed.
Cheers!
I assume you mean https://huggingface.co/mistralai/Mistral-Small-Instruct-2409 - I don't have access to that repo at the moment. Any alternative?
Do you have a Discord? I can send the file. Mine is the same as username here.
gguf-my-repo seems to quant it without the file but my local llama.cpp wouldn't.
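For reference, a local conversion along these lines might look like the sketch below. Paths, directory names, and the exact script names are assumptions based on a recent llama.cpp checkout, not the exact commands used here; the copied tokenizer.model is the "hack" mentioned above.

```shell
# Sketch of a local GGUF conversion with llama.cpp (paths are hypothetical).
# Assumes the model weights are downloaded to ./MiS-Firefly-v0.1-22B and the
# base model to ./Mistral-Small-Instruct-2409.
MODEL_DIR=./MiS-Firefly-v0.1-22B

# If conversion fails due to a missing tokenizer.model, copy the
# sentencepiece tokenizer over from the base Mistral Small repo.
cp ./Mistral-Small-Instruct-2409/tokenizer.model "$MODEL_DIR/"

# Convert the HF safetensors to an F16 GGUF (script name as in
# current llama.cpp; older checkouts used convert.py).
python convert_hf_to_gguf.py "$MODEL_DIR" \
  --outfile firefly-f16.gguf --outtype f16

# Quantize the F16 GGUF down to Q6_K.
./llama-quantize firefly-f16.gguf firefly-Q6_K.gguf Q6_K
```

This is just a sketch of the usual flow; gguf-my-repo automates the same steps server-side, which may be why it tolerated the missing tokenizer file.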
Ah, ok, I'll try without any hacks first.
If you give the model a try, let me know what you think. I get 70B vibes from it especially in creative writing.
Worked fine, should be done... very soon.
Already added to model card!
Too fast :) But seriously, you should link the imatrix quants instead once finished.
I linked both ;)
Hi, apologies. The model worked well at Q8 but there was a bug with quantized models (not your fault, happened when I tried making Q6 locally too).
It's fixed in https://huggingface.co/invisietch/MiS-Firefly-v0.2-22B -- would you mind doing GGUF for this when it's uploaded in a couple of hours?
Sure, just notify me when it's fully uploaded :)
It's up, and thanks!
Aaand it's queued :)