Model config.json has Mistral params instead of Mixtral, breaking ExLlama quants and maybe affecting others too
#3
opened by TheBloke
I got reports that ExLlamav2 wasn't working with this GPTQ. It turns out that's because it was trying to load it as a Mistral model, due to the architecture in `config.json` being set to Mistral instead of Mixtral.

Also, `rope_theta` should be 1000000.0 for Mixtral; this can affect inference quality.
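For anyone who wants to patch a local copy rather than re-download, here is a minimal sketch of the fix, assuming a standard Hugging Face-style `config.json` in the model directory (the `model_type` field and the old Mistral default of 10000.0 are my assumptions about the broken config, not confirmed from this repo):

```python
import json

# Hypothetical path to the quantised model's config; adjust to your local copy.
CONFIG_PATH = "config.json"

with open(CONFIG_PATH) as f:
    cfg = json.load(f)

# Point loaders at the Mixtral architecture instead of Mistral.
cfg["architectures"] = ["MixtralForCausalLM"]
cfg["model_type"] = "mixtral"  # assumed field; standard in HF Mixtral configs

# Mixtral was trained with rope_theta = 1e6 (Mistral's default is 1e4).
cfg["rope_theta"] = 1000000.0

with open(CONFIG_PATH, "w") as f:
    json.dump(cfg, f, indent=2)
```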
I don't think any of this would stop k-quants from working, though, so that issue might be unrelated. I'll try making some anyway.
Undi95 changed pull request status to merged