
Fairly pleased

#2 opened by Utochi

I'm decently happy with this model using the Q5_0 variant. I can't say that it's perfect, but it really tries its best. Tested on a 2700-token character card with complex instructions, and it does better than most. It doesn't quite match the precision of the QuartetAnemoi Q2_K variant, but what it lacks in instruction-following precision it makes up for in speed and creativity, IMO.
Using the Faraday GUI with default settings at 6k context.
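For anyone who wants to try a similar setup outside Faraday, here's a rough sketch using llama-cpp-python. The model filename, context size, and prompts are placeholders I'm assuming, not what Faraday actually does under the hood:

```python
from llama_cpp import Llama

# Placeholder path: point this at your Q5_0 GGUF file.
llm = Llama(
    model_path="models/model-Q5_0.gguf",
    n_ctx=6144,        # roughly the 6k context mentioned above
    n_gpu_layers=-1,   # offload all layers to GPU if one is available
)

# Minimal chat-style test; the character card would go in the system prompt.
out = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are the character described in the card."},
        {"role": "user", "content": "Hello!"},
    ],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```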

@Undi95 any chance you could make a 40B MoE with Llama-3?
