Some benchmarks

by ChuckMcSneed - opened Oct 13

Oct 13

Your model scores the highest among Largestral finetunes (not merges) on UGI and my benchmark. Good work.
In my personal experience it feels a bit dumber than the official model, but less so than the other community tunes. It also got hornier and better at negativity. Feels almost worth the sacrifice in intelligence.

TheDrummer

Owner Oct 14

Thanks! I've got one more trick up my sleeve that might bring Behemoth v2 closer to OG Largestral.

gghfez

Oct 14

After using it for a few days, this is my favorite model for writing, and it's still smart enough to keep loaded for coding/work, etc. Whatever you did with your slop removal experiments on the smaller models is working.

TheDrummer

Owner Oct 14

@gghfez I haven't used the slop removal on anything but Nemo yet xD

I'll try it on Cydonia soon.

gghfez

Oct 14

Ah okay. I haven't used it for "role-playing", but I'm finding it's great at "write X in the style of " style prompts.

Prompt: "Write a story based on Battlestar Galactica in the prose of Haruki Murakami from the perspective of Gius Baltar"

https://imgur.com/a/vyMdeES

The Behemoth story is the only one that feels like a Murakami novel but also understands the character in the sci-fi series I referenced.
Mistral-Large, on the other hand, feels like a Mistral-Large story with its "hushed corridors".

TheDrummer

Owner Oct 14

@gghfez wow, that's actually pretty good. did you use metharme or mistral?

gghfez

Oct 15

Mistral.

Generally, I've noticed that with these finetunes of Instruct models, if you use the original template, the prose/voice changes still come through.

BigHuggyD

Oct 15

This model has become my favorite multi-purpose tool. Subjectively, it is the best balance of creativity and smarts available today. It has become my current 'daily driver' ... well done
