Disclaimer

This model is not production ready, and really shouldn't be used for the most part due to its instability, unless you're okay with monitoring it so it doesn't go on forever!

Info

This is an experimental finetune of SicariusSicarriStuff's Redemption Wind 24B finetune / Mistral Small V3 for reasoning tasks. It is overfitted on 5 samples of data for 15 epochs using LoRA, and can exhibit some pretty amazing results! Assuming it's not a GGUF, oddly enough. I cannot replicate the following with a Q4_K_M quant. Here's the strawberry test:

<reasoning>
Okay, let me think.

First, I need to identify all the letters in "strawberry."

Then, I can count how many times the letter 'R' appears in that word.

So, what's the word again? "Strawberry."

Alright, let me break down the letters.

First, let me write down the word "strawberry" letter by letter: S T R A W B E R R Y.

Now, I see several letters here.

To find the 'R's, I should look for that particular letter.

Hmm, where are the 'R's in this word? Let me check each letter one by one.

First letter is 'S', not an 'R'.

Second letter is 'T', not an 'R' either.

Third letter is 'R' - ah, there's one!

Fourth letter is 'A', not an 'R'.

Fifth letter is 'W', no 'R' there.

Sixth letter is 'B', not an 'R'.

Seventh letter is 'E', no 'R' again.

Eighth letter is 'R' - another one!

Ninth letter is 'R' - a third one!

Tenth letter is 'Y', not an 'R'.

So, it seems there are three 'R's in "strawberry."

Let me verify this.

I think the 'R's are in positions 3, 8, and 9.

Yes, that seems right. So, the answer should be three.

Wait, let me double-check.

I'll read the word slowly and count the 'R's again: S-T-R-A-W-B-E-R-R-Y.

Yes, definitely three 'R's.

Alternatively, I could think about the word "strawberry" phonetically.

When I say "strawberry," I hear the sound of 'R' three times: in "straw," "ber," and "ry."

Each of these syllables contains an 'R'. So, phonetically, it confirms there are three 'R's.

This method also works and gives the same result.

Therefore, both methods of counting confirm that there are three 'R's in "strawberry."

I think this is solid.

No steps seem missing here, and the conclusion makes sense contextually.

The letter 'R' appears three times in the word "strawberry," both visually and phonetically.

So, the final answer should be 3.
</reasoning>

There are 3 R's in strawberry. ๐Ÿ“๐Ÿ“๐Ÿ“ ๐Ÿ“๐Ÿ“๐Ÿ“ ๐Ÿ“๐Ÿ“๐Ÿ“ ๐Ÿ“๐Ÿ“๐Ÿ“ ๐Ÿ“๐Ÿ“๐Ÿ“

(The ๐Ÿ“'s are an example of the model never stopping. I had to keyboard interrupt it.)

Future plans

I plan on doing more experiments with this model to help stop responses never ending. I have a feeling unsloth's "train on responses only" might have accidentally excluded the <|im_end|> token, and as a result, the model never stops.

Downloads last month
2
Safetensors
Model size
23.6B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for QuietImpostor/Mistral-R1-Preview

Finetuned
(1)
this model
Quantizations
1 model