Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss

Experimental model. A LimaRP QLoRA was trained at 10k context length (longer than the longest LimaRP sample when tokenized with Mistral's tokenizer) on mistralai/Mixtral-8x7B-v0.1, using Charles Goddard's ZLoss and Megablocks-based fork of transformers. The adapter was then fused to mistralai/Mixtral-8x7B-Instruct-v0.1 at 0.5 weight.
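The card does not spell out how the fuse at 0.5 weight was done. One common reading is that the LoRA delta is scaled by 0.5 before being added to the target weights. The sketch below shows that arithmetic on tiny matrices in plain Python. The function names, the `alpha / rank` scaling convention, and the toy shapes are assumptions for illustration, not the author's actual merge script.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def fuse_lora(base, lora_a, lora_b, alpha, rank, weight=0.5):
    """Return base + weight * (alpha / rank) * (lora_b @ lora_a).

    base:   out x in weight matrix of the target model (hypothetical toy values)
    lora_a: rank x in adapter matrix
    lora_b: out x rank adapter matrix
    weight: merge weight; 0.5 mirrors the figure quoted in the card
    """
    delta = matmul(lora_b, lora_a)
    scale = weight * alpha / rank
    return [
        [w + scale * d for w, d in zip(base_row, delta_row)]
        for base_row, delta_row in zip(base, delta)
    ]


# Toy example: a 2x2 identity weight with a rank-1 adapter.
fused = fuse_lora(
    base=[[1.0, 0.0], [0.0, 1.0]],
    lora_a=[[1.0, 2.0]],
    lora_b=[[1.0], [3.0]],
    alpha=2,
    rank=1,
    weight=0.5,
)
```

A weight below 1.0 applies only part of the adapter's learned change, which is one way to blend roleplay tuning into the Instruct weights without fully overriding them.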

My current generation settings are:

Temperature: 1.25
Min-p: 0.05
Repetition penalty: 1.05
Repetition penalty range: 1024

And this seems to avoid the Mixtral looping pitfalls for me so far. Play around with it and see what works well for you.
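For readers unfamiliar with min-p, here is a minimal sketch of how temperature and min-p interact, written in plain Python. It assumes temperature is applied before the min-p filter; real backends differ in sampler order, and the function names are made up for this example. Repetition penalty is left out to keep it short.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def min_p_filter(probs, min_p=0.05):
    """Keep indices whose probability is at least min_p times the top probability."""
    threshold = min_p * max(probs)
    return [i for i, p in enumerate(probs) if p >= threshold]


def sample_token(logits, temperature=1.25, min_p=0.05, rng=random):
    """Draw one token index from the min-p filtered distribution."""
    probs = softmax(logits, temperature)
    kept = min_p_filter(probs, min_p)
    # random.choices renormalises the weights of the surviving tokens.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]
```

Because the cutoff scales with the top token's probability, a high temperature such as 1.25 can flatten the distribution while the min-p filter still discards the unlikely tail.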

Peft Adapter

Quants courtesy of TheBloke:

Exl2 Quants courtesy of LoneStriker:

Usage:

The intended prompt format is the Alpaca instruction format of LimaRP v3:

### Instruction:
Character's Persona: {bot character description}

User's Persona: {user character description}

Scenario: {what happens in the story}

Play the role of Character. Taking the above information into consideration, you must engage in a roleplaying chat with User below this line. Do not write dialogues and narration for User.

### Input:
User: {utterance}

### Response:
Character: {utterance}

### Input:
User: {utterance}

### Response:
Character: {utterance}

(etc.)
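If you are building prompts by hand rather than through a frontend, the format above can be assembled along these lines. This is a sketch only: `build_prompt` and its parameter names are hypothetical, and the exact whitespace your frontend emits may differ.

```python
def build_prompt(char_persona, user_persona, scenario, turns):
    """Assemble a LimaRP v3 Alpaca-style prompt string.

    turns is a list of (speaker, text) pairs where speaker is
    "user" or "character". The prompt ends with an open
    Character turn for the model to complete.
    """
    instruction = (
        "### Instruction:\n"
        f"Character's Persona: {char_persona}\n\n"
        f"User's Persona: {user_persona}\n\n"
        f"Scenario: {scenario}\n\n"
        "Play the role of Character. Taking the above information into "
        "consideration, you must engage in a roleplaying chat with User "
        "below this line. Do not write dialogues and narration for User."
    )
    blocks = [instruction]
    for speaker, text in turns:
        if speaker == "user":
            blocks.append(f"### Input:\nUser: {text}")
        elif speaker == "character":
            blocks.append(f"### Response:\nCharacter: {text}")
        else:
            raise ValueError(f"unknown speaker: {speaker!r}")
    # Leave the final response open so the model writes Character's reply.
    blocks.append("### Response:\nCharacter:")
    return "\n\n".join(blocks)


prompt = build_prompt(
    "A weary innkeeper.",
    "A traveling bard.",
    "The bard arrives at the inn during a storm.",
    [("user", "Any rooms left tonight?")],
)
```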

My current templates have been uploaded to a folder.

Message length control

Due to the inclusion of LimaRP v3, it is possible to append a length modifier to the response instruction sequence, like this:

### Input:
User: {utterance}

### Response: (length = medium)
Character: {utterance}

This has an immediately noticeable effect on bot responses. The available lengths are: micro, tiny, short, medium, long, massive, huge, enormous, humongous, unlimited. The recommended starting length is medium. Keep in mind that the AI may ramble or impersonate the user with very long messages.
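A small helper can keep the modifier restricted to the lengths listed above. This is a sketch; `response_header` is a hypothetical name, and the only thing taken from the card is the list of lengths and the `(length = ...)` syntax.

```python
# Length modifiers listed in the card, shortest to longest.
LENGTHS = (
    "micro", "tiny", "short", "medium", "long",
    "massive", "huge", "enormous", "humongous", "unlimited",
)


def response_header(length=None):
    """Return the response instruction sequence, optionally with a length modifier."""
    if length is None:
        return "### Response:"
    if length not in LENGTHS:
        raise ValueError(f"length must be one of {LENGTHS}, got {length!r}")
    return f"### Response: (length = {length})"


header = response_header("medium")
```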

Bias, Risks, and Limitations

The model will show biases similar to those observed in niche roleplaying forums on the Internet, in addition to those exhibited by the base model. It is not intended for supplying factual information or advice in any form.

Training Details

This model is a merge. Please refer to the linked repositories of the merged models for details.
