metadata

library_name: transformers
license: llama3
base_model:
  - flammenai/Mahou-1.3-llama3-8B
datasets:
  - flammenai/MahouMix-v1
  - flammenai/FlameMix-DPO-v1

Mahou-1.3a-llama3-8B

Mahou is our attempt to build a production-ready conversational/roleplay LLM.

Future versions will be released iteratively and finetuned from flammen.ai conversational data.

License

This model is based on Meta Llama-3-8B and is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT.

Chat Format

This model has been trained to use ChatML format. Note the additional tokens in tokenizer_config.json.

<|im_start|>system
{{system}}<|im_end|>
<|im_start|>{{char}}
{{message}}<|im_end|>
<|im_start|>{{user}}
{{message}}<|im_end|>

Roleplay Format

Speech without quotes.
Actions in *asterisks*

*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.

ST Settings

Use ChatML for the Context Template.
Enable Instruct Mode.
Use the Mahou preset.
Recommended: Add newline as a stopping string: ["\n"]

Method

Finetuned for 3 epochs using an A100 on Google Colab.

Fine-tune Llama 3 with ORPO - Maxime Labonne