HikariBloom-v0.3-RP / README.md
Rookied2's picture
Update README.md
1fa356c verified
---
license: apache-2.0
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
pipeline_tag: text-generation
tags:
- RP
- roleplay
- nsfw
- not-for-all-audiences
---
# HikariBloom-v0.3-RP
<img src="gumi.png" alt="Image" style="display: block; margin-left: auto; margin-right: auto; width: 65%;">
HikariBloom-v0.3-RP is a chatbot model built upon meta-llama/Llama-3.1-8B-Instruct, with additional SFT training.
This model is designed to enable engaging conversations with a variety of characters.
However, it may encounter safety issues related to toxic or NSFW content.
We are not liable for any commercial damage or losses incurred from the use of this model.
### Look forward to our next model! We are preparing a Preference Fine-Tuning model using a reward model.
## How to start
```import transformers
import torch
model_id = "Rookied2/HikariBloom-v0.3-RP"
pipeline = transformers.pipeline(
"text-generation",
model=model_id,
model_kwargs={"torch_dtype": torch.bfloat16},
device_map="auto",
)
messages = [
{"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
{"role": "user", "content": "Who are you?"},
]
outputs = pipeline(
messages,
max_new_tokens=256,
)
print(outputs[0]["generated_text"][-1])
```
## Recommended chat templates : This is a chat template we've used a lot in training.
```
### template 1
character_name : {character_name}
character_description : {character_description}
you're roleplaying as a character.
### template 2
character_name : {character_name}
character_description : {character_description}
chat_exmaple:
{example}
see chat_exmaple, you are roleplaying as a character.
```
### Training Data
- PygmalionAI/PIPPA (https://huggingface.co/datasets/PygmalionAI/PIPPA)
- MinervaAI/Aesir-Preview (https://huggingface.co/datasets/MinervaAI/Aesir-Preview)
- HuggingFaceH4/no_robots (https://huggingface.co/datasets/HuggingFaceH4/no_robots)
- HuggingFaceTB/smol-smoltalk (https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk)
- Undi95/toxic-dpo-v0.1-sharegpt (https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
- NobodyExistsOnTheInternet/ToxicQAFinal (https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
### Contact Email
For more information, feel free to contact us
Email: Harry@supergene.co