|
--- |
|
license: apache-2.0 |
|
language: |
|
- en |
|
base_model: |
|
- meta-llama/Llama-3.1-8B-Instruct |
|
pipeline_tag: text-generation |
|
tags: |
|
- RP |
|
- roleplay |
|
- nsfw |
|
- not-for-all-audiences |
|
--- |
|
|
|
|
|
# HikariBloom-v0.3-RP |
|
|
|
<img src="gumi.png" alt="Image" style="display: block; margin-left: auto; margin-right: auto; width: 65%;"> |
|
|
|
HikariBloom-v0.3-RP is a chatbot model built upon meta-llama/Llama-3.1-8B-Instruct, with additional SFT training. |
|
This model is designed to enable engaging conversations with a variety of characters. |
|
However, it may encounter safety issues related to toxic or NSFW content. |
|
|
|
We are not liable for any commercial damage or losses incurred from the use of this model. |
|
|
|
### Look forward to our next model! We are preparing a Preference Fine-Tuning model using a reward model. |
|
|
|
## How to start |
|
|
|
```import transformers |
|
import torch |
|
|
|
model_id = "Rookied2/HikariBloom-v0.3-RP" |
|
|
|
pipeline = transformers.pipeline( |
|
"text-generation", |
|
model=model_id, |
|
model_kwargs={"torch_dtype": torch.bfloat16}, |
|
device_map="auto", |
|
) |
|
|
|
messages = [ |
|
{"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"}, |
|
{"role": "user", "content": "Who are you?"}, |
|
] |
|
|
|
outputs = pipeline( |
|
messages, |
|
max_new_tokens=256, |
|
) |
|
print(outputs[0]["generated_text"][-1]) |
|
``` |
|
|
|
## Recommended chat templates : This is a chat template we've used a lot in training. |
|
``` |
|
### template 1 |
|
|
|
character_name : {character_name} |
|
|
|
character_description : {character_description} |
|
|
|
you're roleplaying as a character. |
|
|
|
### template 2 |
|
character_name : {character_name} |
|
|
|
character_description : {character_description} |
|
|
|
chat_exmaple: |
|
{example} |
|
|
|
see chat_exmaple, you are roleplaying as a character. |
|
``` |
|
|
|
### Training Data |
|
- PygmalionAI/PIPPA (https://huggingface.co/datasets/PygmalionAI/PIPPA) |
|
- MinervaAI/Aesir-Preview (https://huggingface.co/datasets/MinervaAI/Aesir-Preview) |
|
- HuggingFaceH4/no_robots (https://huggingface.co/datasets/HuggingFaceH4/no_robots) |
|
- HuggingFaceTB/smol-smoltalk (https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk) |
|
- Undi95/toxic-dpo-v0.1-sharegpt (https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt) |
|
- NobodyExistsOnTheInternet/ToxicQAFinal (https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal) |
|
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. --> |
|
|
|
### Contact Email |
|
For more information, feel free to contact us |
|
|
|
Email: Harry@supergene.co |
|
|
|
|
|
|
|
|
|
|