
	---
	license: apache-2.0
	language:
	- en
	base_model:
	- meta-llama/Llama-3.1-8B-Instruct
	pipeline_tag: text-generation
	tags:
	- RP
	- roleplay
	- nsfw
	- not-for-all-audiences
	---


	# HikariBloom-v0.3-RP

	<img src="gumi.png" alt="Image" style="display: block; margin-left: auto; margin-right: auto; width: 65%;">

	HikariBloom-v0.3-RP is a chatbot model built on meta-llama/Llama-3.1-8B-Instruct with additional supervised fine-tuning (SFT).
	It is designed for engaging conversations with a variety of characters.
	However, it may generate toxic or NSFW content.

	We are not liable for any commercial damage or losses incurred from the use of this model.

	### Look forward to our next model! We are preparing a preference fine-tuned model trained with a reward model.

	## How to start

	```python
	import transformers
	import torch

	model_id = "Rookied2/HikariBloom-v0.3-RP"

	# Load the model in bfloat16 and let accelerate place it on the available devices.
	pipeline = transformers.pipeline(
	    "text-generation",
	    model=model_id,
	    model_kwargs={"torch_dtype": torch.bfloat16},
	    device_map="auto",
	)

	# Chat-style input: a system prompt followed by the user's message.
	messages = [
	    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
	    {"role": "user", "content": "Who are you?"},
	]

	outputs = pipeline(
	    messages,
	    max_new_tokens=256,
	)
	# The last entry of "generated_text" is the assistant's reply message.
	print(outputs[0]["generated_text"][-1])
	```

	## Recommended chat templates

	These are the chat templates we used most often in training.
	```
	### template 1

	character_name : {character_name}

	character_description : {character_description}

	you're roleplaying as a character.

	### template 2
	character_name : {character_name}

	character_description : {character_description}

	chat_exmaple:
	{example}

	see chat_exmaple, you are roleplaying as a character.
	```
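The templates above can be filled in programmatically. The following is a minimal sketch, not the authors' own code: it assumes the filled template is passed as the `system` message, which the card does not state explicitly. The helper names `build_system_prompt` and `build_messages` are hypothetical. The spelling `chat_exmaple` is reproduced verbatim from the templates, since it may match what the model saw in training.

```python
from typing import Dict, List, Optional

# Template strings copied from the card; the "chat_exmaple" spelling is kept verbatim.
TEMPLATE_1 = (
    "character_name : {character_name}\n\n"
    "character_description : {character_description}\n\n"
    "you're roleplaying as a character."
)

TEMPLATE_2 = (
    "character_name : {character_name}\n\n"
    "character_description : {character_description}\n\n"
    "chat_exmaple:\n"
    "{example}\n\n"
    "see chat_exmaple, you are roleplaying as a character."
)


def build_system_prompt(
    character_name: str,
    character_description: str,
    example: Optional[str] = None,
) -> str:
    """Fill template 1, or template 2 when an example dialogue is given."""
    if example is None:
        return TEMPLATE_1.format(
            character_name=character_name,
            character_description=character_description,
        )
    return TEMPLATE_2.format(
        character_name=character_name,
        character_description=character_description,
        example=example,
    )


def build_messages(system_prompt: str, user_message: str) -> List[Dict[str, str]]:
    """Assumption: the filled template goes in the system message."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_message},
    ]
```

The list returned by `build_messages` has the same shape as the `messages` list in "How to start", so it can be passed to the same `pipeline(...)` call in its place.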

	### Training Data
	- PygmalionAI/PIPPA (https://huggingface.co/datasets/PygmalionAI/PIPPA)
	- MinervaAI/Aesir-Preview (https://huggingface.co/datasets/MinervaAI/Aesir-Preview)
	- HuggingFaceH4/no_robots (https://huggingface.co/datasets/HuggingFaceH4/no_robots)
	- HuggingFaceTB/smol-smoltalk (https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk)
	- Undi95/toxic-dpo-v0.1-sharegpt (https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
	- NobodyExistsOnTheInternet/ToxicQAFinal (https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)

	### Contact Email
	For more information, feel free to contact us.

	Email: Harry@supergene.co