DanielSc4
/

RedPajama-INCITE-Chat-3B-v1-FT-LoRA-8bit-test1

Model card Files Files and versions Community

DanielSc4 commited on Aug 10, 2023

Commit

f477d24

•

1 Parent(s): e0be977

Create README.md

Files changed (1) hide show

README.md +15 -0

README.md ADDED Viewed

	@@ -0,0 +1,15 @@

+---
+license: apache-2.0
+language:
+- en
+---
+Pre-trained model fine-tuned using Reinforcement Learning on [DIALOCONAN](https://github.com/marcoguerini/CONAN#dialoconan) dataset using [facebook/roberta-hate-speech-dynabench-r4-target](https://huggingface.co/facebook/roberta-hate-speech-dynabench-r4-target) as reward model.
+Toxicity results on [allenai/real-toxicity-prompts](https://huggingface.co/datasets/allenai/real-toxicity-prompts) dataset using custom prompts (see 🥞[RewardLM](https://github.com/DanielSc4/RewardLM) for details).
+| Toxicity Level | RedPajama-INCITE-Chat-3B |
+|:--------------:|:------------------------:|
+|             Pre-Trained |           0.217          |
+|             **Fine-Tuned** |           **0.129**          |
+|  [RL](https://huggingface.co/DanielSc4/RedPajama-INCITE-Chat-3B-v1-RL-LoRA-8bit-test1) |         0.160        |