nicholasKluge committed · Commit 190f7f3 · 1 Parent(s): 43df3be · Update README.md
README.md CHANGED
@@ -57,7 +57,7 @@ This repository has the [source code](https://github.com/Nkluge-correa/Aira) use
 
 The ToxicityModelPT was trained as an auxiliary reward model for RLHF training (its logit outputs can be treated as penalizations/rewards). Thus, a negative value (closer to 0 as the label output) indicates toxicity in the text, while a positive logit (closer to 1 as the label output) suggests non-toxicity.
 
-Here's an example of how to use the
+Here's an example of how to use the ToxicityModelPT to score the toxicity of a text:
 
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
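For reference, here is a minimal sketch of how the snippet introduced by this commit might continue. The checkpoint identifier `nicholasKluge/ToxicityModelPT` and the single-logit (reward-style) output head are assumptions based on the README's description, not something confirmed by the truncated diff itself:

```python
# Sketch only: checkpoint name and single-logit head are assumptions.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

checkpoint = "nicholasKluge/ToxicityModelPT"  # assumed Hub repository name
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint)
model.eval()

# Tokenize a text and read out the raw logit as a toxicity score.
text = "Bom dia, tudo bem com você?"
inputs = tokenizer(text, return_tensors="pt", truncation=True)
with torch.no_grad():
    # Assumes the classification head emits a single logit per input,
    # as a reward model typically does.
    score = model(**inputs).logits[0].item()

print(f"Toxicity score (reward): {score:.4f}")
```

Per the README text in the diff above, a negative score would act as a penalization during RLHF training, while a positive score acts as a reward (i.e., suggests non-toxic text).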