Refocus language on specific harms

#8
by yjernite HF staff - opened
Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -59,8 +59,8 @@ which constitutes a significant part of the StackExchange data,
  most users who answered the survey identified themselves as [White or European, men, between 25 and 34 years old, and based in the US (with a significant part of responders from India).](https://survey.stackoverflow.co/2022/#developer-profile-demographics)
  - May generate answers that are incorrect or misleading.
  - May copy answers from the training data verbatim.
- - Contains extremely NSFW data.
- - Suggested answers maybe illegal, unethical and/or distateful.
+ - May generate language that is hateful or promotes discrimination ([example](https://huggingface.co/trl-lib/llama-7b-se-rl-peft/discussions/7#64376083369f6f907f5bfe4c)).
+ - May generate language that is offensive to direct or indirect users or to people or groups mentioned.
 
 
  ### Recommendations