nicholasKluge/Aux-RewardModel
Text Classification
Transformers
Safetensors
nicholasKluge/toxic-aira-dataset
Anthropic/hh-rlhf
English
roberta
reward model
alignment
preference model
RLHF
Carbon Emissions
Inference Endpoints
License: apache-2.0
Commit History (main)
Update README.md · 1ba5b15 · verified · nicholasKluge committed on Jun 18, 2024
Update config.json · 144ae68 · verified · nicholasKluge committed on May 27, 2024
Update README.md · fed22d0 · verified · nicholasKluge committed on May 27, 2024
Update README.md · 7a584f3 · verified · nicholasKluge committed on May 27, 2024
Upload hh_rlhf_eval.parquet · 5d2556a · verified · nicholasKluge committed on May 27, 2024
Upload LICENSE · cbccea9 · verified · nicholasKluge committed on May 27, 2024
Create README.md · c3ba6f1 · verified · nicholasKluge committed on May 27, 2024
Upload emissions.csv with huggingface_hub · 72aa516 · verified · nicholasKluge committed on May 27, 2024
Upload folder using huggingface_hub · 70e65be · verified · nicholasKluge committed on May 27, 2024
initial commit · 12aaab0 · verified · nicholasKluge committed on May 27, 2024