sileod/deberta-v3-large-tasksource-rlhf-reward-model
Text Classification
Transformers
PyTorch
Anthropic/hh-rlhf
English
deberta-v2
rlhf
Eval Results
Inference Endpoints
arxiv:2204.05862
Commit History (main branch)
Update README.md · 2787455 · sileod committed on Mar 28, 2023
Upload DebertaV2ForSequenceClassification · 213bdda · sileod committed on Mar 28, 2023
Update README.md · 683961f · sileod committed on Mar 28, 2023
Update README.md · e226218 · sileod committed on Mar 28, 2023
Update README.md · d60ef35 · sileod committed on Mar 28, 2023
Create README.md · 052ab03 · sileod committed on Mar 28, 2023
Upload tokenizer · bc9d816 · sileod committed on Mar 28, 2023
Upload DebertaV2ForMultipleChoice · 99b1e35 · sileod committed on Mar 28, 2023
initial commit · d44f6e4 · sileod committed on Mar 28, 2023