Transformers
Safetensors
English
deberta-v2
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints

Commit History

Update README.md
69f3ec1
verified

maywell commited on

Update README.md
2fc8ad5
verified

maywell commited on

Update README.md
0319c87
verified

maywell commited on

Update README.md
c9f7a5d
verified

maywell commited on

Update README.md
a775416
verified

maywell commited on

Update README.md
4c4f60b
verified

maywell commited on

Upload folder using huggingface_hub
e50d81e
verified

maywell commited on

Update README.md
90d89ed
verified

maywell commited on

Update README.md
1b901ae
verified

maywell commited on

initial commit
7fc0cce
verified

maywell commited on