Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
llm-blender
/
PairRM
like
191
Follow
LLM Blender
13
Text Generation
Transformers
Safetensors
6 datasets
English
deberta
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints
arxiv:
2306.02561
arxiv:
2112.09332
License:
mit
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
a2f8211
PairRM
Commit History
Update README.md
a2f8211
Dongfu Jiang
commited on
Nov 11, 2023
Update README.md
edac579
Dongfu Jiang
commited on
Nov 11, 2023
Update README.md
8d9ead8
Dongfu Jiang
commited on
Nov 11, 2023
Update README.md
00b7e60
Dongfu Jiang
commited on
Nov 11, 2023
Update README.md
80230fd
Dongfu Jiang
commited on
Nov 11, 2023
Update README.md
0ef6e21
Dongfu Jiang
commited on
Nov 11, 2023
Update README.md
bb45a4c
Dongfu Jiang
commited on
Nov 11, 2023
Upload 2 files
41fb3d0
Dongfu Jiang
commited on
Nov 6, 2023
Upload 9 files
7b9ee76
Dongfu Jiang
commited on
Nov 6, 2023
initial commit
d09aaea
DongfuJiang
commited on
Nov 6, 2023