Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
llm-blender
/
PairRM-hf
like
14
Follow
LLM Blender
14
Text Generation
Transformers
Safetensors
6 datasets
English
deberta-v2
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints
arxiv:
2306.02561
arxiv:
2112.09332
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
PairRM-hf
Commit History
Update README.md
f9000bd
Dongfu Jiang
commited on
Jan 8, 2024
Update README.md
0d37d21
Dongfu Jiang
commited on
Jan 8, 2024
Update README.md
342b809
Dongfu Jiang
commited on
Jan 8, 2024
Update README.md
8841a8a
Dongfu Jiang
commited on
Jan 6, 2024
Update README.md
096ad41
Dongfu Jiang
commited on
Jan 6, 2024
Update README.md
b9ac13b
Dongfu Jiang
commited on
Jan 5, 2024
Update README.md
28afd59
Dongfu Jiang
commited on
Jan 5, 2024
Update README.md
bfd2da5
Dongfu Jiang
commited on
Jan 5, 2024
Update README.md
02971ea
Dongfu Jiang
commited on
Jan 5, 2024
Upload 8 files
3f84fdc
Dongfu Jiang
commited on
Jan 5, 2024
initial commit
cb453fc
DongfuJiang
commited on
Jan 5, 2024