Dongfu Jiang committed on
Commit 7333fb2 (1 parent: 14d4a72)

Update README.md

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -26,6 +26,7 @@ Inspired by [DeBERTa Reward Model Series](https://huggingface.co/OpenAssistant/r
 
 - Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
 - Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
+- Space Demo: [https://huggingface.co/spaces/llm-blender/LLM-Blender](https://huggingface.co/spaces/llm-blender/LLM-Blender)
 
 ## Statistics
 