heegyu's Collections
Reward Modeling Datasets
Updated Nov 19
nvidia/HelpSteer • Updated 5 days ago • 37.1k • 1.74k • 227
Anthropic/hh-rlhf • Updated May 26, 2023 • 169k • 10.7k • 1.23k
stanfordnlp/SHP • Updated Oct 10, 2023 • 386k • 1.46k • 295
PKU-Alignment/PKU-SafeRLHF • Updated Oct 18 • 164k • 3.77k • 120
openai/webgpt_comparisons • Updated Dec 19, 2022 • 19.6k • 404 • 226
openai/summarize_from_feedback • Updated Jan 3, 2023 • 194k • 1.02k • 189
HuggingFaceH4/ultrafeedback_binarized • Updated Oct 16 • 187k • 6.72k • 254
berkeley-nest/Nectar • Updated Mar 20 • 183k • 405 • 281
HuggingFaceH4/stack-exchange-preferences • Updated Mar 8, 2023 • 10.8M • 624 • 125
HuggingFaceH4/hhh_alignment • Updated Mar 2, 2023 • 221 • 164 • 17
Birchlabs/openai-prm800k-stepwise-critic • Updated Jun 3, 2023 • 1.09M • 242 • 43
prometheus-eval/Feedback-Collection • Updated Oct 14, 2023 • 100k • 565 • 107
argilla/OpenHermesPreferences • Updated Mar 1 • 989k • 760 • 201
allenai/reward-bench • Updated Sep 9 • 8.11k • 6.2k • 80
nvidia/HelpSteer2 • Updated 5 days ago • 21.4k • 13.1k • 386
Magpie-Align/Magpie-Pro-DPO-200K • Updated Aug 20 • 207k • 35 • 6
argilla/magpie-ultra-v0.1 • Updated 27 days ago • 50k • 328 • 218