HuggingFaceH4/orca_dpo_pairs
Viewer
•
Updated
•
12.9k
•
178
•
25
A collection of datasets and models used for the Aligning LLMs with Direct Preference Optimization Methods blogpost