Preference Datasets for DPO - a argilla Collection

argilla 's Collections

Synthetic Data Generator

Datasets built with ⚗️ distilabel

Open Image Generation Models

Argilla v2.0 compatible datasets

DIBT Prompt collective SPIN

Preference Datasets for DPO

Preference Datasets for KTO

Domain Specific Data

Preference Datasets for DPO

updated Dec 11, 2024

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs

argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 431 • 70

Note Binarized version of `OpenBMB/UltraFeedback` using the average of the preference ratings of the attributes: helpfulness, honesty, truthfulness, and instruction following
argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 5.29k • 131

Note Iteration on top of `argilla/ultrafeedback-binarized-preferences` to remove the prompts contaminated from TruthfulQA within the original `OpenBMB/UltraFeedback` dataset
argilla/ultrafeedback-multi-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 158k • 52 • 6

Note Iteration on top of `argilla/ultrafeedback-binarized-preferences-cleaned` to keep every rejected response per each one chosen, instead of picking a random one, leading to an augmented dataset to be used for DPO fine-tuning experiments.
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 155k • 46 • 4
berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 702 • 286

Note Ranking dataset used by Starling
argilla/distilabel-intel-orca-dpo-pairs

Viewer • Updated Feb 5, 2024 • 12.9k • 618 • 170
argilla/distilabel-capybara-dpo-7k-binarized

Viewer • Updated Jul 16, 2024 • 7.56k • 1.77k • 180