Cosmos Tokenizer Collection A suite of image and video tokenizers • 10 items • Updated 19 days ago • 20
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 21 days ago • 91
Magpie Conversation Ko Collection Magpie 데이터셋 한국어 번역본 (@nayohan님 번역 모델 사용) • 10 items • Updated 19 days ago • 1
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures Paper • 2406.06565 • Published Jun 3 • 9
Magpie-Qwen2 Datasets Collection Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Sep 14 • 10
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 65
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 22
Eurus Collection Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22 • 24
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15 • 57
zephyr-7b-sft-full-SPIN Collection Models fine-tuned with SPIN across iterations 0,1,2,3 • 4 items • Updated Feb 7 • 8
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9 • 54