diwank's Collections
argilla/intel-orca-dpo-pairs-helm-instruct
Viewer
•
Updated
•
5
•
40
•
1
argilla/OpenHermes2.5-dpo-binarized-alpha
Viewer
•
Updated
•
9.79k
•
72
•
64
argilla/ultrafeedback-critique
Viewer
•
Updated
•
253k
•
46
•
4
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
7.79k
•
124
ai2lumos/lumos_maths_plan_onetime
Viewer
•
Updated
•
19.8k
•
51
•
2
ai2lumos/lumos_unified_plan_iterative
Viewer
•
Updated
•
55.4k
•
56
•
2
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
•
Updated
•
19.4k
•
60
•
3
lmsys/mt_bench_human_judgments
Viewer
•
Updated
•
5.76k
•
399
•
113
lmsys/chatbot_arena_conversations
Viewer
•
Updated
•
33k
•
569
•
340
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
6.25k
•
343
Qwen/Qwen1.5-32B
Text Generation
•
Updated
•
13.3k
•
81
vicgalle/configurable-system-prompt-multitask
Viewer
•
Updated
•
1.95k
•
146
•
19
paraloq/json_data_extraction
Viewer
•
Updated
•
484
•
71
•
17
iamtarun/python_code_instructions_18k_alpaca
Viewer
•
Updated
•
18.6k
•
1.96k
•
230
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Paper
•
2403.15042
•
Published
•
25
Paper
•
2402.12219
•
Published
•
15
M4-ai/prm_dpo_pairs_cleaned
Viewer
•
Updated
•
7.99k
•
57
•
11
SanjiWatsuki/Kunoichi-DPO-v2-7B
Text Generation
•
Updated
•
850
•
82
mlabonne/orpo-dpo-mix-40k
Viewer
•
Updated
•
44.2k
•
1.49k
•
245
meta-llama/Meta-Llama-3-8B
Text Generation
•
Updated
•
707k
•
5.83k
FreedomIntelligence/evol-instruct-hindi
Viewer
•
Updated
•
59k
•
11
•
2
totally-not-an-llm/EverythingLM-data-V3
Viewer
•
Updated
•
1.07k
•
77
•
31
RUCAIBox/Story-Generation
Updated
•
61
•
11
imone/Llama-3-8B-fixed-special-embedding
Text Generation
•
Updated
•
1.04k
•
15
Norquinal/claude_multiround_chat_30k
Viewer
•
Updated
•
32.2k
•
30
•
53
Norquinal/claude_multi_instruct_30k
Viewer
•
Updated
•
32.2k
•
23
•
10
Locutusque/OpenCerebrum-2.0-SFT
Viewer
•
Updated
•
6.4k
•
50
•
4
Locutusque/OpenCerebrum-2.0-DPO
Viewer
•
Updated
•
720
•
42
•
4
gradientai/Llama-3-70B-Instruct-Gradient-262k
Text Generation
•
Updated
•
167
•
55
princeton-nlp/QuRating-GPT3.5-Judgments
Viewer
•
Updated
•
250k
•
42
•
5
mustafaaljadery/gemma-2B-10M
jondurbin/airoboros-70b-3.3
Text Generation
•
Updated
•
2.5k
•
14
princeton-nlp/Llama-3-Instruct-8B-SimPO
Text Generation
•
Updated
•
3.2k
•
55
nvidia/Nemotron-4-340B-Reward
Updated
•
353
•
109
Magpie-Align/Magpie-Pro-MT-300K-v0.1
Viewer
•
Updated
•
300k
•
515
•
28
Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.1
Text Generation
•
Updated
•
2.62k
•
4
nvidia/Aegis-AI-Content-Safety-Dataset-1.0
Viewer
•
Updated
•
12k
•
994
•
44
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
•
60k
•
2.64k
•
385
diwank/llmlingua-compressed-text
Viewer
•
Updated
•
222k
•
48
•
2
diwank/python-code-execution-output
Viewer
•
Updated
•
3.61k
•
45
•
1
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices
Paper
•
2406.08451
•
Published
•
23
cognitivecomputations/samantha-1.5
Viewer
•
Updated
•
327
•
51
•
11
HannahRoseKirk/prism-alignment
Viewer
•
Updated
•
77.9k
•
1.01k
•
60
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
•
11.4k
•
151
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
•
16.4k
•
48
PKU-Alignment/PKU-SafeRLHF-30K
Viewer
•
Updated
•
29.9k
•
318
•
8
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer
•
Updated
•
249k
•
333
•
58
imbue/human_question_quality_judgments
Viewer
•
Updated
•
167k
•
44
•
8
imbue/high_quality_public_evaluations
Viewer
•
Updated
•
12.8k
•
44
•
6
imbue/high_quality_private_evaluations
Viewer
•
Updated
•
10.6k
•
137
•
8
google/gemma-2-27b
Text Generation
•
Updated
•
20.1k
•
177
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
•
Updated
•
6.87k
•
77
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
•
2406.20094
•
Published
•
94
hubertsiuzdak/snac_24khz
hubertsiuzdak/snac_32khz
hubertsiuzdak/snac_44khz
Updated
•
1.07k
•
7
facebook/chameleon-30b
Image-Text-to-Text
•
Updated
•
931
•
82
facebook/chameleon-7b
Image-Text-to-Text
•
Updated
•
20.5k
•
165
gokaygokay/random_instruct_docci
Viewer
•
Updated
•
14.6k
•
119
•
5
internlm/internlm2_5-7b
Text Generation
•
Updated
•
4.92k
•
15
Gryphe/Opus-WritingPrompts
Viewer
•
Updated
•
14.9k
•
597
•
31
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
Paper
•
2405.18952
•
Published
•
10
OpenGVLab/InternVL2-4B
Image-Text-to-Text
•
Updated
•
79.4k
•
42
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
•
Updated
•
209k
•
204
QuasarResearch/apollo-preview-v0.2
Viewer
•
Updated
•
51.4k
•
402
•
62
fireworks-ai/nexus_parallel_messages
Viewer
•
Updated
•
70
•
40
•
6
fireworks-ai/nexus_parallel_functions
Viewer
•
Updated
•
29
•
40
•
4
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
•
Updated
•
181k
•
132
•
76
chargoddard/WebInstructSub-prometheus
Viewer
•
Updated
•
2.39M
•
161
•
16
chargoddard/chai-feedback-pairs
Viewer
•
Updated
•
30.1k
•
32
•
5
nayohan/multi_session_chat
Viewer
•
Updated
•
23.4k
•
52
•
1
nvidia/Mistral-NeMo-12B-Instruct
Updated
•
129
•
138
nvidia/Mistral-NeMo-12B-Base
meta-llama/Llama-3.1-8B
Text Generation
•
Updated
•
1.02M
•
1.08k
meta-llama/Prompt-Guard-86M
Text Classification
•
Updated
•
106k
•
189
mistralai/Mistral-Large-Instruct-2407
Updated
•
27.8k
•
802
Symbol-LLM/Symbolic_Collection
Viewer
•
Updated
•
975k
•
87
•
7
roborovski/dolly-entity-extraction
Viewer
•
Updated
•
5.95k
•
184
•
2
kalomaze/Opus_Instruct_25k
Viewer
•
Updated
•
25.1k
•
81
•
31
Vezora/Code-Preference-Pairs
Viewer
•
Updated
•
54k
•
59
•
14
Nexusflow/Athene-70B
Text Generation
•
Updated
•
8.61k
•
189
arcee-ai/Arcee-Spark
Text Generation
•
Updated
•
3.06k
•
86
OpenBuddy/openbuddy-llama3.1-8b-v22.2-131k
Text Generation
•
Updated
•
294
•
2
google/gemma-2-2b
Text Generation
•
Updated
•
10.5M
•
420
google/gemma-scope
google/shieldgemma-2b
Text Generation
•
Updated
•
5.36k
•
47
argilla/magpie-ultra-v0.1
Viewer
•
Updated
•
50k
•
392
•
213
mlabonne/Llama-3.1-70B-Instruct-lorablated-GGUF
Updated
•
3.63k
•
38
internlm/internlm2_5-20b
Text Generation
•
Updated
•
393
•
16
Gryphe/Sonnet3.5-Charcard-Roleplay
Updated
•
380
•
37
NousResearch/hermes-function-calling-v1
Viewer
•
Updated
•
11.6k
•
597
•
214
AlgorithmicResearchGroup/ArXivDLInstruct
Viewer
•
Updated
•
778k
•
169
•
13
upstage/solar-pro-preview-instruct
Text Generation
•
Updated
•
1.3k
•
423
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
•
383
arcee-ai/Llama-3.1-SuperNova-Lite
Text Generation
•
Updated
•
9.64k
•
174
Skywork/Skywork-Reward-Gemma-2-27B
Text Classification
•
Updated
•
182k
•
37
argilla/FinePersonas-v0.1
Viewer
•
Updated
•
21.1M
•
3.44k
•
318
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
134
bespokelabs/Bespoke-MiniCheck-7B
Text Classification
•
Updated
•
8.06k
•
48
mlabonne/open-perfectblend
Viewer
•
Updated
•
1.42M
•
960
•
44
rombodawg/Everything_Instruct
Viewer
•
Updated
•
4.05M
•
3.26k
•
43