Datasets used to train the LION pipeline. Paper: https://arxiv.org/abs/2407.06542; Code: https://github.com/Columbia-NLP-Lab/LionAlignment
Columbia NLP
university
AI & ML interests
Natural language processing group at Columbia University
Organization Card
Columbia University - NLP
models
20
Columbia-NLP/LION-Gemma-2b-sft-v1.0
Text Generation
•
Updated
•
11
Columbia-NLP/LION-Gemma-2b-dpo-v1.0
Text Generation
•
Updated
•
12
Columbia-NLP/LION-Gemma-2b-odpo-v1.0
Text Generation
•
Updated
•
10
•
4
Columbia-NLP/LION-LLaMA-3-8b-sft-v1.0
Text Generation
•
Updated
•
12
Columbia-NLP/LION-LLaMA-3-8b-dpo-v1.0
Text Generation
•
Updated
•
12
•
2
Columbia-NLP/LION-LLaMA-3-8b-odpo-v1.0
Text Generation
•
Updated
•
10
•
2
Columbia-NLP/llama3-8b-instruct-rewriting-r-Decor
Text Generation
•
Updated
•
8
Columbia-NLP/llama3-8b-instruct-rewriting-nr-Decor
Text Generation
•
Updated
•
9
Columbia-NLP/llama2-7b-rewriting-r-Decor
Text Generation
•
Updated
•
10
Columbia-NLP/llama2-7b-rewriting-nr-Decor
Text Generation
•
Updated
•
13
datasets
17
Columbia-NLP/DPO-hh-rlhf
Viewer
•
Updated
•
169k
•
42
Columbia-NLP/DPO-PKU-SafeRLHF
Viewer
•
Updated
•
136k
•
49
Columbia-NLP/DPO-HelpSteer
Viewer
•
Updated
•
9.17k
•
41
Columbia-NLP/DPO-tldr-summarisation-preferences
Viewer
•
Updated
•
177k
•
54
Columbia-NLP/DPO-py-dpo-v0.1
Viewer
•
Updated
•
9.47k
•
44
Columbia-NLP/DPO-UltraFeedback_binarized
Viewer
•
Updated
•
62.7k
•
43
Columbia-NLP/DPO-distilabel-intel-orca-dpo-pairs_cleaned
Viewer
•
Updated
•
12.8k
•
44
Columbia-NLP/DPO-distilabel-capybara-dpo-7k-binarized
Viewer
•
Updated
•
7.56k
•
40
Columbia-NLP/DPO-Nectar
Viewer
•
Updated
•
183k
•
63
Columbia-NLP/VarBench
Preview
•
Updated
•
44