7b tulu 2.5 - a hamishivi Collection

hamishivi 's Collections

Tulu 2 Llama 3 Update

LM Preference Datasets

7b tulu 2.5

updated Jun 25, 2024

a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.

hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm

Text Generation • Updated Jun 25, 2024 • 26
hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm-value

Token Classification • Updated Jun 25, 2024 • 22
hamishivi/tulu-v2.5-7b-uf-rm

Text Classification • Updated Jun 25, 2024 • 21