koshirowada/pythia_70m_dpo

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints


koshirowada committed on Nov 20, 2024

Commit 3debf66 · verified · 1 Parent(s): eebbfdd

Update README.md

Files changed (1)

README.md +2 -0


@@ -7,6 +7,8 @@ tags:
 - trl
 - dpo
 licence: license
+datasets:
+- tatsu-lab/alpaca_farm
 ---
 # Model Card for pythia_70m_dpo
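
The change above adds a `datasets:` list to the YAML front matter of README.md, just before the closing `---`. As a rough illustration only, the same edit could be scripted as below. The helper name `add_datasets` and the shortened sample card are hypothetical; the real front matter has earlier lines that the diff does not show, and this sketch uses plain string handling rather than any Hub API.

```python
def add_datasets(card_text: str, datasets: list[str]) -> str:
    """Insert a `datasets:` list at the end of the YAML front matter.

    Hypothetical helper for illustration; not part of the repository.
    """
    lines = card_text.split("\n")
    if not lines or lines[0].strip() != "---":
        raise ValueError("card has no YAML front matter")
    # Locate the closing '---' delimiter of the front matter.
    try:
        end = next(i for i in range(1, len(lines)) if lines[i].strip() == "---")
    except StopIteration:
        raise ValueError("front matter is not closed") from None
    # Leave the card unchanged if a datasets key already exists.
    if any(line.startswith("datasets:") for line in lines[1:end]):
        return card_text
    block = ["datasets:"] + [f"- {name}" for name in datasets]
    return "\n".join(lines[:end] + block + lines[end:])


# Shortened sample card (assumption: only the lines visible in the diff).
card = """---
tags:
- trl
- dpo
licence: license
---
# Model Card for pythia_70m_dpo
"""

updated = add_datasets(card, ["tatsu-lab/alpaca_farm"])
print(updated)
```

Running the helper a second time on its own output returns the text unchanged, so the edit is safe to repeat.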