Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
anakin87
/
gemma-2b-orpo
like
28
Text Generation
Transformers
Safetensors
alvarobartt/dpo-mix-7k-simplified
English
gemma
trl
orpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
Inference Endpoints
arxiv:
2403.07691
License:
gemma-terms-of-use
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
main
gemma-2b-orpo
/
notebooks
/
usage.ipynb
Commit History
retry nb visualization
f18f009
anakin87
commited on
Mar 26, 2024
improve notebook visualization
c8b9386
anakin87
commited on
Mar 26, 2024
material
4db7146
anakin87
commited on
Mar 25, 2024