silviasapora/gemma-7b-orpo-low-quality
Text Generation
Transformers
TensorBoard
Safetensors
silviasapora/low_quality_dpo7k
gemma
alignment-handbook
trl
orpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License: gemma
Commit History: gemma-7b-orpo-low-quality/all_results.json (revision 3f81b20)
Model save (3f81b20, verified), silviasapora committed on Sep 21
Model save (9dc97f6, verified), silviasapora committed on Sep 19
End of training (df52d15, verified), silviasapora committed on Sep 18
Model save (7d94d4f, verified), silviasapora committed on Sep 18
Model save (4fc7839, verified), silviasapora committed on Sep 15