christopherthompson81/Lite-Mistral-150M-v2-Instruct-SPPO-Iter3
Safetensors
UCLA-AGI/data-mistral-7b-instruct-sppo-iter1
christopherthompson81/sppo-synthetic-dataset-lite-mistral-150m-v2
mistral
SPPO
alignment-handbook
Generated from Trainer
License: apache-2.0
5dce7c2
Lite-Mistral-150M-v2-Instruct-SPPO-Iter3/generation_config.json
christopherthompson81
Upload 12 files
08bb349
verified
5 months ago
111 Bytes
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "transformers_version": "4.44.0"
}
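As a minimal sketch of how the configuration above might be read and sanity-checked, the following uses only Python's standard `json` module. The names `RAW_CONFIG` and `load_generation_config` are hypothetical and chosen for illustration; they are not part of the repository or of the `transformers` library. The sketch assumes that `bos_token_id` and `eos_token_id` are the beginning-of-sequence and end-of-sequence token ids, as their names suggest.

```python
import json

# Contents of generation_config.json, copied from the file shown above.
RAW_CONFIG = """
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "transformers_version": "4.44.0"
}
"""


def load_generation_config(text: str) -> dict:
    """Parse the JSON text and check that both token ids are plain integers.

    Hypothetical helper for illustration only.
    """
    config = json.loads(text)
    for key in ("bos_token_id", "eos_token_id"):
        value = config.get(key)
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            raise ValueError(f"{key} must be an integer, got {value!r}")
    return config


config = load_generation_config(RAW_CONFIG)
print("BOS id:", config["bos_token_id"])
print("EOS id:", config["eos_token_id"])
print("Written by transformers", config["transformers_version"])
```

When working with the `transformers` library itself, this file is normally loaded automatically alongside the model, so a manual parse like this is mainly useful for inspection.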