Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ubermenchh
/
llama3.1-8B-gsm8k-grpo
like
0
PyTorch
Safetensors
GGUF
llama
unsloth
trl
grpo
Inference Endpoints
conversational
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
llama3.1-8B-gsm8k-grpo
/
pytorch_model-00001-of-00004.bin
Commit History
Trained with Unsloth
b42217f
verified
ubermenchh
commited on
15 days ago