Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ubermenchh
/
llama3.1-8B-gsm8k-grpo
like
0
PyTorch
Safetensors
GGUF
llama
unsloth
trl
grpo
Inference Endpoints
conversational
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
llama3.1-8B-gsm8k-grpo
/
.gitattributes
Commit History
(Trained with Unsloth)
e6b220a
verified
ubermenchh
commited on
14 days ago
Upload tokenizer
b4d6dde
verified
ubermenchh
commited on
14 days ago
initial commit
99dce61
verified
ubermenchh
commited on
15 days ago