Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
suayptalha
/
Maestro-R1-Llama-8B
like
4
Text Generation
Transformers
PyTorch
Safetensors
ServiceNow-AI/R1-Distill-SFT
English
llama
unsloth
trl
sft
conversational
text-generation-inference
Inference Endpoints
arxiv:
2501.12948
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
Maestro-R1-Llama-8B
/
maestro-r1-llama-loss.png
Commit History
Upload maestro-r1-llama-loss.png
a1cb89a
verified
suayptalha
commited on
Feb 1