henchen99
/

Llama-3-3B-Open-R1-GRPO-med-cot-1k

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-3-3B-Open-R1-GRPO-med-cot-1k / trainer_state.json

henchen99's picture

Model save

d24b670 verified 9 days ago

history contribute delete

319 kB

File too large to display, you can check the raw version instead.