morizon/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000 Text Generation • Updated 26 days ago • 198
daichira/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft_math-tanuki_adapter_0.9 Updated 22 days ago
daichira/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft_math-tanuki_adapter Updated 22 days ago
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja2000 Text Generation • Updated 20 days ago • 67
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja5000 Text Generation • Updated 19 days ago • 167