yentinglin/Mistral-Small-24B-Instruct-2501-reasoning • Text Generation • Updated about 22 hours ago • 6 • 20
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning • Paper • 2502.06060 • Published 8 days ago • 31
unsloth/Mistral-Small-24B-Instruct-2501-unsloth-bnb-4bit • Text Generation • Updated 16 days ago • 13.5k • 11
unsloth/DeepSeek-R1-Distill-Qwen-32B-unsloth-bnb-4bit • Text Generation • Updated 3 days ago • 2.33k • 8
UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published 27 days ago • 50
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer • Paper • 2501.15570 • Published 23 days ago • 23
Towards General-Purpose Model-Free Reinforcement Learning • Paper • 2501.16142 • Published 22 days ago • 26