SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 2 days ago • 52
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 10 days ago • 28
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 24 days ago • 109
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 18 days ago • 32
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 20 days ago • 120
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 7 days ago • 49
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 29 days ago • 24
view article Article Janus Pro: DeepSeek's Revolutionary Multimodal AI Model By LLMhacker • about 1 month ago • 31
Albertina Collection Albertina family of encoders for Portuguese • 9 items • Updated Jul 26, 2024 • 2
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 64