Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 20 days ago • 80
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 13 days ago • 34