softmax is not enough (for sharp out-of-distribution) Paper • 2410.01104 • Published Oct 1, 2024 • 1
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published Sep 4, 2024 • 71
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion Paper • 2407.01392 • Published Jul 1, 2024 • 39
Transformers Can Represent n-gram Language Models Paper • 2404.14994 • Published Apr 23, 2024 • 18
Scaling Instructable Agents Across Many Simulated Worlds Paper • 2404.10179 • Published Mar 13, 2024 • 27