Beyond Autoregression: Fast LLMs via Self-Distillation Through Time • arXiv:2410.21035 • Published Oct 28, 2024
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders • arXiv:2410.22366 • Published Oct 28, 2024
Going beyond Compositions, DDPMs Can Produce Zero-Shot Interpolations • arXiv:2405.19201 • Published May 29, 2024
Object-centric architectures enable efficient causal representation learning • arXiv:2310.19054 • Published Oct 29, 2023
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models • arXiv:2402.19427 • Published Feb 29, 2024
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning • arXiv:2308.03526 • Published Aug 7, 2023