Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs • Paper • 2412.21187 • Published 7 days ago • 26
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers • Paper • 2307.02321 • Published Jul 5, 2023 • 7