Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts • Paper 2309.04354 • Published Sep 8, 2023
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks • Paper 1811.00056 • Published Oct 31, 2018