MoE-LLaVA: Mixture of Experts for Large Vision-Language Models Paper • 2401.15947 • Published Jan 29, 2024 • 49 • 4
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence Paper • 2401.14196 • Published Jan 25, 2024 • 48 • 2