Article Mergoo: Efficiently Build Your Own MoE LLM By alirezamsh • about 1 month ago • 36
Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29 • 70
Zero-shot text classification models Collection Collection of the best zero-shot text classification models. Fine-tune them with few examples using LiqFit - https://github.com/Knowledgator/LiqFit. • 9 items • Updated about 5 hours ago • 7
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs Paper • 2403.05020 • Published Mar 8 • 1
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Paper • 2403.08764 • Published Mar 13 • 34
Sora Reference Papers Collection A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Feb 20 • 51
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Paper • 2402.10176 • Published Feb 15 • 33
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models Paper • 2402.13064 • Published Feb 20 • 46
When StyleGAN Meets Stable Diffusion: a W_+ Adapter for Personalized Image Generation Paper • 2311.17461 • Published Nov 29, 2023 • 2
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On Paper • 2312.01725 • Published Dec 4, 2023 • 4
Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Paper • 2401.13795 • Published Jan 24 • 64
StoryGPT-V: Large Language Models as Consistent Story Visualizers Paper • 2312.02252 • Published Dec 4, 2023 • 1
MAGNeT Collection Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4 • 31
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Paper • 2309.05653 • Published Sep 11, 2023 • 9
Democratizing Reasoning Ability: Tailored Learning from Large Language Model Paper • 2310.13332 • Published Oct 20, 2023 • 14
Faithful Persona-based Conversational Dataset Generation with Large Language Models Paper • 2312.10007 • Published Dec 15, 2023 • 6
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models Paper • 2312.09608 • Published Dec 15, 2023 • 13
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models Paper • 2312.09767 • Published Dec 15, 2023 • 25
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent Paper • 2312.10003 • Published Dec 15, 2023 • 32
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 52
TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 34
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models Paper • 2312.06585 • Published Dec 11, 2023 • 26
LivePhoto: Real Image Animation with Text-guided Motion Control Paper • 2312.02928 • Published Dec 5, 2023 • 15
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis Paper • 2312.02155 • Published Dec 4, 2023 • 11
LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes Paper • 2311.13384 • Published Nov 22, 2023 • 48
GPT Can Solve Mathematical Problems Without a Calculator Paper • 2309.03241 • Published Sep 6, 2023 • 17