Making Multimodal Generation Easier: When Diffusion Models Meet LLMs • Paper • 2310.08949 • Published Oct 13, 2023 • 1
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models • Paper • 2503.09573 • Published about 23 hours ago • 23
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models • Paper • 2308.04729 • Published Aug 9, 2023 • 32
PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation • Paper • 2411.08307 • Published Nov 13, 2024 • 6
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation • Paper • 2405.20289 • Published May 30, 2024 • 11
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion • Paper • 2402.14285 • Published Feb 22, 2024 • 1
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning • Paper • 2308.11276 • Published Aug 22, 2023