kaizuberbuehler
's Collections
Music Generation
updated
Long-form music generation with latent diffusion
Paper
•
2404.10301
•
Published
•
24
MuPT: A Generative Symbolic Music Pretrained Transformer
Paper
•
2404.06393
•
Published
•
15
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through
Direct Preference Optimization
Paper
•
2404.09956
•
Published
•
11
Joint Audio and Symbolic Conditioning for Temporally Controlled
Text-to-Music Generation
Paper
•
2406.10970
•
Published
•
1
Paper
•
2409.00587
•
Published
•
32
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion
Transformer
Paper
•
2409.10819
•
Published
•
18
Seed-Music: A Unified Framework for High Quality and Controlled Music
Generation
Paper
•
2409.09214
•
Published
•
50
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid
Emotions
Paper
•
2409.18042
•
Published
•
37
MuCodec: Ultra Low-Bitrate Music Codec
Paper
•
2409.13216
•
Published
•
23
High Fidelity Text-Guided Music Generation and Editing via Single-Stage
Flow Matching
Paper
•
2407.03648
•
Published
•
17
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Paper
•
2412.01169
•
Published
•
12