DITTO: Diffusion Inference-Time T-Optimization for Music Generation Paper • 2401.12179 • Published Jan 22 • 20
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Paper • 2401.11053 • Published Jan 19 • 10