MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published 22 days ago • 42
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models Paper • 2412.10117 • Published Dec 13, 2024 • 3
alibaba-damo/audio_codec-freqcodec_magphase-en-libritts-16k-gr8nq32ds320-pytorch Updated Oct 12, 2023 • 4 • 1
alibaba-damo/audio_codec-freqcodec_magphase-en-libritts-16k-gr1nq32ds320-pytorch Updated Oct 12, 2023 • 3
PolyLM: An Open Source Polyglot Large Language Model Paper • 2307.06018 • Published Jul 12, 2023 • 26