-
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation
Paper • 2311.07965 • Published • 1 -
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding
Paper • 2311.08673 • Published -
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation
Paper • 2311.08670 • Published -
Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data
Paper • 2309.16196 • Published
Lab of Large Audio Model
community
AI & ML interests
Large Audio Model、Text to Speech (TTS)、Voice Conversion、Talking Face、Music AI、Speech Security、Infant Acoustic
Organization Card
Edit this README.md
markdown file to author your organization card 🔥
Collections
1
spaces
1
models
None public yet
datasets
None public yet