SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model • Paper • 2210.00705 • Published Oct 3, 2022
Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer • Paper • 2111.04093 • Published Nov 7, 2021