Reading List (Mainly Focused of VLM's and Diffusion Models)
-
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Paper • 2310.03502 • Published • 75 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 10 -
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Paper • 2311.15127 • Published • 9 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 8