Generate images from text prompts
Generate images from text prompts and reference images
Transcribe speech to text