Identify objects in images using a vision model
Evaluate model predictions with correctness scores
Evaluate open-ended outputs from AI models using MM-Vet