VLMEvalKit Evaluation Results Collection
Extract text or generate Markdown from images
Convert images of screens to structured elements
View LLM Performance Leaderboard
Generate text based on input prompts