VLMEvalKit Evaluation Results Collection
Extract text or generate Markdown from images
Convert images of screens to structured elements
Display LLM performance leaderboards
Generate code from text prompts