Crocy Cheng

zhycheng4ai

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

NexaAIDev/omnivision-968M

liked a model 9 days ago

OuteAI/OuteTTS-0.1-350M

liked a model 9 days ago

black-forest-labs/FLUX.1-dev

Organizations

None yet

zhycheng4ai's activity

upvoted 8 papers 13 days ago

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 47

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Paper • 2406.18521 • Published Jun 26 • 28

MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding

Paper • 2110.08518 • Published Oct 16, 2021 • 1

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 46

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Paper • 2406.10118 • Published Jun 14 • 30

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4 • 36

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 10