Kanzhi Cheng

cckevinn

AI & ML interests

None yet

cckevinn's activity

upvoted a paper about 14 hours ago

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published 5 days ago • 12

authored 4 papers 28 days ago

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Paper • 2401.10935 • Published Jan 17, 2024 • 4

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Paper • 2406.11736 • Published Jun 17, 2024 • 5

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30, 2024 • 5

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 82

upvoted a paper about 1 month ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 82

upvoted a collection about 1 month ago

OS-Genesis

Collection

11 items • Updated 29 days ago • 6

reacted to Symbol-LLM's post with 🚀🔥🔥 2 months ago

Post

995

🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning!

📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

🔗 Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

😇 Takeaways:

- We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing.

- Evaluations on comprehensive and diverse vision-language reasoning tasks are included!