A GPT-4V Level Multimodal LLM on Your Phone
chongyi
yuzaa
AI & ML interests
multimodal large language models
Recent Activity
authored
a paper
21 days ago
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
authored
a paper
21 days ago
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and
Training Recipe
upvoted
a
paper
21 days ago
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and
Training Recipe