A GPT-4V Level Multimodal LLM on Your Phone
chongyi
yuzaa
AI & ML interests
multimodal large language models
Recent Activity
new activity
1 day ago
Qwen/Qwen2.5-VL-7B-Instruct:some scores for minicpm-o 2.6 are not quite correct
new activity
4 days ago
openbmb/MiniCPM-o-2_6:Avoid duplicate input kwargs in `_decode`
new activity
7 days ago
openbmb/MiniCPM-o-2_6:Release `get_audio_placeholder` interface in processing
Organizations
Collections
1
models
None public yet
datasets
None public yet