6 117 26

meng shao

meng-shao

shao__meng

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

upvoted a paper 10 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

upvoted a paper 11 days ago

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

View all activity

Organizations

meng-shao's activity

upvoted 2 papers 10 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 10 days ago • 64

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 10 days ago • 72

upvoted a paper 11 days ago

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Paper • 2502.20730 • Published 14 days ago • 33

upvoted a paper 25 days ago

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published 27 days ago • 52

upvoted a paper 29 days ago

LM2: Large Memory Models

Paper • 2502.06049 • Published Feb 9 • 30

upvoted a paper about 1 month ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 142

liked a Space about 1 month ago

428

Chat with DeepSeek-VL2-small

🌍

Generate responses using images and text input

upvoted a paper about 1 month ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3 • 24

reacted to AdinaY's post with 🔥 about 2 months ago

Post

1461

VideoLLaMA 3🔥multimodal foundation models for Image and Video Understanding by DAMO Alibaba

Model: DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
Paper: VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding (2501.13106)

✨ 2B/7B
✨ Apache2.0

1 reply

upvoted 3 papers about 2 months ago

upvoted 5 papers 3 months ago

RAG Playground: A Framework for Systematic Evaluation of Retrieval Strategies and Prompt Engineering in RAG Systems

Paper • 2412.12322 • Published Dec 16, 2024 • 1

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46

AI PERSONA: Towards Life-long Personalization of LLMs

Paper • 2412.13103 • Published Dec 17, 2024 • 2

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published Dec 23, 2024 • 22

commented a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352 •

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

liked a Space 3 months ago

Openai Realtime Voice

💻

Talk with openAI's new Realtime Voice API