meng shao's picture

meng shao

meng-shao

·

shao__meng

AI & ML interests

None yet

Recent Activity

reacted to AdinaY's post with 🔥 6 days ago

VideoLLaMA 3🔥multimodal foundation models for Image and Video Understanding by DAMO Alibaba Model: https://huggingface.co/collections/DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15 Paper: https://huggingface.co/papers/2501.13106 ✨ 2B/7B ✨ Apache2.0

upvoted a paper 9 days ago

Evolving Deeper LLM Thinking

upvoted a paper 9 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

View all activity

Organizations

meng-shao's activity

upvoted 2 papers 9 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 13 days ago • 100

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published 13 days ago • 40

upvoted a paper 14 days ago

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published 29 days ago • 14

upvoted 8 papers about 1 month ago

RAG Playground: A Framework for Systematic Evaluation of Retrieval Strategies and Prompt Engineering in RAG Systems

Paper • 2412.12322 • Published Dec 16, 2024 • 1

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 45

AI PERSONA: Towards Life-long Personalization of LLMs

Paper • 2412.13103 • Published Dec 17, 2024 • 2

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published Dec 23, 2024 • 21

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 344

The Open Source Advantage in Large Language Models (LLMs)

Paper • 2412.12004 • Published Dec 16, 2024 • 9

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 91

upvoted 9 papers about 2 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Paper • 2412.09605 • Published Dec 12, 2024 • 28

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published Dec 11, 2024 • 38

[MASK] is All You Need

Paper • 2412.06787 • Published Dec 9, 2024 • 2

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 129

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 105

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Paper • 2412.03552 • Published Dec 4, 2024 • 26

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 22