MAmmoTH-VL

community

AI & ML interests

None defined yet.

Recent Activity

aaabiao

authored a paper 9 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published 11 days ago • 76

yizhilll

authored a paper 9 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published 11 days ago • 76

yizhilll

authored 16 papers about 1 month ago

Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation

Paper • 2406.03151 • Published Jun 5, 2024

MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning

Paper • 2212.02508 • Published Dec 5, 2022

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Paper • 2410.13639 • Published Oct 17, 2024 • 19

LIME: Less Is More for MLLM Evaluation

Paper • 2409.06851 • Published Sep 10, 2024 • 1

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 10

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 70

LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm

Paper • 2502.19103 • Published Feb 26 • 3

ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding

Paper • 2505.23922 • Published May 29

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

Paper • 2507.16331 • Published Jul 22 • 18

aaabiao

authored a paper about 2 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

yuexiang96

authored a paper 2 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 40

Team members 8
