quehry

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published 14 days ago • 45

upvoted a paper about 2 months ago

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7 • 147

upvoted 2 papers 12 months ago

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Paper • 2410.20424 • Published Oct 27, 2024 • 40

OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities

Paper • 2410.12219 • Published Oct 16, 2024 • 1

upvoted 8 papers about 1 year ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23, 2024 • 36

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 60

PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment

Paper • 2410.13785 • Published Oct 17, 2024 • 19

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published Sep 26, 2024 • 53

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 63

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 42

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23, 2024 • 22

upvoted 2 papers over 1 year ago

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 44

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25, 2024 • 23