wongyukim's picture

wongyukim

wongyukim

·

kimwongyuda

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

upvoted a paper about 23 hours ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

upvoted a paper 2 days ago

Mixture of Experts Made Intrinsically Interpretable

View all activity

Organizations

None yet

wongyukim's activity

upvoted 2 papers about 23 hours ago

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Paper • 2503.09402 • Published 3 days ago • 6

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published 3 days ago • 14

upvoted 5 papers 2 days ago

Mixture of Experts Made Intrinsically Interpretable

Paper • 2503.07639 • Published 10 days ago • 7

Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Paper • 2503.08644 • Published 4 days ago • 16

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published 4 days ago • 25

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 5 days ago • 73

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 4 days ago • 89

upvoted 4 papers 3 days ago

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Paper • 2503.04812 • Published 11 days ago • 12

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published 5 days ago • 21

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published 5 days ago • 36

Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning

Paper • 2503.07002 • Published 5 days ago • 36

upvoted 6 papers 4 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published 9 days ago • 8

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published 9 days ago • 14

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 8 days ago • 31

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published 8 days ago • 48

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 8 days ago • 42

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 8 days ago • 104

upvoted a paper 5 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 8 days ago • 25

upvoted 2 papers 7 days ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published 9 days ago • 22

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published 9 days ago • 20