Junyeong Song's picture

Junyeong Song

junyeong-nero

·

https://junyeong-nero.github.io/portfolio/

AI & ML interests

Synthetic Data / OCR / Image-Generation

Recent Activity

upvoted an article about 21 hours ago

Mixture of Experts (MoEs) in Transformers

liked a dataset 1 day ago

nvidia/SPEED-Bench

upvoted a paper 2 days ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

View all activity

Organizations

None yet

upvoted an article about 21 hours ago

Article

Mixture of Experts (MoEs) in Transformers

+5

2 days ago

•

76

upvoted 2 papers 2 days ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published Feb 17, 2025 • 35

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published 3 days ago • 87

upvoted a collection 3 days ago

Qwen3.5

9 items • Updated 2 days ago • 452

upvoted a paper 4 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 17 days ago • 185

upvoted a paper 7 days ago

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Paper • 2602.16968 • Published 9 days ago • 11

upvoted a paper 18 days ago

Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Paper • 2401.02677 • Published Jan 5, 2024 • 25

upvoted an article about 2 months ago

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Jan 5

•

40

upvoted a paper about 2 months ago

K-EXAONE Technical Report

Paper • 2601.01739 • Published Jan 5 • 92

upvoted a collection about 2 months ago

Tiny-A2D

Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 16

upvoted a paper about 2 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79

upvoted a collection 2 months ago

Kanana-2

Open Source Kanana-2 • 30 items • Updated Jan 27 • 36

upvoted 3 papers 4 months ago

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published Nov 6, 2025 • 97

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 240

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 101

upvoted a collection 5 months ago

Qwen3-VL

37 items • Updated Dec 31, 2025 • 645

upvoted an article 5 months ago

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

Jun 26, 2025

•

49

upvoted 2 collections 5 months ago

T5Gemma

32 items • Updated Jul 10, 2025 • 81

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 183

upvoted a paper 6 months ago

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11, 2025 • 80