winvswon78 (Devin Thang)

upvoted a paper 3 months ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 93

upvoted 2 articles 5 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

106

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

276

upvoted a paper 6 months ago

Reconstructing 4D Spatial Intelligence: A Survey

Paper • 2507.21045 • Published Jul 28, 2025 • 38

upvoted a paper 8 months ago

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5, 2025 • 27

upvoted 2 articles 9 months ago

Article

KV Cache from scratch in nanoVLM

+3

Jun 4, 2025

•

110

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

593

upvoted a paper 9 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 133

upvoted an article 9 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

May 21, 2025

•

251

upvoted a collection 10 months ago

Aero-1-Audio

Collection

2 items • Updated May 1, 2025 • 1

upvoted a collection 11 months ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated Dec 31, 2025 • 163

upvoted a paper 12 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10, 2025 • 32

upvoted a collection 12 months ago

Ola

Collection

Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment • 4 items • Updated Feb 21, 2025 • 3

upvoted a paper 12 months ago

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 43

upvoted 2 articles about 1 year ago

Article

Mixture of Experts Explained

+4

Dec 11, 2023

•

1.07k

Article

How NuminaMath Won the 1st AIMO Progress Prize

+6

Jul 11, 2024

•

126

upvoted 2 papers about 1 year ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 23

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 19

Devin Thang

AI & ML interests

Organizations

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Reconstructing 4D Spatial Intelligence: A Survey

Aligning Latent Spaces with Flow Priors

KV Cache from scratch in nanoVLM

Vision Language Models (Better, faster, stronger)

Emerging Properties in Unified Multimodal Pretraining

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Aero-1-Audio

Qwen2.5-Omni

Matryoshka Quantization

Ola

Lost in the Middle: How Language Models Use Long Contexts

Mixture of Experts Explained

How NuminaMath Won the 1st AIMO Progress Prize

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Devin Thang

AI & ML interests

Organizations

winvswon78's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

KV Cache from scratch in nanoVLM

Vision Language Models (Better, faster, stronger)

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Mixture of Experts Explained

How NuminaMath Won the 1st AIMO Progress Prize