218 109

Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

LatitudeGames/Equinox-31B

liked a model 5 days ago

bytedance-research/Lance

upvoted a paper 6 days ago

Teaching Language Models to Think in Code

View all activity

Organizations

upvoted a paper 6 days ago

Teaching Language Models to Think in Code

Paper • 2605.07237 • Published 14 days ago • 30

upvoted a paper 10 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 12 days ago • 59

upvoted a paper 12 days ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published 14 days ago • 107

upvoted a paper 14 days ago

Lightning Unified Video Editing via In-Context Sparse Attention

Paper • 2605.04569 • Published 19 days ago • 18

upvoted a paper 20 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 22 days ago • 162

upvoted a paper 25 days ago

EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

Paper • 2604.10268 • Published Apr 11 • 12

upvoted an article 26 days ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

27 days ago

• 57

upvoted a paper 27 days ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 28 days ago • 118

upvoted an article 30 days ago

Article

How to Use Transformers.js in a Chrome Extension

nico-martin

•

Apr 23

• 37

upvoted a paper about 1 month ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 59

upvoted an article about 1 month ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 71

upvoted a paper about 1 month ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published Apr 8 • 34

upvoted a paper about 2 months ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published Apr 2 • 31

upvoted an article about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 899

upvoted 2 papers about 2 months ago

Voxtral TTS

Paper • 2603.25551 • Published Mar 26 • 62

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 133

upvoted 3 papers 2 months ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 36

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published Mar 3 • 145

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Paper • 2603.04291 • Published Mar 4 • 15

upvoted a paper 3 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 195

Ougrid Dumdang

AI & ML interests

Recent Activity

Organizations

Ougrid-D's activity

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

How to Use Transformers.js in a Chrome Extension

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Welcome Gemma 4: Frontier multimodal intelligence on device