KW

kevineen

AI & ML interests

None yet

Recent Activity

liked a model about 11 hours ago

zai-org/RealVideo

liked a model about 11 hours ago

zai-org/GLM-TTS

liked a dataset about 18 hours ago

HuggingFaceFW/finetranslations

upvoted a paper about 23 hours ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 288

upvoted an article 5 days ago

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Article • 7 days ago • 60

upvoted a collection 5 days ago

💧 LFM2.5

Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 19 items • Updated 21 minutes ago • 67

upvoted a paper 6 days ago

Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling

Paper • 2601.02346 • Published 7 days ago • 25

upvoted an article 6 days ago

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Article • 7 days ago • 42

upvoted 2 articles 7 days ago

Introducing Falcon H1R 7B

Article • 8 days ago • 57

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 762

upvoted a paper 14 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 18 days ago • 24

upvoted an article 16 days ago

Deriving the PPO Loss from First Principles

Article • 18 days ago • 33

upvoted 3 articles 19 days ago

KV Cache from scratch in nanoVLM

Article • +3 • Jun 4, 2025 • 109

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Article • +5 • May 21, 2025 • 247

Efficient MultiModal Data Pipeline

Article • +3 • Jul 8, 2025 • 69

upvoted a collection 20 days ago

Optimal Sparsity Math

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks • 67 items • Updated Aug 19, 2025 • 2

upvoted an article 22 days ago

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

Article • 26 days ago • 44

upvoted a collection 22 days ago

Speech Language Models

20 items • Updated 21 days ago • 6

upvoted an article 24 days ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Article • +4 • 26 days ago • 111

upvoted a collection 26 days ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated about 15 hours ago • 47

upvoted an article about 1 month ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 571

upvoted a paper about 2 months ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published Nov 25, 2025 • 48

upvoted an article about 2 months ago

Continuous batching from first principles

Article • +1 • Nov 25, 2025 • 301