4 28 39

Chew Kok Wah

chewkokwah

AI & ML interests

Open Domain Question Answering

Recent Activity

upvoted an article about 9 hours ago

How to Run a Hugging Face Model in JAX (Part 1)

liked a model 18 days ago

xl-zhao/PromptCoT-2.0-SelfPlay-30B-A3B

liked a model 18 days ago

xl-zhao/PromptCoT-2.0-Prompt-Generation-Model

View all activity

Organizations

upvoted an article about 9 hours ago

Article

How to Run a Hugging Face Model in JAX (Part 1)

•

Jul 20

• 27

upvoted an article about 1 month ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11

• 156

upvoted a paper about 2 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 33

upvoted a paper 3 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 63

upvoted an article 3 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others •

Jul 18

• 49

upvoted a paper 3 months ago

OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique

Paper • 2507.09075 • Published Jul 11 • 15

upvoted an article 3 months ago

Article

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Jul 16

• 73

upvoted 2 papers 3 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 26

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 34

upvoted an article 3 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 696

upvoted a paper 3 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23 • 33

upvoted an article 4 months ago

Article

The 4 Things Qwen-3's Chat Template Teaches Us

Apr 30

• 73

upvoted a paper 5 months ago

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

Paper • 2505.00612 • Published May 1 • 9

upvoted an article 5 months ago

Article

The Transformers Library: standardizing model definitions

May 15

• 119

upvoted a paper 5 months ago

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 25

upvoted 2 papers 6 months ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22 • 64

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34

upvoted an article 7 months ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

• 246

upvoted 2 collections 8 months ago

Light-R1

Collection

Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated 2 days ago • 12

TinyR1

Collection

7 items • Updated 2 days ago • 4

Chew Kok Wah

AI & ML interests

Recent Activity

Organizations

chewkokwah's activity

How to Run a Hugging Face Model in JAX (Part 1)

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

SmolLM3: smol, multilingual, long-context reasoner

The 4 Things Qwen-3's Chat Template Teaches Us

The Transformers Library: standardizing model definitions

Visualize and understand GPU memory in PyTorch