Motoki Wu

tokestermw

https://motoki.co

Recent Activity

liked a Space 3 days ago

m-ric/open_Deep-Research

tokestermw's activity

upvoted a paper 2 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 4 days ago • 32

upvoted 2 papers 6 days ago

Agency Is Frame-Dependent

Paper • 2502.04403 • Published 10 days ago • 21

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

Paper • 2502.04689 • Published 9 days ago • 7

upvoted an article 6 days ago

Open R1: Update #2

Article • and 6 others • 6 days ago • 166

upvoted a paper 6 days ago

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published 10 days ago • 18

upvoted a paper 8 days ago

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published 13 days ago • 21

upvoted a paper 9 days ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published 10 days ago • 22

upvoted an article 9 days ago

Open-R1: Update #1

Article • and 7 others • 15 days ago • 279

upvoted a paper 11 days ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published 13 days ago • 22

upvoted an article 11 days ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

Article • 13 days ago • 46

upvoted 2 papers 16 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 17 days ago • 81

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 17 days ago • 53

upvoted an article 16 days ago

How to deploy and fine-tune DeepSeek models on AWS

Article • 18 days ago • 45

upvoted a paper 18 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 19 days ago • 105

upvoted a paper 19 days ago

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published about 1 month ago • 18

upvoted an article 19 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 20 days ago • 747

upvoted a paper 23 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 25 days ago • 24

upvoted an article 26 days ago

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Article • 27 days ago • 60

upvoted an article about 1 month ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15 • 144

upvoted a collection about 1 month ago

SmolLM2

Collection • State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 10 days ago • 234