1 78 22

js

rldy

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Scalable-Softmax Is Superior for Attention

upvoted a paper 2 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

upvoted a paper 2 days ago

s1: Simple test-time scaling

View all activity

Organizations

rldy's activity

upvoted 3 papers 2 days ago

liked a Space 3 days ago

CoT-Lab: Human-AI Co-Thinking Laboratory

🤖

Generate human-like text responses to your prompts

upvoted 2 papers 13 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 14 days ago • 86

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 14 days ago • 295

upvoted a paper 14 days ago

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 16 days ago • 31

liked a model 16 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 4 days ago • 1.23M • • 6.86k

upvoted a paper 16 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 20 days ago • 105

upvoted 2 papers 19 days ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 20 days ago • 67

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 22 days ago • 53

upvoted a paper 20 days ago

MangaNinja: Line Art Colorization with Precise Reference Following

Paper • 2501.08332 • Published 22 days ago • 56

upvoted a paper 21 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 22 days ago • 272

upvoted 2 papers 22 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 25 days ago • 80

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 25 days ago • 29

upvoted a collection 23 days ago

Reasoning Datasets

Collection

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24

liked a model 25 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 23 days ago • 18.5k • 524

upvoted 2 papers 27 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 28 days ago • 84

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 28 days ago • 253

liked a model about 1 month ago

katanemo/Arch-Function-3B

Text Generation • Updated Dec 2, 2024 • 1.12k • 99