32 72 87

Somshubra Majumdar

smajumdar94

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

sesame/csm-1b

liked a model 3 days ago

nvidia/DeepSeek-R1-FP4

liked a dataset 3 days ago

open-r1/codeforces

View all activity

Organizations

smajumdar94's activity

upvoted an article 3 days ago

Article

Open R1: Update #3

and 9 others •

4 days ago

• 217

upvoted a paper 23 days ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 26 days ago • 28

upvoted a paper 24 days ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 72

upvoted a paper 29 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 30 days ago • 33

upvoted an article about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.16k

upvoted a paper about 1 month ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 43

upvoted 2 papers about 2 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

upvoted 2 papers 3 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 51

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 44

upvoted 2 papers 4 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 47

upvoted a collection 5 months ago

steiner-preview

Collection

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 32

upvoted an article 5 months ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 51

upvoted a paper 5 months ago

CursorCore: Assist Programming through Aligning Anything

Paper • 2410.07002 • Published Oct 9, 2024 • 13

upvoted 2 articles 5 months ago

Article

Welcome, Gradio 5

Oct 9, 2024

• 128

Article

Accelerate 1.0.0

Sep 13, 2024

• 52

upvoted a collection 6 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 17 days ago • 563

upvoted an article 6 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 225