mmhamdy (Mohammed Hamdy)

upvoted an article 4 months ago

Continuous batching from first principles

Article • Nov 25, 2025 • 343

upvoted 2 articles 5 months ago

Promoter-GPT: Writing DNA Instructions with Language Models

Article • Oct 22, 2025 • 25

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Article • Oct 20, 2025 • 20

upvoted an article 8 months ago

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Article • Jul 8, 2025 • 35

upvoted an article 9 months ago

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Article • Jun 4, 2025 • 23

upvoted a paper 9 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 153

upvoted an article 10 months ago

The 4 Things Qwen-3’s Chat Template Teaches Us

Article • Apr 30, 2025 • 85

upvoted a paper 10 months ago

Text Generation Beyond Discrete Token Sampling

Paper • 2505.14827 • Published May 20, 2025 • 10

upvoted an article 11 months ago

Tiny Agents: an MCP-powered agent in 50 lines of code

Article • Apr 25, 2025 • 308

upvoted a paper 11 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

upvoted a paper about 1 year ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124

upvoted an article about 1 year ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Article • Mar 4, 2025 • 78

upvoted a collection about 1 year ago

Cohere Labs Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Jul 31, 2025 • 72

upvoted an article about 1 year ago

Common AI Model Formats

Article • Feb 27, 2025 • 66

upvoted a collection about 1 year ago

CHASE

Collection

Generate challenging synthetic data to evaluate LLMs • 4 items • Updated 13 days ago • 4

upvoted a paper about 1 year ago

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20, 2025 • 18

upvoted a collection about 1 year ago

Reasoning Datasets

Collection

50 items • Updated Jun 8, 2025 • 11

upvoted 3 papers about 1 year ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 45

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Paper • 2502.13791 • Published Feb 19, 2025 • 5

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 124


