13 20 51

Sultan Alrashed PRO

SultanR

https://srashed.com

AI & ML interests

Smol language modelling and Arabic!

Recent Activity

updated a dataset 2 days ago

AdaMLLab/indicxnli_repaired

updated a dataset 2 days ago

AdaMLLab/indicxnli_repaired

updated a dataset 2 days ago

AdaMLLab/indicxnli_repaired

View all activity

Organizations

upvoted 11 papers 12 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 124

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7, 2025 • 27

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7, 2025 • 56

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28, 2025 • 31

upvoted a paper about 1 year ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108

upvoted a collection about 1 year ago

Fineweb-Edu-Ar

Collection

Largest (as of 2024) machine translated Arabic educational corpus • 2 items • Updated Dec 16, 2024 • 1

upvoted 4 papers about 1 year ago

SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs

Paper • 2412.08347 • Published Dec 11, 2024 • 4

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Paper • 2402.01781 • Published Feb 1, 2024 • 4

Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models

Paper • 2411.06402 • Published Nov 10, 2024 • 2

ALLaM: Large Language Models for Arabic and English

Paper • 2407.15390 • Published Jul 22, 2024 • 3

upvoted 2 collections about 1 year ago

SmolTulu

Collection

A collection of models that use SmolLM2 as the pretrained base in conjunction with AllenAI's Tulu 3 post training pipeline. • 6 items • Updated Dec 17, 2024 • 1

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Dec 23, 2025 • 96

Sultan Alrashed PRO

AI & ML interests

Recent Activity

Organizations

SultanR's activity