3 4

Max Ryabinin

mryab

https://mryab.github.io/

AI & ML interests

Distributed training, natural language generation, efficient architectures for DL

Recent Activity

submitted a paper 2 months ago

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

published an article 8 months ago

Fine-tune Any LLM from the Hugging Face Hub with Together AI

upvoted a paper 9 months ago

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

View all activity

Organizations

submitted a paper to Daily Papers 2 months ago

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

Paper • 2602.21196 • Published Feb 24 • 7

published an article 8 months ago

Article

Fine-tune Any LLM from the Hugging Face Hub with Together AI

Sep 10, 2025

•

upvoted a paper 9 months ago

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Paper • 2508.11383 • Published Aug 15, 2025 • 40

upvoted a paper 12 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 78

authored 2 papers over 1 year ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 58

authored a paper almost 2 years ago

Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Paper • 2110.03313 • Published Oct 7, 2021 • 1

upvoted a paper almost 2 years ago

SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices

Paper • 2406.02532 • Published Jun 4, 2024 • 13

authored 6 papers about 2 years ago

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8, 2024 • 9

Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements

Paper • 2401.06766 • Published Jan 12, 2024 • 2

authored 3 papers over 2 years ago

Distributed Inference and Fine-tuning of Large Language Models Over The Internet

Paper • 2312.08361 • Published Dec 13, 2023 • 27

Training Transformers Together

Paper • 2207.03481 • Published Jul 7, 2022 • 6

Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy

Paper • 2310.09247 • Published Oct 13, 2023 • 3

upvoted a paper over 2 years ago

Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy

Paper • 2310.09247 • Published Oct 13, 2023 • 3

authored 2 papers almost 3 years ago

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

Paper • 2303.06865 • Published Mar 13, 2023 • 1

SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient

Paper • 2301.11913 • Published Jan 27, 2023 • 1

Max Ryabinin

AI & ML interests

Recent Activity

Organizations

mryab's activity

Fine-tune Any LLM from the Hugging Face Hub with Together AI