Pedram Rostami

PedramR · PedramRostami

AI & ML interests

NLP, Machine Learning

upvoted 5 papers over 1 year ago

Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4, 2024 • 25

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 82

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 52

Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16, 2024 • 24