Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jeff Rasley's picture
38 1 2

Jeff Rasley

jeffra
MillerForAI's profile picture shuyuej's profile picture Samanthaleysi's profile picture
·
  • jeffra45
  • jeffra

AI & ML interests

None yet

Organizations

BigScience Workshop's profile picture Snowflake's profile picture LLHF's profile picture SLLHF's profile picture wut?'s profile picture

authored 5 papers over 1 year ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 34

ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

Paper • 1910.02054 • Published Oct 4, 2019 • 7

ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning

Paper • 2104.07857 • Published Apr 16, 2021

DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale

Paper • 2201.05596 • Published Jan 14, 2022 • 2

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Paper • 2310.04610 • Published Oct 6, 2023 • 1
authored a paper almost 2 years ago

DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

Paper • 2401.08671 • Published Jan 9, 2024 • 15
authored a paper over 2 years ago

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

Paper • 2308.01320 • Published Aug 2, 2023 • 45
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs