Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ruslan S.'s picture
3 9 2

Ruslan S.

poedator
sebastianking's profile picture bulatovv's profile picture Titus-von-Koeller's profile picture
·

AI & ML interests

None yet

Organizations

Petals Team's profile picture Blog-explorers's profile picture Spec Diffusion's profile picture

authored 3 papers over 1 year ago

SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices

Paper • 2406.02532 • Published Jun 4, 2024 • 13

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

Paper • 2306.03078 • Published Jun 5, 2023 • 3

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

Paper • 2402.12374 • Published Feb 19, 2024 • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs