Accelerating Large Language Model Decoding with Speculative Sampling
Paper
• 2302.01318 • Published
• 4
exploring speculative sampling with autoregressive model like: https://proceedings.mlr.press/v139/song21a.html and https://proceedings.mlr.press/v119/