Accelerating Large Language Model Decoding with Speculative Sampling
Paper
•
2302.01318
•
Published
•
2
exploring speculative sampling with autoregressive model like: https://proceedings.mlr.press/v139/song21a.html and https://proceedings.mlr.press/v119/