Payload Logo

DSpark: Speculative decoding accelerates LLM inference [pdf]

Author

DJStern

Date Published

Front page story about speculative decoding for LLM inference.