News

Since each step of Jacobi decoding involves LLM forward computation on >1 token, it is significantly more expensive in terms of FLOPs required than each step of autoregressive decoding. The ...
The motivation is to explore how Speculative Decoding can be effectively adapted across different model series and configurations. This repo aims to implement two algorithms: (1) Deepmind's Algorithm: ...
A CRC-aided successive cancellation list (SCL) decoding algorithm for polar codes and PAC codes with various code constructions/rate profiles. The list decoding algorithm is an adaptive two stage ...
Researchers from the University of Washington, the Pennsylvania State University, and Allen Institute for AI have open-sourced SafeDecoding, a technique for protecting large language models (LLMs) aga ...
This upgrade would allow Co-LLM to course-correct so the algorithm can still give a satisfactory reply. ... Learning to Decode Collaboratively with Multiple Language Models, arXiv (2024).