The Information Bottleneck cover image

EP14: AI News and Papers

The Information Bottleneck

00:00

Continuous Autoregressive Language Models: New Paradigm

Ravid describes the Tencent paper proposing continuous chunk predictions, encoder–decoder compression, and energy-based heads.

Play episode from 27:54
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app