
EP8: RL with Ahmad Beirami
The Information Bottleneck
Best-Of Decoding Puzzle and Sunday Projects
Ahmad recounts investigating why best-of decoding outperforms other methods, which led him to a theoretical inquiry.
Transcript
