Deep Papers cover image

Sleep-time Compute: Beyond Inference Scaling at Test-time

Deep Papers

00:00

Transforming Inference Costs Through Sleep Time Compute

This chapter delves into 'Sleep Time Compute', a paper that tackles the inference costs in reasoning models, emphasizing the shift from training time scaling to test time compute. It proposes innovative solutions for resource efficiency by utilizing idle periods for computation, aiming to enhance response times while managing GPU costs effectively.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app