Yannic Kilcher Videos (Audio Only) cover image

This is a game changer! (AlphaTensor by DeepMind explained)

Yannic Kilcher Videos (Audio Only)

00:00

The Neural Network Reward Signal Isn't Just Zero or One

The reward isn't just zero or one they do give and I believe they describe it somewhere they do give a negative one reward for every step that's being done. This actually encourages a low the low rank decomposition on the other hand it also provides a denser reward signal so you don't have to because this problem is super difficult right and bite to stumble by chance upon this would be not really it would be like really lucky and the reward would be super sparse.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app