#66 – Michael Cohen on Input Tampering in Advanced RL Agents

Hear This Idea

The Problem With Gaussian Processes in Machine Learning

Michael Cohen: I've been trying to do some pessimistic reinforcement learning in practice, having done some theory on that, and it's tricky. I've been using Gaussian processes for the pessimistic reinforcement learner's model of the world. As you add more and more points, nothing breaks. A Gaussian process is technically an infinite-dimensional multivariate Gaussian.
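
To make the "infinite-dimensional multivariate Gaussian" remark concrete, here is a minimal NumPy sketch: restricted to any finite set of points, a Gaussian process is an ordinary multivariate Gaussian, so its posterior can be computed with standard linear algebra however many points are queried. The RBF kernel, the noise level, the function names, and the mean-minus-`beta`-standard-deviations rule for pessimism are all illustrative assumptions, not details taken from the episode or from Cohen's work.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between two sets of 1-D inputs (assumed kernel)."""
    a = np.asarray(a, dtype=float).reshape(-1, 1)
    b = np.asarray(b, dtype=float).reshape(-1, 1)
    sq_dist = (a - b.T) ** 2
    return np.exp(-0.5 * sq_dist / length_scale**2)


def gp_posterior(x_train, y_train, x_test, noise=1e-6):
    """Posterior mean and covariance of a zero-mean GP at x_test.

    The joint distribution over train and test outputs is a finite
    multivariate Gaussian; conditioning on the training outputs gives
    the usual closed-form posterior.
    """
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_ts = rbf_kernel(x_train, x_test)
    k_ss = rbf_kernel(x_test, x_test)
    solved = np.linalg.solve(k_tt, k_ts)  # K_tt^{-1} K_ts
    mean = solved.T @ np.asarray(y_train, dtype=float)
    cov = k_ss - k_ts.T @ solved
    return mean, cov


def pessimistic_estimate(mean, cov, beta=2.0):
    """Hypothetical pessimism rule: posterior mean minus beta standard deviations."""
    std = np.sqrt(np.clip(np.diag(cov), 0.0, None))
    return mean - beta * std


if __name__ == "__main__":
    x_train = [0.0, 1.0, 2.0]
    y_train = np.sin(x_train)
    x_test = [1.0, 5.0]  # one observed point, one far from the data
    mean, cov = gp_posterior(x_train, y_train, x_test)
    print("mean:", mean)
    print("pessimistic:", pessimistic_estimate(mean, cov))
```

In this sketch the pessimistic estimate sits close to the mean near observed points and falls well below it far from the data, where the posterior variance is large.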

