Adventures in Machine Learning cover image

ML 021: Grokking Deep Reinforcement Learning with Miguel Morales

Adventures in Machine Learning

00:00

How to Build a Simulator AI?

When you hook up an agent is going to find loopholes. We don't think optimally at all, but we've been doing okay, I guess. And the funny thing about that is that sometimes, or there's like most of the time, and it just happened a couple of days ago to me too, you don't think of things and those AIs find them. But here's the very, very interesting point in reinforcement is, aren't you distracting your AI from the main goal, which is destroy that other aircraft over there by saying, Oh, and by the way, fly smoothly. Like I don't care. Are you going to destroy that guy or not? Right?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app