#045 Microsoft's Platform for Reinforcement Learning (Bonsai)

Feb 28, 2021

Guest

Megan Bloemsma

Guest

Scott Stanfield

Scott Stanfield and Megan Bloemsma from Microsoft's Autonomous Systems team dive into the ambitious Project Bonsai. They discuss its goal to simplify reinforcement learning, making it accessible for developers without PhDs. The conversation highlights the role of machine teaching in enhancing AI training, using real-world applications like balancing robots. They emphasize the need for expert guidance and domain knowledge in overcoming traditional challenges in the field. Innovations in simulation and collaboration are also spotlighted, showcasing a future where complex tasks become manageable.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

Two-Ball Anecdote

Tim Scarfe uses a two-ball anecdote with Project Moab to illustrate machine learning's limitations.
A model trained for one task gets confused when given a slightly different input, highlighting generalization issues.

INSIGHT

Reinforcement Learning vs. Traditional ML

Deep reinforcement learning (RL) differs from traditional machine learning by exploring environments without pre-set answers.
Bonsai aims to optimize and automate physical processes through this exploration.

INSIGHT

RL's Challenges

Alex Erpan's article, "Deep Reinforcement Learning Doesn't Work Yet," highlights RL's challenges, like reward sparsity and generalization.
Tim Scarfe emphasizes that RL's complexity makes expert consultation crucial, a key aspect of Bonsai's approach.

Get the Snipd Podcast app to discover more snips from this episode

Get the app