Machine Learning Guide

MLG 029 Reinforcement Learning Intro

Feb 5, 2018
ANECDOTE

MuJoCo Rag-Doll Shows Model-Free Power

  • The MuJoCo rag-doll demo shows how much model-free agents can achieve: they learn complex motor control without a model of the environment.
  • The host used MuJoCo examples to show that model-free algorithms such as PPO and DQN can solve high-dimensional control tasks.
INSIGHT

What 'Model' Means In RL

  • In model-based RL, 'model' specifically means a learned model of the environment's transition dynamics.
  • Model-free agents still contain other internal models, such as a policy or neural networks, but these do not count as a 'model' in this sense.
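To make the distinction concrete, here is a minimal sketch of what a learned transition model could look like in the simplest tabular case. The class name `TabularTransitionModel`, the state and action labels, and the sample transitions are all hypothetical and do not come from the episode; the sketch assumes a small discrete environment where transitions can simply be counted.

```python
from collections import Counter, defaultdict


class TabularTransitionModel:
    """Estimates P(next_state | state, action) from observed transitions."""

    def __init__(self):
        # Maps (state, action) -> Counter of observed next states.
        self.counts = defaultdict(Counter)

    def update(self, state, action, next_state):
        """Record one observed transition."""
        self.counts[(state, action)][next_state] += 1

    def predict(self, state, action):
        """Return empirical next-state probabilities, or {} if never seen."""
        seen = self.counts.get((state, action), Counter())
        total = sum(seen.values())
        if total == 0:
            return {}
        return {s: n / total for s, n in seen.items()}


# Hypothetical experience: (state, action, next_state) tuples.
experience = [
    ("A", "right", "B"),
    ("A", "right", "B"),
    ("A", "right", "A"),
    ("B", "left", "A"),
]

model = TabularTransitionModel()
for s, a, s_next in experience:
    model.update(s, a, s_next)

probs = model.predict("A", "right")
print(probs)
```

A model-based agent could plan by querying `predict` for imagined transitions, whereas a model-free agent never builds this object and learns its policy or value estimates directly from experience.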
ADVICE

Begin With Policy-Gradient Methods

  • Start your RL study with model-free, policy-gradient methods before moving to model-based planning.
  • Policy gradients apply standard gradient updates to optimize action policies from reward signals.
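As an illustration of the second point, the sketch below applies a REINFORCE-style policy-gradient update to a two-armed bandit, which is about the smallest setting where the idea is visible. It is not taken from the episode: the function names `softmax` and `train_bandit`, the reward probabilities, the learning rate, and the running-average baseline are all assumptions chosen for illustration.

```python
import math
import random


def softmax(prefs):
    """Convert action preferences (logits) into action probabilities."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(reward_probs, steps=5000, lr=0.1, seed=0):
    """Train a softmax policy on a Bernoulli bandit with policy gradients."""
    rng = random.Random(seed)
    prefs = [0.0] * len(reward_probs)  # policy parameters, one per action
    baseline = 0.0                     # running average reward
    for t in range(1, steps + 1):
        probs = softmax(prefs)
        action = rng.choices(range(len(prefs)), weights=probs)[0]
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        advantage = reward - baseline
        # For a softmax policy, d log pi(action) / d prefs[i]
        # equals 1[i == action] - probs[i].
        for i in range(len(prefs)):
            grad_log = (1.0 if i == action else 0.0) - probs[i]
            prefs[i] += lr * advantage * grad_log  # gradient ascent step
        baseline += (reward - baseline) / t
    return softmax(prefs)


# Hypothetical bandit: arm 1 pays off more often than arm 0.
final_policy = train_bandit([0.2, 0.8])
print(final_policy)
```

The update is ordinary gradient ascent on expected reward: actions followed by better-than-average reward become more probable. Methods such as PPO build on the same principle with neural-network policies and additional stabilizing constraints.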