AI Trends 2023: Reinforcement Learning - RLHF, Robotic Pre-Training, and Offline RL with Sergey Levine - #612

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

ChatGPT Will Not Optimize for That

ChatGPT is better than GPT-3 at remembering conversation history and at serving users by asking clarifying questions.
GPT-3 is not optimized for the final result after many iterations, which is why it rarely asks clarifying questions.
