"The Cognitive Revolution" cover image

"The Cognitive Revolution"

Robotics Research Update, with Keerthana Gopalakrishnan and Ted Xiao of Google Deepmind

Mar 15, 2024
In this podcast, they discuss breakthroughs in robotics research such as internet-scale vision-language for robots, training robots with a single human demonstration, and using language models for ethical oversight. They also cover challenges in robotics models, safety concerns, and advancements in vision-language generalization for robots.
01:19:47

Podcast summary created with Snipd AI

Quick takeaways

  • Robots can learn new skills from single human demonstrations with simple line drawings.
  • Collaboration with academic labs allows training a single model to control diverse robot embodiments.

Deep dives

RT2: Leveraging Internet-Scale Vision Language Models for General Purpose Robots

RT2 demonstrates how training robots on vision language models allows them to understand and manipulate unseen objects. By combining image language pairs from the internet, the robots can stitch concepts from the internet with the emotions in robotic datasets. The co-fine tuning method ensures better understanding and retention of concepts learned from the internet to avoid mode collapse.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner