The Progress of RT-2 Models in Robotics

RT-2 uses a core vision-language model that has been trained on internet data, and those models have been exploding in the past few years. We can ride that wave of improvement, and our models are going to get better over time just by virtue of those large models getting better themselves. It feels like, for the first time, we're really bringing everything together: the high-level intelligence, the low-level perception, and the action, in a way that is not just, you know, gluing things together. It's more than additive. So there is really a paradigm shift that we're going through in our heads right now, which I think is going to translate into a lot more breakthroughs moving forward.

On Sunday night, a crane arrived in downtown San Francisco to take down the Twitter sign from the company’s office building. The crane’s arrival marked the death of Twitter, the brand, and the start of X, Elon Musk’s everything app. Today, why Elon’s acquisition feels more and more like cultural vandalism and what, if anything, will replace the global town square.

Then, is Sam Altman’s universal basic income cryptocurrency app Worldcoin an iris scanning tool to save humanity, or just another attempt to get rich on crypto?

Plus: a trip to Google’s robotics lab, where artificial intelligence models are creating breakthroughs.

Additional reading:

Casey breaks down Twitter’s rebrand.
The launch of Worldcoin — and the story of how it recruited the first half a million users.
Kevin’s column is a deep dive into Google’s new robotics model, which melds A.I. with robots.