AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Google's Multimodal Training Scheme
Google is using two different models of blend of two different models. The pathways language and image model, Pali X, and the pathways, the pathway language model and body that's palm E. And what that does is essentially capture common sense in a way. So if you can tell the robot to put strawberry into the correct ball, move soccer ball to basketball, move coke and to X, pick land animal, pick animal with different color,. Just all these things where just from the data, just from the model, it understands and can reason about what it's seeing.