AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Emergence of Multimodal Models
I think right now what I'm interested in is how do people understand these things and then how best to express information so that you promote understanding. We at least need to know how to assess if it did a good job or not, which I think we need to do more work on. There are going to be emergent capabilities with multimodal models that can deal both with the visual and the language side. But just to go out one step out onto the limb, I know you're very wary of speculation, but this one feels like a safe speculation.