What's Lacking in Large Language Models?

Self-supervised learning has been basically brought about a revolution in natural language processing because they are used for pre-training transformer architectures. The problem with generative models is that it's very difficult to represent uncertain predictions. So the only thing that works in the domain of images is those joint embedding architecture where instead of predicting the image, you predict a representation of the image. We have techniques for generating images before actually predicting good images that could fit in the video. But what I've been proposing is we should use SSL to get machines to learn predictive world so essentially to predict how the world is going to evolve. Because if an intelligent agent might take such a step as a consequence of an

Play episode from 02:17

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app