How to Find Points Off and On the Manifold of a BERT Model
With each pre-training task, you're finding a different way of defining points off and on the manifold. So we've learned this language manifold with a BERT model that takes in, let's say, 500 tokens. It's less than 500, because generally there's a separator token, but every single point on that manifold is a piece of valid language. I think it comes back to this idea of stepping stones and curriculum learning, and that we have these different manifolds, like subsets of this high-dimensional space.
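One way to picture this is a toy sketch of a masking-style pre-training task, assuming a simplified scheme in which the original sentence stands for a point on the manifold and the masked copy for a point off it. The names `content_budget` and `mask_tokens` are hypothetical, and the plain replace-with-`[MASK]` rule is a simplification for illustration, not the exact procedure any particular model uses.

```python
import random

MASK = "[MASK]"
SPECIAL = ("[CLS]", "[SEP]")  # special tokens that use up part of the input length


def content_budget(max_len, num_special):
    """Tokens left for actual language once special tokens are counted."""
    return max_len - num_special


def mask_tokens(tokens, mask_prob=0.15, seed=0):
    """Return a corrupted copy of `tokens` and the positions that were masked.

    The input is treated as a valid sentence (on the manifold); the
    corrupted copy is the off-manifold point the model learns to repair.
    """
    rng = random.Random(seed)
    corrupted = list(tokens)
    masked_positions = []
    for i, tok in enumerate(tokens):
        if tok in SPECIAL:
            continue  # never corrupt the special tokens
        if rng.random() < mask_prob:
            corrupted[i] = MASK
            masked_positions.append(i)
    return corrupted, masked_positions


sentence = ["[CLS]", "the", "cat", "sat", "on", "the", "mat", "[SEP]"]
corrupted, positions = mask_tokens(sentence, mask_prob=0.5, seed=1)
print(corrupted, positions)
print(content_budget(500, 2))  # room left for content with two special tokens
```

A different pre-training task would swap `mask_tokens` for another corruption, such as shuffling or replacing tokens, which is a different way of defining which points count as off the manifold.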