AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Future of Machine Learning
There's one work that we put out almost a year ago now called Flash Attention. It was a great collaboration with some really smart folks at Stanford. We made it possible to, even though the compute is still quadratic, you're still doing all this brute forcing previous implementations. So in kind of the machine learning parlance, we call it tiling and it's actually even simpler than hashing.