The Inside View cover image

5. Charlie Snell on DALL-E and CLIP

The Inside View

CHAPTER

What's Z Plus Quantize?

VQGAN is essentially again that enabled to have the higher resolutions. It's basically a VQVAE with some GAN components sort of so it uses sort of discrete latents like we described earlier and because of that and a couple other tricks that like generate really good images. Back then they were even not bad but like since then it's gotten even crazier I guess as we'll see. Okay there's one thing I want you to teach me. What's this Z plus quantize save trade? okay.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner