5. Charlie Snell on DALL-E and CLIP

The Inside View

VQGAN - Z Plus Quantize Trick

VQGAN uses discrete latents. Normally, when we're backpropagating to the latent vector, like in BigGAN, you can just sort of move the latent vector wherever you want, because it's a continuous latent space. VQGAN has a discrete latent space, though, and if you put anything that's not in the codebook into VQGAN it probably won't work. So to make this work better with VQGAN, the Z plus quantize trick takes the latent that you're learning and quantizes it to the nearest thing in the codebook.
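The snapping step described above can be sketched in a few lines of plain Python. This is only a toy illustration under stated assumptions, not VQGAN's actual code: the 2-D codebook, the helper names (`squared_distance`, `quantize`, `quantize_all`), and the use of squared Euclidean distance as the meaning of "nearest" are all hypothetical choices made for the example. It also covers only the lookup itself, not how gradients reach the latent during optimization.

```python
from typing import List, Sequence, Tuple


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two vectors of equal length."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def quantize(latent: Sequence[float],
             codebook: Sequence[Sequence[float]]) -> Tuple[int, List[float]]:
    """Return the index and a copy of the codebook entry nearest to `latent`."""
    best_index = min(
        range(len(codebook)),
        key=lambda i: squared_distance(latent, codebook[i]),
    )
    return best_index, list(codebook[best_index])


def quantize_all(latents: Sequence[Sequence[float]],
                 codebook: Sequence[Sequence[float]]) -> List[List[float]]:
    """Snap every latent vector to its nearest codebook entry."""
    return [quantize(z, codebook)[1] for z in latents]


# Hypothetical toy codebook with four 2-D entries (assumption for illustration).
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Continuous latents that have drifted off the codebook while being optimized.
learned = [[0.9, 0.2], [0.1, 0.8], [0.4, 0.4]]

# Each latent is replaced by the closest entry, so the decoder would only
# ever see vectors that are actually in the codebook.
snapped = quantize_all(learned, codebook)
print(snapped)
```

Every vector in `snapped` is an exact codebook entry, which is the property the trick is after: the latent you are learning can move freely, but what gets handed to the decoder is always something it has seen before.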
