AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Stability Diffusion: A Deep Dive Into Image Generation
GPT-4 is a new system that uses multiple models to generate high resolution images. It's still diffusion based, but there's one model that generates latents of the desired output size. The second step is specialized to generate this sort of high resolution image. They combine these in another stage and then kind of adds finer details to the generated output.