AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
A Zero-Shot Learning-Based Approach
The images we capture are at least 3,000 by 4,000 pixels. We cannot feed an image like that in a normal network. So downsizing significantly to something I can be handled is step number one. And then after this downsizing, passing it through a network that acts like us, sort of not to encoder,. essentially takes the image, the two-dimensional image, and finds a representation in higher dimensional space.