How Do You Detect Objects in a Video?
We're basically gathering video data of people speaking in different languages, and then we label the data with landmarks and with a resolution. We create a bridge, or a latent space, that can basically encode the audio and decode the face, and then we can reconnect the audio in the back. Of course, we're using mainly GANs and exploring different things. The main interesting thing is video and its temporal stability, which is not required in other fields, for example image generation, which we now see on different platforms with DALL-E and Stable Diffusion. So we are dealing with a lot of temporal stability and correctness of the expression.