AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Future Directions of Universal Document Processing
Can you automatically evaluate the image to determine if there's hallucination or does a human need to look at the image and say? Yeah, it's almost automatable as long as you're a good object detector. We've talked about a lot of future directions. I know the universal document processing is something that you're continuing to work on. Any other ideas in terms of future directions based on what we've talked about? It would be just multi-modal right now,. There's things like having action and interaction, right? Like we as humans learned not to touch something hard after you got burned.