AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Pros and Cons of Using Multi-Modal Language Models
Research is mostly in multi-modal language technologies. One of the most, in my opinion, promising potential use cases for vision and language models is to improve web accessibility. Images are often used for a communicative purpose online. And so simply substituting in an accurate description is oftentimes not quite enough because there's so many ways to describe an image.