How to Defend Against a Multimodal Attack
Language models are just getting text, so that's a fairly constrained set of inputs. But once you start tacking on other modalities, like video, audio, images, and so on, the attack surface just grows. And I think that's something I would naively expect to come up more with multimodal systems.