
Udio & the age of multi-modal AI
Practical AI
Exploring Multimodal Perception and AI
This chapter covers multimodal perception in humans and animals and how sensory experience shapes our understanding of the world. It also discusses advances in multimodal AI, such as visual instruction tuning and new models like LLaVA, which integrate visual and language processing.
Transcript


