Ethan Rosenthal from Runway discusses challenges of multimodal AI, transitioning from language to video models, and tools for content creators. He explores managing large datasets, transitioning research to production, and collaboration with cloud infrastructure tools. Accelerating research and utilizing resources efficiently at startups are also covered.
Read more
AI Summary
Highlights
AI Chapters
Episode notes
auto_awesome
Podcast summary created with Snipd AI
Quick takeaways
Multimodal AI combines various data types for better model performance.
Efficient training of large datasets is crucial for scalable research in AI.
Deep dives
Multimodal Feature Store Introduction
Introduction of the concept of a multimodal feature store that combines images, videos, audio, and text as different modalities for training and generating outputs in AI models. The idea is to handle varying input and output data types efficiently for better model performance.
Training Efficiency in Multimodal AI
Challenges of training models with large datasets, especially videos and images, fast and efficiently. Discusses the importance of creating training data sets that are relevant and scalable for researchers to experiment with different features and inputs.
Researcher-Engineer Collaboration
Emphasizes the need for seamless collaboration between researchers and engineers in a shared code base environment. Advocates for a unified approach to combining research and production code to encourage code reuse, testing, and better communication.
Accelerating Research & Inference
Focus on the Machine Learning Acceleration team's role in speeding up research by optimizing training processes and model scalability. Prioritizing faster training and inference for researchers to experiment efficiently and maximize GPU utilization for cost-effective performance.
Timestamps:
[00:00] Ethan's preferred coffee
[00:11] Takeaways
[02:07] Falling into LLMs
[03:16] Advanced AI Tech Capabilities
[04:40] AI-powered video editing tool
[06:56] Transition to AI: Diffusion Models
[09:09] Multimodal Feature Store breakdown
[15:33] Multimodal Feature Stores Evolution
[18:09] Benefits of Multimodal Feature Store
[25:09] Centralized Training Data Repository
[27:33] Large-scale distributed training
[32:37 - 33:39] AWS Ad
[33:45] Dealing with researchers on productionizing
[43:52] Infrastructure for Researchers and Engineers
[47:04] Generative DevOps movement
[49:21] Structuring teams
[52:06] Multimodal Feature Stores Efficiency
[54:02] Wrap up
Get the Snipd podcast app
Unlock the knowledge in podcasts with the podcast player of the future.
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode
Save any moment
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Share & Export
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode