Ep 47: Chief AI Scientist of Databricks Jonathan Frankle on Why New Model Architectures are Unlikely, When to Pre-Train or Fine Tune, and Hopes for Future AI Policy

Unsupervised Learning

Advancements in AI Architecture

This chapter discusses how transformer models rose to overshadow LSTMs in machine learning and reflects on the significant investments behind models such as Llama 3. It then explores the merger of Mosaic and Databricks, emphasizing cultural alignment and practical strategies for navigating AI model development.

