undefined

Jonathan Frankle

Author of the Lottery Ticket Hypothesis, researching Sparse Neural Networks, Pruning, and Lottery Tickets. Also works on AI Technology Policy and is an adjunct professor of law at Georgetown University Law Center.

Top 5 podcasts with Jonathan Frankle

Ranked by the Snipd community
undefined
66 snips
Dec 30, 2023 • 2h 42min

NeurIPS 2023 Recap — Top Startups

In this dynamic discussion, Jonathan Frankle, Chief Scientist at MosaicML, shares insights on their $1.3 billion acquisition by Databricks. Lin Qiao, CEO of Fireworks AI, talks about optimizing PyTorch for inference. Aman Sanger from Cursor reveals innovative memory strategies for AI coding. Aravind Srinivas discusses the impressive growth of Perplexity AI, hitting 1 million installs, while Jeremy Howard emphasizes the need for accessible AI. Together, they explore the vibrant AI startup landscape showcased at NeurIPS 2023, reflecting on innovation, collaboration, and the future of technology.
undefined
59 snips
May 20, 2023 • 1h 7min

MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML

Jonathan Frankle, Chief Scientist at MosaicML, and Abhinav Venigalla, Research Scientist at MosaicML, dive into the groundbreaking MPT-7B model. They discuss its unprecedented 84,000-token context length and how it was trained on 1 trillion tokens, achieving industry-leading performance for a fraction of the cost. The duo also navigates the complexities of AI model training, ethical considerations in creative generation, and the balance between open research and business interests, providing fascinating insights into the future of AI technologies.
undefined
24 snips
Jun 25, 2024 • 1h 22min

State of the Art: Training >70B LLMs on 10,000 H100 clusters

In this engaging discussion, Jonathan Frankle, Chief AI Scientist at Databricks, and Josh Albrecht, CTO of Imbue, dive into groundbreaking advancements in AI. They unveil Imbue 70B, a model outperforming GPT-4o with significantly less data. The duo shares insights on the complexities of scaling GPU clusters and the importance of high-performance infrastructure. They also address evaluating language models and introduce innovative tools for hyperparameter tuning. Their expertise shines through as they explore the future of AI in coding and reasoning tasks.
undefined
17 snips
Nov 12, 2024 • 1h 4min

Ep 47: Chief AI Scientist of Databricks Jonathan Frankle on Why New Model Architectures are Unlikely, When to Pre-Train or Fine Tune, and Hopes for Future AI Policy

Jonathan Frankle, Chief AI Scientist at Databricks, brings deep insight into the fast-paced world of AI. He discusses the evolution of AI models, favoring transformers over LSTMs, and shares strategic insights from the merger of Mosaic and Databricks. Frankle emphasizes the importance of effective AI evaluation benchmarks and customer collaboration in developing AI solutions. Ethical considerations and responsible AI policy also take center stage, as he highlights the need for transparency and community engagement in the rapidly evolving landscape.
undefined
16 snips
Jan 19, 2024 • 1h 10min

The Myth of AI Breakthroughs // Jonathan Frankle // #205

Jonathan Frankle, Chief Scientist at Databricks, discusses the realities and usefulness of AI, including face recognition systems, the 'lottery ticket hypothesis,' and robust decision-making protocols for training models. They also explore Jonathan's move into law, his experience with GPUs, and the revolutionary algorithm called Qstar.