801: Merged LLMs Are Smaller And More Capable, with Arcee AI's Mark McQuade and Charles Goddard
Jul 16, 2024
Mark McQuade and Charles Goddard from Arcee AI discuss merging LLMs efficiently, using MergeKit and evolutionary algorithms. They explore commercial applications, compare MoE vs. MoA, and highlight the advantages of smaller language models. The podcast also covers the Spectrum Project for efficient training and the future of SLMs.
Model merging allows for combining multiple LLMs without increasing size, enhancing efficiency.
Evolutionary model merging optimizes performance parameters, demonstrating cost efficiencies and domain-specific capabilities.
Spectrum accelerates model training by 40-50% by training only specific, high-signal modules, without sacrificing performance.
Deep dives
Model Merging and its Advantages in AI
Model merging, a technique discussed in the podcast, combines the pre-trained weights of multiple neural networks into a single network that captures the strengths of each. Because the merged model is no larger than any of its parents, the technique delivers superior performance without increasing model size, making it a cost-effective way to build capable language models. Combined with targeted training that updates only specific modules of the network while freezing others, it eliminates the need to train language models from scratch each time, offering significant advantages in cost savings and performance.
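To make the core idea concrete, here is a minimal sketch of a linear (weighted-average) merge of two same-architecture checkpoints. It is an illustration of the concept only, not MergeKit's implementation, and the model names and weighting are hypothetical placeholders.

```python
# Minimal sketch of a linear (weighted-average) merge of two same-architecture
# checkpoints. Conceptual illustration only, not MergeKit's API.
import torch
from transformers import AutoModelForCausalLM

# Hypothetical checkpoints; any two fine-tunes of the same base model would do.
model_a = AutoModelForCausalLM.from_pretrained("org/finetune-a", torch_dtype=torch.float16)
model_b = AutoModelForCausalLM.from_pretrained("org/finetune-b", torch_dtype=torch.float16)

alpha = 0.5  # interpolation weight between the two parents
merged_state = {}
with torch.no_grad():
    state_b = model_b.state_dict()
    for name, tensor_a in model_a.state_dict().items():
        # Element-wise interpolation of matching parameter tensors.
        merged_state[name] = alpha * tensor_a + (1.0 - alpha) * state_b[name]

model_a.load_state_dict(merged_state)   # reuse model_a's architecture as the container
model_a.save_pretrained("merged-model")  # same parameter count as either parent
```

Note that the merged model has exactly the same number of parameters as either parent, which is what makes the approach attractive for deployment.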
Evolutionary Model Merging and Thomson Reuters Case Study
The podcast also highlights the concept of evolutionary model merging, where the evolutionary algorithm CMA-ES (Covariance Matrix Adaptation Evolution Strategy) is used to optimize merge parameters for maximum model performance. An example shared from Thomson Reuters showcases the successful implementation of a 7 billion parameter model built and fine-tuned with evolutionary model merging. The results demonstrated improved model performance, cost efficiencies, and domain-specific capabilities, reducing reliance on proprietary models in certain applications while saving companies significant costs.
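The sketch below shows how a CMA-ES loop (here via the open-source `cma` package) might search over merge weights. The functions `build_merge` and `score_on_benchmark` are hypothetical stand-ins for producing a merged model from a weight vector and evaluating it on a held-out task; this is not Arcee's or MergeKit's actual pipeline.

```python
# Sketch of evolutionary merge-weight search with CMA-ES (via the `cma` package).
# `build_merge` and `score_on_benchmark` are hypothetical stand-ins.
import cma
import numpy as np

def objective(weights: np.ndarray) -> float:
    candidate = build_merge(weights)       # hypothetical: merge parents with these weights
    return -score_on_benchmark(candidate)  # CMA-ES minimizes, so negate the benchmark score

n_params = 8                               # e.g., one mixing weight per layer group
es = cma.CMAEvolutionStrategy(n_params * [0.5], 0.2)  # initial mean 0.5, step size 0.2
while not es.stop():
    candidates = es.ask()                  # sample a population of weight vectors
    es.tell(candidates, [objective(c) for c in candidates])

best_weights = es.result.xbest             # best merge recipe found
```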
Mixture of Experts and Sparse Upcycling Techniques
The discussion expands to Mixture of Experts (MoE) models, with insights on how a router directs specific queries to expert components within the model architecture. Sparse upcycling, a merging method implemented in MergeKit, is highlighted as a way to assemble smaller dense models into larger, sparse MoE models, proving beneficial for enhancing model capabilities and optimizing performance across diverse tasks.
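For readers unfamiliar with MoE routing, here is a minimal top-k gated expert layer in PyTorch. It is a conceptual example of how a router sends each token to a subset of experts, not MergeKit's sparse-upcycling code; the dimensions and expert count are arbitrary.

```python
# Minimal sketch of a top-k gated Mixture-of-Experts feed-forward layer.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, d_model: int, d_ff: int, n_experts: int = 4, k: int = 2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        tokens = x.reshape(-1, x.shape[-1])                  # flatten (batch, seq) into tokens
        gate_logits = self.router(tokens)                    # (tokens, n_experts)
        weights, indices = gate_logits.topk(self.k, dim=-1)  # route each token to k experts
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(tokens)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = indices[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(tokens[mask])
        return out.reshape(x.shape)

# Example: route a batch of token embeddings through the layer.
layer = TopKMoE(d_model=64, d_ff=256)
y = layer(torch.randn(2, 10, 64))
```

Because only k of the experts run per token, the layer adds capacity without a proportional increase in per-token compute, which is the appeal of sparse MoE architectures.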
Efficient Training Methods with Spectrum
Spectrum provides an efficient training mechanism, enabling models to be trained 40-50% faster and cheaper by identifying the layer modules with the highest signal-to-noise ratio and training only those. By freezing the remaining modules during training, Spectrum cuts compute without sacrificing model performance.
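The sketch below illustrates the general pattern of selective training: score parameter matrices, keep a fraction trainable, and freeze the rest. The signal-to-noise measure here (mean absolute value over standard deviation) is a naive stand-in, not Spectrum's actual metric, and the checkpoint name is hypothetical.

```python
# Sketch of Spectrum-style selective training: rank weight matrices by an
# SNR proxy and freeze everything else. Illustrative only.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("org/base-model")  # hypothetical checkpoint

snr = {}
for name, param in model.named_parameters():
    if param.dim() >= 2:  # score weight matrices only
        snr[name] = (param.abs().mean() / (param.std() + 1e-8)).item()

# Keep (train) only the top 25% highest-scoring matrices; freeze the rest.
keep = set(sorted(snr, key=snr.get, reverse=True)[: len(snr) // 4])
for name, param in model.named_parameters():
    param.requires_grad = name in keep

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"Training {trainable:,} of {total:,} parameters")
```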
Transition to Specialized, Small Language Models with Arcee Cloud
Arcee Cloud offers a SaaS platform for training and merging language models, providing a convenient and efficient alternative to in-company VPC deployments. The shift toward smaller, specialized language models (SLMs) like the 7 billion parameter Arcee Spark emphasizes cost-effective, powerful solutions tailored to specific tasks, enabling organizations to achieve significant efficiencies in model training and deployment.
Merged LLMs are the future, and we’re exploring how with Mark McQuade and Charles Goddard from Arcee AI on this episode with Jon Krohn. Learn how to combine multiple LLMs without adding bulk, train more efficiently, and dive into different expert approaches. Discover how smaller models can outperform larger ones and leverage open-source projects for big enterprise wins. This episode is packed with must-know insights for data scientists and ML engineers. Don’t miss out!
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
• Explanation of Charles' job title: Chief of Frontier Research [03:31]
• Model Merging Technology combining multiple LLMs without increasing size [04:43]
• Using MergeKit for model merging [14:49]
• Evolutionary Model Merging using evolutionary algorithms [22:55]
• Commercial applications and success stories [28:10]
• Comparison of Mixture of Experts (MoE) vs. Mixture of Agents [37:57]
• Spectrum Project for efficient training by targeting specific modules [54:28]
• Future of Small Language Models (SLMs) and their advantages [01:01:22]