Interconnects cover image

Arcee AI goes all-in on open models built in the U.S.

Interconnects

00:00

Stability issues and expert collapse in MoE training

Lucas details early training collapse after a trillion tokens and balancing strategies used to stabilize experts.

Play episode from 18:46
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app