Mixture of Experts cover image

Episode 40: DeepSeek facts vs hype, model distillation, and open source competition

Mixture of Experts

CHAPTER

The Resurgence of Reinforcement Learning in Model Training

This chapter examines the intricacies of reinforcement learning within the framework of the DeepSeek project, highlighting two distinct training methodologies. It further investigates the impact of these approaches on model training and the potential of RL to enhance reasoning in smaller models.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner