Stratechery cover image

DeepSeek FAQ

Stratechery

00:00

Economic Insights of DeepSeq V3 Training

This chapter explores the financial aspects of training the DeepSeq V3 model, detailing the GPU hours and costs involved, which total approximately $5.576 million. It also discusses the architecture's innovations, optimization techniques, and the impact of model distillation on the AI industry's economic landscape.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app