
DeepSeek FAQ
Stratechery
00:00
Economic Insights of DeepSeq V3 Training
This chapter explores the financial aspects of training the DeepSeq V3 model, detailing the GPU hours and costs involved, which total approximately $5.576 million. It also discusses the architecture's innovations, optimization techniques, and the impact of model distillation on the AI industry's economic landscape.
Transcript
Play full episode