
DeepSeek v3.2 Is Okay And Cheap But Slow
Don't Worry About the Vase Podcast
Paper read: technical innovation
Zvi summarizes V3.2's new attention mechanism and praises training-efficiency innovations in the paper.
Transcript


