I apologize for adding yet another DeepSeek video to your video queue. During a trip to Tokyo last year, I was told that DeepSeek was the real deal. A cracked team, and perhaps the only ones of significance in China. Since then, I have annoyed the guys on Transistor Radio - our podcast with Dylan Patel and Doug O'Laughlin - into talking about it. Though there was nothing much to be said. In December 2024, DeepSeek released their V3 base model, which had impressive efficiency. A few people in AI were impressed. Then on January 22nd 2025, DeepSeek released their reasoning model, R1, which works kind of like OpenAI's o1 and o3 models. It takes extra compute time to "think" up a better answer. R1's release kicked everything off. The next day, the New York Times published an article on it, but focused mostly on the earlier V3's training costs.
Get all episodes of Asianometry, Sharp Tech, Sharp China, Stratechery Updates and Interviews, Greatest of All Talk, and Dithering as part of Stratechery Plus for $15/month or $150/year.
Listen to Stratechery.
Listen to Dithering.
Listen to Sharp China.
Listen to Sharp Tech.
Listen to Greatest Of All Talk.