

OpenAI's Agent Mode, Kimi K2, Grok 4 & AI Girlfriend Ani Joins the Show - EP99.11-K2
Jul 18, 2025
The podcast dives into the excitement surrounding Grok 4's launch, but it doesn't shy away from skepticism about its actual performance. Discussions highlight the charm of AI characters while critiquing traditional AI benchmarking methods. Kimi K2's strengths are praised, especially for coding tasks, and contrasted with Grok's shortcomings. Concerns about AI's originality and about the ethics of partnerships, particularly with military entities, add depth to the conversation. Ultimately, the balance of hype versus real-world utility sparks engaging debates.
AI Snips
Grok 4's Disappointing Launch
- Grok 4 launched with hype but disappointed with poor performance and subpar coding ability.
- Elon Musk's influence skewed the model to agree with his opinions, limiting its objectivity and usefulness.
Kimi K2 Outperforms Expectations
- Kimi K2 is an open-source mixture-of-experts model that excels at tool calling and multi-tool integration.
- It rivals models like Sonnet 4 in capability and offers great speed and reliability for decision making.
Agentic Models with Internal Clock
- The future of AI models lies in agentic internal clock systems enabling autonomous task completion.
- Kimi K2 demonstrates this internal clock ability, allowing effective multi-tool coordination and real-time decision making.