

AI Experts React: Elon’s Grok 4 Is Now #1 in AI — This Changes Everything w/ Emad Mostaque, Salim Ismail & Dave Blundin | EP #182
Grok 4 Academic Supremacy
- Grok 4 surpasses PhD level in every academic subject and reasons strongly, but it still lacks planning abilities.
- It scores 100% on the AIME advanced math benchmark and outperforms other AI models on difficult benchmarks, which points to the quality of its engineering.
Elon's Grok 4 AI Is Postgraduate Level and Redefining Intelligence
Grok 4 has reached an unprecedented level of AI performance, scoring 100% on the AIME advanced mathematics benchmark and surpassing PhD-level expertise in all subjects. It reasons well but has yet to master planning, which positions it as a near-AGI system that can perform complex academic tasks with superhuman precision. The model is supported by an enormous cluster of 340,000 GPUs, and it achieves feats long thought impossible, such as maintaining coherence at massive computational scale.
Engineering innovation and data quality now differentiate the top models, marking a shift from brute force to refined execution. Grok 4 is already outpacing competitors and is poised to bring real-time intelligence to industries ranging from biomedical research to video game design.
Elon Musk's team went from a cold start to this point in 28 months, which illustrates the power of combining capital, engineering ingenuity, and open-source AI research. The next steps will integrate multi-agent systems, advanced world models, and seamless interaction, potentially transforming every sector with AI assistants as ubiquitous and capable as human experts.
Humanity's Last Exam Mastered
- "Humanity's Last Exam" tests polymathic knowledge far beyond the ability of even top human experts.
- Grok 4 Heavy's score of 44.4% vastly exceeds the best human scores of around 5%, showing superhuman breadth.