

Mistral Medium 3, OpenAI HealthBench and AI chips to Saudi Arabia
May 16, 2025
Join Volkmar Uhlig, AI Infrastructure Portfolio Lead, Chris Hay, CTO of Customer Transformation, and Kaoutar El Maghraoui, Principal Research Scientist, as they dive into the launch of Mistral Medium 3 and its potential to elevate Europe’s AI stature. They assess NVIDIA's AI chip sales to Saudi Arabia, highlighting the region's growing tech landscape. The conversation also critiques new AI performance benchmarks from OpenAI and IBM, pushing for more nuanced evaluations that reflect real-world applications in healthcare and beyond.
AI Snips
Chapters
Transcript
Episode notes
Europe’s AI Innovation and Challenges
- Europe has strong AI innovation but lacks scale and compute infrastructure compared to the US and China.
- Mistral shows promise with efficient models but still needs to align with open source developer needs.
Saudi Arabia's AI Infrastructure Push
- Saudi Arabia is investing heavily in AI infrastructure with hundreds of thousands of chips and 500 megawatts capacity.
- Sovereign AI infrastructure requires not only hardware but also talent and ecosystem development to succeed.
Evolving AI Benchmarks and Evaluation
- AI benchmarks are evolving towards domain-specific and agent-based evaluations rather than broad general benchmarks.
- Organizations need to build their own evaluation frameworks to validate AI models for their specific use cases.