

A Deep Dive Into Generative's Newest Models: Gemini vs Mistral (Mixtral-8x7B)–Part I
Dec 27, 2023
ML Solutions Architect Dat Ngo and Product Manager Aman Khan discuss the new models Gemini and Mixtral-8x7B. They cover the background and context of Mixtral, its performance compared to Llama and GPT3.5, and its optimized fine-tuning. Part II will explore Gemini, developed by DeepMind and Google Research.
Chapters
Transcript
Episode notes
1 2 3 4 5 6
Introduction
00:00 • 4min
Understanding the Relationship Between Model Size, Data Size, and Compute Performance
04:29 • 5min
Chinchilla Outperforms Gopher in Test
08:59 • 8min
Important Aspects of Newest Models
16:53 • 11min
The Limitations of Gemini and Mixtral-8x7B Models
27:45 • 17min
Architecture and Performance of Mistral Model: A Comparative Analysis
44:57 • 3min