A Deep Dive Into Generative AI's Newest Models: Gemini vs Mistral (Mixtral-8x7B) – Part I
Dec 27, 2023
ML Solutions Architect Dat Ngo and Product Manager Aman Khan discuss the new models Gemini and Mixtral-8x7B. They cover the background and context of Mixtral, its performance compared to Llama and GPT-3.5, and how it was optimized through fine-tuning. Part II will explore Gemini, developed by DeepMind and Google Research.
Mixtral 8x7B from Mistral AI is a high-quality sparse mixture-of-experts (SMoE) model that outperforms Llama 2 70B and matches or outperforms GPT-3.5 on most benchmarks.
Sliding window attention introduces a fixed window size that moves across the sequence, reducing computational resources and improving performance in large language models.
Deep dives
Group Query Attention: An Efficient Approach to Attention Mechanism
The podcast episode discusses group query attention (GQA) as a more efficient approach to the attention mechanism in large language models. Standard multi-head attention keeps a separate key/value head for every query head, which is memory- and compute-intensive at inference time. Group query attention instead lets a group of query heads share a single key/value head, shrinking the key/value cache and the memory bandwidth needed per token while largely preserving accuracy. The episode also explores the challenges of training and optimizing models with group query attention.
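To make the idea concrete, here is a minimal NumPy sketch of grouped-query attention. The function name, shapes, and head counts (8 query heads sharing 2 key/value heads) are illustrative only and not the exact Mixtral configuration:

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def grouped_query_attention(q, k, v, n_query_heads, n_kv_heads):
    """
    q: (n_query_heads, seq_len, d_head)
    k, v: (n_kv_heads, seq_len, d_head)
    Each group of n_query_heads // n_kv_heads query heads shares one K/V head,
    shrinking the K/V cache relative to full multi-head attention.
    """
    group_size = n_query_heads // n_kv_heads
    d_head = q.shape[-1]
    outputs = []
    for h in range(n_query_heads):
        kv_head = h // group_size                      # query heads share K/V by group
        scores = q[h] @ k[kv_head].T / np.sqrt(d_head)
        outputs.append(softmax(scores) @ v[kv_head])
    return np.stack(outputs)                           # (n_query_heads, seq_len, d_head)

# Toy usage: 8 query heads sharing 2 K/V heads (illustrative numbers).
seq_len, d_head = 16, 64
q = np.random.randn(8, seq_len, d_head)
k = np.random.randn(2, seq_len, d_head)
v = np.random.randn(2, seq_len, d_head)
print(grouped_query_attention(q, k, v, 8, 2).shape)    # (8, 16, 64)
```

Because only `n_kv_heads` key/value tensors are cached per layer instead of one per query head, the key/value cache shrinks by the group factor during autoregressive decoding.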
Sliding Window Attention: Efficient Handling of Long Sequences
The podcast episode highlights the use of sliding window attention to address the computational challenges of processing long sequences in large language models. Traditional attention mechanisms have quadratic complexity in sequence length, but sliding window attention restricts each token to a fixed window of recent tokens that moves across the sequence, so cost grows roughly linearly with sequence length. This helps the model handle longer sequences more efficiently, improving performance and reducing computation costs.
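As a rough illustration of the masking pattern, here is a small NumPy sketch of sliding window attention. For clarity it still materializes the full score matrix, whereas a real implementation computes only the banded entries; the window size and shapes below are arbitrary:

```python
import numpy as np

def sliding_window_mask(seq_len, window):
    """Causal band mask: position i may attend only to positions in
    [i - window + 1, i]; everything else is masked out."""
    i = np.arange(seq_len)[:, None]
    j = np.arange(seq_len)[None, :]
    return (j <= i) & (j > i - window)

def sliding_window_attention(q, k, v, window):
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    mask = sliding_window_mask(len(q), window)
    scores = np.where(mask, scores, -np.inf)           # drop out-of-window positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

# Each token attends to at most `window` recent tokens, so the work per token
# is bounded and total cost grows roughly linearly with sequence length.
q = k = v = np.random.randn(12, 8)
print(sliding_window_attention(q, k, v, window=4).shape)  # (12, 8)
```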
BPE Tokenizer: Handling Out-of-Vocabulary Words and Improving Coverage
The podcast episode discusses the use of a Byte-Pair Encoding (BPE) tokenizer to improve language model performance. Traditional tokenizers struggle with out-of-vocabulary (OOV) words and domain-specific jargon. A BPE tokenizer provides a middle ground between word-based and character-based tokenization, allowing better coverage, adaptability to different languages, and improved handling of domain-specific terms. This optimization helps enhance the performance and flexibility of large language models.
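The core of BPE training is simple: repeatedly merge the most frequent adjacent symbol pair into a new vocabulary token. The toy Python sketch below, on a hypothetical four-word corpus, shows how frequent substrings become single tokens; production tokenizers such as Mistral's operate on bytes and far larger corpora:

```python
from collections import Counter

def get_pair_counts(words):
    """Count adjacent symbol pairs across a corpus of tokenized words."""
    counts = Counter()
    for symbols, freq in words.items():
        for pair in zip(symbols, symbols[1:]):
            counts[pair] += freq
    return counts

def merge_pair(words, pair):
    """Replace every occurrence of `pair` with a single merged symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i < len(symbols) - 1 and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged

# Toy corpus: word (split into characters) -> frequency.
corpus = {tuple("lower"): 5, tuple("lowest"): 2, tuple("newer"): 6, tuple("low"): 7}
for _ in range(5):
    best_pair = get_pair_counts(corpus).most_common(1)[0][0]
    corpus = merge_pair(corpus, best_pair)
print(corpus)  # frequent substrings such as "low" and "er" are now single tokens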
Dense Model Architecture vs. Mixture of Experts (MoE)
The podcast episode compares dense model architectures with the mixture of experts (MoE) approach in large language models. In a dense model, every parameter participates in processing each token, which can be computationally intensive, slower, and more expensive. MoE architectures instead route each token to a small subset of specialized expert networks, so only a fraction of the parameters are active per token, reducing inference time and cost. This allows a much larger total parameter count with improved performance and efficiency. However, training and optimizing MoE architectures present additional challenges and complexities.
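For intuition, here is a toy sparse MoE layer in NumPy with 8 experts and top-2 routing (the same expert count and routing fan-out as Mixtral, but with simplified ReLU experts, made-up dimensions, and no load balancing); the class and parameter names are illustrative only:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

class SparseMoELayer:
    """Toy mixture-of-experts layer: a router scores all experts per token,
    but only the top-k experts actually run, so compute per token stays
    roughly constant even as the total parameter count grows."""
    def __init__(self, d_model, d_hidden, n_experts=8, top_k=2, seed=0):
        rng = np.random.default_rng(seed)
        self.router = rng.standard_normal((d_model, n_experts)) * 0.02
        self.w_in = rng.standard_normal((n_experts, d_model, d_hidden)) * 0.02
        self.w_out = rng.standard_normal((n_experts, d_hidden, d_model)) * 0.02
        self.top_k = top_k

    def forward(self, x):  # x: (d_model,) for a single token
        logits = x @ self.router
        top = np.argsort(logits)[-self.top_k:]          # indices of chosen experts
        gates = softmax(logits[top])                    # renormalized gate weights
        out = np.zeros_like(x)
        for gate, e in zip(gates, top):
            hidden = np.maximum(x @ self.w_in[e], 0.0)  # expert feed-forward (ReLU)
            out += gate * (hidden @ self.w_out[e])
        return out

layer = SparseMoELayer(d_model=32, d_hidden=64)         # 8 experts, 2 active per token
print(layer.forward(np.random.default_rng(1).standard_normal(32)).shape)  # (32,)
```

All 8 experts' parameters exist in memory, but only 2 run per token, which is why an SMoE model can carry far more parameters than a dense model of comparable inference cost.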
For the last paper read of the year, Arize CPO & Co-Founder Aparna Dhinakaran is joined by Dat Ngo (ML Solutions Architect) and Aman Khan (Product Manager) for an exploration of the new kids on the block: Gemini and Mixtral-8x7B.
There's a lot to cover, so this week's paper read is Part I in a series about Mixtral and Gemini. In Part I, we provide some background and context for Mixtral 8x7B from Mistral AI, a high-quality sparse mixture-of-experts (SMoE) model that outperforms Llama 2 70B on most benchmarks with 6x faster inference. Mixtral also matches or outperforms GPT-3.5 on most benchmarks. This open-source model was optimized through supervised fine-tuning and direct preference optimization.
Stay tuned for Part II in January, where we'll build on this conversation and discuss Gemini, developed by teams at DeepMind and Google Research.