

EP72: Croc Test with Gemini 1.5 Experimental, Flux Destroys Midjourney & GPT4o Model Updates
Aug 7, 2024
The hosts test Google's Gemini 1.5 Pro Experimental model, discussing its analysis of a crocodile video and its performance challenges. They compare the image generators Flux and Midjourney and argue that Flux produces better images. They also cover OpenAI's GPT-4o model updates, including structured outputs and cost reductions. The conversation wraps up with insights into the current AI development landscape, emphasizing the need for reliable tools in an increasingly competitive market.
AI Snips
Crocodile Show Experiment
- Mike Angell took his kids to a crocodile show at Australia Zoo and was impressed by the demonstration.
- This inspired him to test Google's new Gemini 1.5 Pro Experimental model with a video from the show.
Hallucination Observations
- Gemini 1.5 Pro Experimental hallucinates less than previous versions on single-shot questions.
- However, hallucinations increase as the conversation continues.
Using Long Context Windows
- Use long context windows to keep the context of a whole work session, including various media types, in a single conversation.
- When referring back to an earlier point, confirm which item the model is focusing on to avoid ambiguity.