EP50: We Bet $1000 Using Gemini Advanced, Qwen1.5 72B, Retell AI, Apple's MGIE & GOODY-2
Feb 9, 2024
auto_awesome
This episode of the podcast covers topics such as betting $1000 on horse racing using advanced AI models like Gemini Ultra 1.0 and Qwen 1.5 72B, the launch of Google Gemini Advanced, Apple's Open Source Guiding Instruction-based Image Editing, and the high refusal rate of GOODY-2, the world's most responsible AI model.
Gemini Ultra AI outperformed GPT-4 and Qwen 1.5 in sports betting, showcasing its potential in the field.
Google's release of the Gemini Ultra 1.0 model offers advanced reasoning capabilities, but its high price and access restrictions raise concerns.
Retail AI's voice agent technology impresses with interruption handling, speed, and natural-sounding voices, enhancing user experience in applications.
Deep dives
Google Gemini Ultra AI Performance in Sports Betting
Gemini Ultra AI outperformed GPT-4 and Qwen 1.5 in a sports betting test. While Qwen 1.5 lost money by betting on favorites, Gemini Ultra used advanced strategies to make a profit. The test showed that Gemini Ultra is particularly skilled at betting on horses, despite not always picking the race winners. This performance was surprising given the underwhelming results obtained in other tests with Gemini Ultra. The test results highlight the potential of Gemini Ultra in the world of sports betting.
Google's Gemini Ultra 1.0 Model Announcement
Google recently announced the release of the Gemini Ultra 1.0 model, which aims to deliver advanced reasoning capabilities. The model offers features such as access to Google services like search, Google flights, email integration, and more. However, pricing and access restrictions have raised concerns among potential users. The Gemini Ultra model is part of the Google One AI premium plan, which comes at a hefty price of $19.99 per month. While some existing Google One subscribers may find value in the plan, others might find the cost unjustified.
Sync Labs: Real-Time Lip Syncing API
Sync Labs has introduced an API that offers real-time lip syncing for videos. The API allows users to upload a video and automatically sync the lip movements to match any language. While the technology has potential, initial tests of the API have shown less than impressive results, with comically exaggerated lip movements that don't look realistic. Despite its current limitations, Sync Labs' real-time lip syncing API could be a valuable tool once the technology improves.
Retail AI: Convenient AI Voice Agents for Call Services
Retail AI is an API-focused company that provides voice agents for call services. Their technology allows developers to easily integrate AI voice agents into their applications, enabling tasks such as making restaurant reservations or handling customer support calls. Retail AI's system demonstrates impressive capabilities in interruption handling, speed, and natural-sounding voices. The ability to integrate with various underlying language models and perform actions based on voice interactions adds an exciting dynamic to the technology. Retail AI plans to be integrated into the Sim Theory platform to provide users with seamless access to their voice agent services.
Apple's Guiding Instruction-Based Image Editing Model
Apple has introduced a new open-source model called Guiding Instruction-Based Image Editing via Multimodal Large Language Models. This model allows users to provide instructions and direct image editing tasks using natural language prompts. While the model's performance in handling various instructions and outputs is not perfect, it shows potential for enabling convenient voice-controlled image editing capabilities in the future. The model aims to provide a safe and responsible approach to image editing and opens up possibilities for real-time voice-guided image manipulation.
Thanks to everyone for all your support and kind reviews to reach 50 episodes! Please consider leaving us a review wherever you get your podcasts. =====
This week we cover the launch of Google Gemini Advanced, Gemini Ultra 1.0 and Bard being Renamed to Gemini. We compare GPT-4, Gemini Ultra 1.0 and Qwen 1.5 72B by sports betting $1000 on horse racing.
We celebrate 50 episodes and share our excited for Qwen 1.5 72B's performance at coding and quick refusals. We cover new releases including SyncLabs and Retell AI and Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models.
Finally, we discuss GOODY-2 and it's high refusal rate.
===== CHAPTERS:
00:00 - Betting $1,000 To Compare Gemini Ultra 1.0 to GPT-4 to Qwen 1.5 07:33 - Google Gemini Advanced, Ultra: Details of Announcement and First Impressions 25:48 - OpenAI is Developing Agents to Control Your Devices 27:40 - Celebrating 50 Episodes of This Day in AI 30:34 - Qwen 1.5 72B: We're Impressed! 42:47 - SyncLabs: Tested & Impressions 47:58 - Retell AI: Tested & Impressions 54:18 - Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models 58:10 - GOODY-2: The World's Most Responsible AI Model
Get the Snipd podcast app
Unlock the knowledge in podcasts with the podcast player of the future.
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode
Save any moment
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Share & Export
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode