EP58: We Convinced a Record Label to Sign an AI Artist + Udio AI Music, Gemini 1.5 Pro, GPT-4 TURBO, Mixtral
Apr 12, 2024
This podcast explores AI in music creation, including convincing a record label to sign an AI artist. It discusses major updates in language models, the Google Gemini 1.5 Pro launch, Mistral's open-source model, and challenges with AI models. The speakers critique the usability of AI tools, question their practicality, and explore the potential impact of AI technology advancements.
Udio's AI music platform excels at creating lifelike music across genres and applications.
Google Gemini 1.5 Pro adds audio input, improving query analysis and video processing.
Competition in AI intensifies with advanced language models like Google Gemini 1.5 Pro and OpenAI's GPT-4 Turbo.
Deep dives
AI Music Creation Advancements with Udio
Udio has pushed AI music creation well past earlier platforms like Suno, to the point that its output sounds more human and realistic. Users have created songs in a variety of genres, including rock and country, showcasing Udio's ability to generate lifelike music. The platform suits applications such as elevator music, gym playlists, and content creation without copyright concerns. If Udio tracks reach the music charts, they could disrupt traditional artist-driven content and shift how music is created and consumed.
Google Gemini 1.5 Updates and Gemini Code Assist Release
Google's Gemini 1.5 release introduced significant updates such as audio inputs and system prompts, extending what it can do for queries and analysis. Combining audio understanding with a large context window improves video processing, with potential uses in video indexing and chapterization. Additionally, Gemini Code Assist, a developer tool, is fast and shows promising completion capabilities, rivaling tools like Copilot in coding efficiency. Setting up Gemini applications remains difficult because of interface complexities, but the expanding feature set points to progress in AI-powered applications.
Implications of Large Language Model Releases
The intensifying competition in the AI landscape is evident from the rapid releases of large language models like Google Gemini 1.5 Pro and OpenAI's GPT-4 Turbo. These models promise stronger multimodal reasoning and function calling, raising the bar for AI functionality. The competition centers on functionality, cost efficiency, and ease of access as the industry moves towards more sophisticated applications. Despite the innovations, interface complexities and billing uncertainties show that AI tools need to become simpler and more accessible before they see broader adoption.
OpenAI's Improvements to the GPT-4 Turbo Model
OpenAI announced major improvements to its GPT-4 Turbo model, notably bringing the vision model into the API with support for function calling and JSON output. This update is significant for developers because it offers a more reliable and efficient way to work with LLMs, making it more practical to build applications around them.
Criticisms of OpenAI's Release Strategy and Product Performance
The podcast discusses criticisms of OpenAI's release strategy and product performance, focusing on the rushed and disorganized nature of the GPT-4 Turbo release. The criticisms highlight the lack of concrete examples, metrics, and transparency around the model's updates. The podcast also covers the mixed reactions to OpenAI's tactics and their impact on user trust and industry perception.
AI News: https://thisdayinai.com
SimTheory: https://simtheory.ai
Show Notes: https://thisdayinai.com/bookmarks/48-ep58
CHAPTERS:
00:00 - Udio, Udio Examples
10:45 - Will a Record Label Sign an AI Udio Artist?
19:09 - 3 Major LLM Updates/Releases in a Single Day
22:58 - Google Gemini 1.5 Pro General Availability, Audio Modality & Impressions
30:20 - Google Cloud Next 2024 AI Announcements Discussion
47:18 - OpenAI Announces "improvements" to GPT-4 Turbo, GPT-4 Turbo Official Release & Vision API JSON & Function Calling
57:35 - Mistral Posts BitTorrent To New Open Source Model Mixtral-8x22B
1:03:00 - Humane's AI Pin Reviews are out... and they aren't great.
Special thanks to AI artist Conor for the great content!
Thanks for listening.