

#162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter
Discover the latest in AI music generation with Udio, competing against established platforms like Suno. Explore early reviews of the Humane AI Pin and its performance struggles. Dive into tech giants' advancements with Microsoft 365 Copilot's GPT-4 Turbo upgrade and new AI chips from Google, Intel, and Meta. Unpack the changes in OpenAI’s startup fund and the launch of Mistral's impressive Mixtral model. Finally, delve into the ethical challenges of AI, including copyright issues and energy demands for data centers.
01:45:00
NYC Encounter
- Jeremie Harris had an unexpected encounter with Jon Krohn and Sadie St. Lawrence in NYC.
- Krohn commented on Harris' height, leading Harris to question his perceived stature.
Udio's Impact
- Udio, a new music generation tool, rivals Suno in quality and offers more control for musicians.
- Backed by notable investors and musicians, it's poised to revolutionize music creation.
Claude AI Tools
- Anthropic's Claude AI now uses external tools via APIs, improving accuracy and functionality.
- This tool use, including stock ticker integration, strengthens Claude's competition with OpenAI.
Intro
00:00 • 3min
AI Music Unleashed: Udio vs. Suno
02:30 • 12min
Early Impressions of the Humane AI Pin: Pros and Cons
14:16 • 2min
AI Innovations and Market Dynamics
16:23 • 21min
OpenAI's Startup Fund Restructuring: A Shift in Leadership
37:23 • 2min
Advancements in Open-Source AI Models
39:52 • 31min
Navigating AI Ethics, Copyright, and Energy Challenges
01:10:58 • 22min
Legal Precedent: Blocking AI-Enhanced Evidence
01:32:42 • 4min
Canada's AI Investment and Artistic Integrity
01:36:44 • 8min
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
Peyman Milanfar
Hossein Talebi
Vishal M. Patel
Mauricio Delbracio
Kangfu Mei
Zhengzhong Tu
This research investigates how the size of latent diffusion models affects their sampling efficiency. It reveals that smaller models can outperform larger ones under certain inference budgets, offering insights into optimizing model size for better performance.
Responsible Reporting for Frontier AI Development
Asher Brass
Markus Anderljung
Joslyn Barnhart
Kevin Esvelt
Gillian Hadfield
Noam Kolt
This paper proposes frameworks for responsible reporting by frontier AI developers to improve visibility into emerging risks and enhance decision-making for risk management. It suggests both voluntary and regulatory pathways for implementing such reporting mechanisms, ensuring better informed policy-making and safer AI development.
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
David Raposo
Peter Humphreys
Sam Ritter
Blake Richards
Adam Santoro
Timothy P. Lillicrap
This paper presents a novel method called Mixture-of-Depths, which dynamically allocates computational resources in transformer-based language models. By adapting the depth of the transformer layers based on input complexity, the approach improves efficiency and performance. The authors demonstrate significant improvements in speed and accuracy across various language tasks.
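
As a rough illustration of what dynamically allocating compute can look like, the sketch below sends only the highest-scoring tokens through a block and lets the rest skip it along the residual path. The top-k router, the `capacity` parameter, and the names `route_top_k` and `mod_block` are assumptions made for this example; they are not taken from the summary above or from the authors' code.

```python
from typing import Callable, List

Vector = List[float]


def route_top_k(scores: List[float], capacity: int) -> List[int]:
    """Return the indices of the `capacity` highest-scoring tokens, in sequence order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:capacity])


def mod_block(
    tokens: List[Vector],
    router_weights: Vector,          # hypothetical learned router parameters
    block: Callable[[Vector], Vector],  # stand-in for attention + MLP
    capacity: int,
) -> List[Vector]:
    """Apply `block` to at most `capacity` tokens; the others pass through unchanged."""
    # Router score: one scalar per token (a dot product with the router weights).
    scores = [sum(w * x for w, x in zip(router_weights, tok)) for tok in tokens]
    selected = set(route_top_k(scores, capacity))

    output = []
    for i, tok in enumerate(tokens):
        if i in selected:
            update = block(tok)
            # Residual connection, with the block output scaled by the router score.
            output.append([x + scores[i] * u for x, u in zip(tok, update)])
        else:
            # Skipped token: no compute is spent on it in this block.
            output.append(list(tok))
    return output
```

With a capacity smaller than the sequence length, the per-block cost is fixed in advance, while the router decides which tokens receive it.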

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Siddharth Gopal
Tsendsuren Munkhdalai
Manaal Faruqui
This paper presents a novel approach to scaling Transformer-based large language models to handle infinitely long inputs with bounded memory and computation. The Infini-attention mechanism incorporates compressive memory into the vanilla attention mechanism, enabling efficient processing of long contexts. The approach is demonstrated on various language modeling benchmarks.
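
To make the idea of bounded memory concrete, here is a minimal sketch of a compressive memory in the style of linear attention: each segment's keys and values are folded into a fixed-size matrix, and later queries read from that matrix instead of from the full history. The class name `CompressiveMemory`, the ELU+1 feature map, and the specific update and retrieval rules are assumptions chosen for illustration, and the combination with local attention is left out entirely.

```python
import math
from typing import List

Vector = List[float]


def elu_plus_one(x: float) -> float:
    """Positive feature map (ELU(x) + 1), assumed here as the key/query nonlinearity."""
    return x + 1.0 if x > 0 else math.exp(x)


class CompressiveMemory:
    """Fixed-size associative memory; its size does not grow with input length."""

    def __init__(self, d_key: int, d_value: int) -> None:
        self.d_value = d_value
        self.memory = [[0.0] * d_value for _ in range(d_key)]  # d_key x d_value
        self.norm = [0.0] * d_key                               # running normalizer

    def update(self, keys: List[Vector], values: List[Vector]) -> None:
        """Fold one segment's key/value pairs into the memory."""
        for key, value in zip(keys, values):
            phi = [elu_plus_one(x) for x in key]
            for i, p in enumerate(phi):
                self.norm[i] += p
                for j, v in enumerate(value):
                    self.memory[i][j] += p * v

    def retrieve(self, query: Vector) -> Vector:
        """Read a value estimate for `query` from the compressed history."""
        phi = [elu_plus_one(x) for x in query]
        denom = sum(p * z for p, z in zip(phi, self.norm))
        if denom == 0.0:
            return [0.0] * self.d_value
        return [
            sum(p * row[j] for p, row in zip(phi, self.memory)) / denom
            for j in range(self.d_value)
        ]
```

However many segments are passed to `update`, the stored state stays at `d_key * d_value + d_key` numbers, which is the sense in which memory is bounded in this sketch.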
Octopus v2: On-device language model for super agent
Improving Function Calling Capabilities for Software Agents
Wei Chen
Zhiyuan Li
The Octopus v2 model is an advanced on-device language model with 2 billion parameters, designed to improve the natural language understanding and generation capabilities of software agents. It focuses on enhancing function-calling capabilities, offering high accuracy and low latency compared to other models like GPT-4. The model is particularly useful for tasks such as taking selfies, sending messages, and managing Android system functions.
OpenAI Removes Sam Altman's Ownership of Its Startup Fund
A Case Study on Corporate Governance and Venture Capital
Unknown Author
This news story covers the removal of Sam Altman as owner of OpenAI's Startup Fund and the shift in control to Ian Hathaway. It touches on the fund's history and investments, and on the implications for corporate governance and venture capital investment in AI startups.

Many-shot Jailbreaking
A New LLM Vulnerability
Cem Anil
Many-shot jailbreaking is a technique that exploits the extended context windows of large language models by using a lengthy faux dialogue to condition the model into providing harmful responses. This vulnerability has been demonstrated to affect several state-of-the-art models, including those from Anthropic and OpenAI. The research aims to raise awareness and encourage the development of more robust defenses against such attacks.

Artists for Responsible AI Music Practices
A Collective Call to Action
Nicki Minaj
Billie Eilish
Pearl Jam
This open letter, signed by over 200 artists including Billie Eilish, Pearl Jam, and Nicki Minaj, calls for responsible AI practices in the music industry. It addresses the challenges posed by AI-generated music and the importance of preserving human creativity.
Our 162nd episode with a summary and discussion of last week's big AI news!
Read our text newsletter and comment on the podcast at https://lastweekin.ai/
Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
Timestamps + links:
- Tools & Apps
- (00:02:50) AI-Music Arms Race: Meet Udio, the Other ChatGPT for Music
- (00:07:42) Anthropic launches external tool use for Claude AI, enabling stock ticker integrations and more
- (00:11:51) Building LLMs for Code Repair
- (00:14:16) Early Reviews of Humane AI Pin Aren’t Impressed
- (00:16:23) Microsoft 365’s Copilot gets a GPT-4 Turbo upgrade and improved image generation
- (00:18:41) AI editing tools are coming to all Google Photos users
- Applications & Business
- (00:19:21) Google announces the Cloud TPU v5p, its most powerful AI accelerator yet
- (00:23:32) Meta unveils its newest custom AI chip as it races to catch up
- (00:27:27) Intel Unveils New AI Accelerator in Bid to Challenge Nvidia
- (00:30:46) Adobe Is Buying Videos for $3 Per Minute to Build AI Model
- (00:32:55) OpenAI transcribed over a million hours of YouTube videos to train GPT-4
- (00:36:23) Waymo will launch paid robotaxi service in Los Angeles on Wednesday
- (00:37:23) OpenAI removes Sam Altman's ownership of its Startup Fund
- Projects & Open Source
- Research & Advancements
- (00:52:08) Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
- (00:57:41) Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
- (01:03:31) Octopus v2: On-device language model for super agent
- (01:07:54) Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
- (01:09:54) Many-shot Jailbreaking
- Policy & Safety
- (01:15:08) Schiff unveils AI training transparency measure
- (01:20:25) Linwei Ding was a Google software engineer. He was also a prolific thief of trade secrets, say prosecutors.
- (01:26:11) Responsible Reporting for Frontier AI Development
- (01:30:08) US govt wants to talk to tech companies about AI electricity demands — eyes nuclear fusion and fission
- (01:32:39) Washington state judge blocks use of AI-enhanced video as evidence in possible first-of-its-kind ruling
- (01:36:45) Trudeau announces $2.4 billion for AI-related investments
- Synthetic Media & Art
- Fun!