#157 - Gemini controversy, new Mistral models, Deepmind's Genie & Griffinn, AI Warfare is here

Last Week in AI

Sora Architecture and Diffusion Transformers in Image Processing

2min Snip

00:00

Play full episode

Summary

Transcript

Episode notes

The Sora architecture involves chunking up images or videos into patches to train the model to operate on these patches instead of full images. These patches act as atomic ingredients of the image that are mapped into latent space for processing. This approach marks a shift from traditional image-level network architectures to diffusion transformers. The comparison between different models like Sora, Valley free, and stable diffusion highlights the challenge of diminishing returns for companies specialized in this area. As models advance in handling text well, the focus now shifts to finer details like drawing hands accurately. While stable diffusion three is still in the testing phase, the competition in image processing appears to be intensifying with newer architectures like Sora and Gemini entering the scene.

Our 157th episode with a summary and discussion of last week's big AI news!

Check out our sponsor, the SuperDataScience podcast. You can listen to SDS across all major podcasting platforms (e.g., Spotify, Apple Podcasts, Google Podcasts) plus there’s a video version on YouTube.

Bonus plug: also check out this new book by Stanford AI expert, bestselling author, and Last Week in AI supporter Jerry Kaplan! Generative Artificial Intelligence: What Everyone Needs to Know

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

Timestamps + links:

(00:00:00) Intro / Banter
Tools & Apps
Applications & Business
- (00:30:09) Microsoft Strikes Deal with France’s Mistral AI
- (00:33:45) Figure Raises $675M at $2.6B Valuation and Signs Collaboration Agreement with OpenAI
- (00:37:05) Nvidia posts record revenue up 265% on booming AI business
- (00:39:54) MediaTek’s latest chipsets are now ‘optimized’ for Gemini Nano
- (00:41:28) Tumblr’s owner is striking deals with OpenAI and Midjourney for training data, says report
- (00:43:45) Mistral AI models coming soon to Amazon Bedrock
Projects & Open Source
Research & Advancements
- (00:57:51) Genie: Generative Interactive Environments
- (01:07:08) Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
- (01:15:16) Quantum Circuit Optimization with AlphaTensor
- (01:20:56) Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
- (01:22:10) Repetition Improves Language Model Embeddings
Policy & Safety
Synthetic Media & Art
- (01:40:23) The Intercept, Raw Story, and AlterNet sue OpenAI and Microsoft
- (01:41:15) A viral photo of a guy smoking in McDonald's is completely fake — and of course made by AI
Fun!
- (01:42:54) Impossible AI Food

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

App store banner

Play store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode