Groq's chips are extremely fast but carry very little onboard memory, so serving a single large model takes roughly 600 chips, whereas one Nvidia H100 can handle the same task alone. Those unit economics put Groq under financial pressure: it needs a significant increase in utilization just to break even. Notably, Groq's chips perform only inference, not training, which fits a broader trend of compute shifting toward post-training workloads. This points toward custom chips optimized for specific language-model inference use cases, and suggests meaningful hardware gains are still available from chip design alone, even on existing fabrication nodes.
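As a back-of-the-envelope sketch of both claims, the chip count follows from dividing model size by per-chip memory, and the utilization pressure follows from amortizing a large fixed cluster cost over the tokens served. All figures below are illustrative assumptions, not numbers from the source, apart from the ~600-chip total it mentions.

```python
import math

# Memory sketch: assume a ~70B-parameter model quantized to 8-bit
# weights (~70 GB), ~230 MB of onboard SRAM per Groq chip, and 80 GB
# of HBM on a single Nvidia H100. All assumed, for illustration.
MODEL_WEIGHTS_GB = 70.0
GROQ_SRAM_GB = 0.23
H100_HBM_GB = 80.0

def min_chips_for_weights(model_gb: float, per_chip_gb: float) -> int:
    """Minimum chips needed just to hold the model weights on-chip."""
    return math.ceil(model_gb / per_chip_gb)

print(min_chips_for_weights(MODEL_WEIGHTS_GB, GROQ_SRAM_GB))  # 305 chips
print(min_chips_for_weights(MODEL_WEIGHTS_GB, H100_HBM_GB))   # 1 chip
# Activations, KV cache, and pipeline replication push the practical
# Groq figure well above this weights-only minimum, toward the ~600
# chips the summary cites.

# Utilization sketch: with hundreds of chips amortized per served
# model, hardware cost per token scales inversely with utilization.
# Cost and throughput figures below are hypothetical.
CLUSTER_COST_PER_HR = 600 * 0.50   # assume $0.50/chip-hour, 600 chips
PEAK_TOKENS_PER_SEC = 20_000.0     # assume aggregate peak throughput

def cost_per_million_tokens(utilization: float) -> float:
    """Amortized hardware cost per 1M generated tokens."""
    tokens_per_hr = PEAK_TOKENS_PER_SEC * utilization * 3600
    return CLUSTER_COST_PER_HR / (tokens_per_hr / 1e6)

print(f"${cost_per_million_tokens(0.10):.2f}/M tokens")  # ~$41.67
print(f"${cost_per_million_tokens(0.90):.2f}/M tokens")  # ~$4.63
```

Under these assumed numbers, moving from 10% to 90% utilization cuts the amortized hardware cost per token by roughly 9x, which is why the break-even argument hinges on utilization rather than raw chip speed.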
