The model being discussed analyzes video and image patches and makes predictions in an embedding space that captures semantic meaning, rather than operating directly on raw pixels. This approach is similar to Sora, since both models work at the level of an embedding space. The goal is to pass video or image patches through an encoder that produces meaningful embeddings, capturing the essence of the inputs.
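The core idea, patches flattened and projected into an embedding space, can be sketched in a few lines. This is a minimal illustration, not the actual encoder: the patch size, embedding dimension, and the single linear projection standing in for the full encoder are all assumptions made for clarity.

```python
import numpy as np

def patchify(frame, patch=8):
    """Split an H x W x C frame into flattened, non-overlapping patch vectors."""
    H, W, C = frame.shape
    rows, cols = H // patch, W // patch
    return (frame[:rows * patch, :cols * patch]
            .reshape(rows, patch, cols, patch, C)
            .swapaxes(1, 2)                       # group pixels by patch
            .reshape(rows * cols, patch * patch * C))

def encode(patches, W_proj):
    """Project flattened patches into the embedding space.
    A single linear layer stands in for a real learned encoder."""
    return patches @ W_proj

rng = np.random.default_rng(0)
frame = rng.random((32, 32, 3))            # one video frame or image
patches = patchify(frame)                  # (16, 192): 16 patches of 8*8*3 values
W_proj = rng.standard_normal((192, 64))    # hypothetical projection weights
embeddings = encode(patches, W_proj)       # (16, 64): one embedding per patch
print(embeddings.shape)                    # (16, 64)
```

Downstream prediction then happens on these 64-dimensional patch embeddings rather than on the 192 raw pixel values per patch, which is the distinction the paragraph above draws between embedding-space and pixel-space models.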