

#204 - OpenAI Audio, Rubin GPUs, MCP, Zochi
Baidu has launched new multimodal models priced to rival Western counterparts. OpenAI showcases audio models that make AI agents sound more human, while adding the expensive o1-pro to its API reflects a push toward profitability. Nvidia lays out its GPU roadmap, with Rubin due in 2026 and Rubin Ultra in 2027, and Apple's new Mac Studio can run DeepSeek R1. Travel restrictions on DeepSeek employees suggest heightened urgency in AI competition. Plus, Tencent's reported purchase of Nvidia H20 chips points to strong demand for AI hardware.
01:49:03
Baidu's Competitive AI Models
- Baidu launched Ernie 4.5, a multimodal model competitive with GPT-4.5.
- Ernie X1, their reasoning model, rivals DeepSeek R1 at half the price.
Baidu's AI Integration Strategy
- Baidu aims to integrate Ernie 4.5 and X1 into its product ecosystem, including Baidu Search.
- Ernie models offer significantly lower pricing per token compared to GPT-4.5.
OpenAI Expands into Audio
- OpenAI released two new speech-to-text models, GPT-4o Transcribe and GPT-4o Mini Transcribe.
- They also launched a text-to-speech model, GPT-4o Mini TTS, and a demo site, OpenAI.fm.
Intro
00:00 • 6min
AI Models in Competitive Pricing and Performance
05:35 • 21min
NVIDIA's Rubin Revolution
26:28 • 16min
Strategic Acquisition: Enhancing AI Capabilities in Video Technology
42:44 • 2min
Tencent's Strategic Chip Stockpiling Amid Rising AI Demand
44:41 • 2min
Advancements in AI Model Protocols
46:32 • 21min
Exploring Block Diffusion in Language Models
01:07:45 • 4min
AI Training: Challenges and Innovations
01:12:01 • 25min
AI Evaluation and Copyright Challenges
01:37:02 • 12min
#32801
EXAONE Deep
Reasoning Enhanced Language Models


LG AI Research
EXAONE Deep is a cutting-edge AI model developed by LG AI Research, designed to excel in mathematics, science, and coding tasks.
It outperforms larger models in various benchmarks, showcasing superior reasoning capabilities.
#43481
Mistral Small 3.1
A Compact Open-Source AI Model


Mistral AI Team
Mistral Small 3.1 is a 24-billion-parameter AI model designed for efficiency and versatility.
It supports multimodal understanding, processes both text and images, and operates efficiently on consumer-grade hardware.
The model is released under the Apache 2.0 license, making it freely available for development and customization.

#20566
• Mentioned in 2 episodes
o1
OpenAI's Advanced Reasoning Model


OpenAI Research Team
The o1 model, part of OpenAI's 'Strawberry' AI reasoning project, is designed to break down complex tasks into component parts and handle them step-by-step.
It excels in math, science, coding, and logic tasks, and has shown significant improvements over previous models in benchmarks such as GPQA and MMLU.
The model is available in iterations like o1-preview and o1-mini, with o1 Pro offering enhanced capabilities for users.
#39668
Model Context Protocol (MCP)
A Standard for AI Integrations

Anthropic
The Model Context Protocol (MCP) is an open standard designed to simplify the integration of AI models with various data sources and tools.
It provides a universal protocol for these connections, reducing the need for custom integrations and enhancing scalability and security.

#23123
• Mentioned in 2 episodes
Claude 3.5 Sonnet
An AI Model for Advanced Text and Code Generation

Anthropic
Claude 3.5 Sonnet is a sophisticated AI model developed by Anthropic, capable of producing high-quality text and code.
It excels in tasks such as writing, coding, and visual data extraction, making it a versatile tool for various applications.

#43686
Block Diffusion
Not Available


Not Available
This entry refers to a research paper rather than a book: "Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models", which covers block diffusion models for language modeling.

#19725
• Mentioned in 2 episodes
DeepSeek-R1
An Open-Source Reasoning Model

Liang Wenfeng
DeepSeek-R1 is a significant AI model release, utilizing reinforcement learning to achieve state-of-the-art reasoning capabilities.
It is part of a broader effort by DeepSeek to provide open-source solutions that match the performance of proprietary models like OpenAI's o1.
The model is notable for its ability to solve complex problems in mathematics and coding.
Our 204th episode with a summary and discussion of last week's big AI news!
Recorded on 03/21/2025
Hosted by Andrey Kurenkov and Jeremie Harris.
Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai
Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Join our Discord here! https://discord.gg/nTyezGSKwP
In this episode:
- Baidu launched two new multimodal models, Ernie 4.5 and Ernie X1, boasting competitive pricing and capabilities compared to Western counterparts like GPT-4.5 and DeepSeek R1.
- OpenAI introduced new audio models, including speech-to-text and text-to-speech systems, and added o1-pro to its developer API at a high price, reflecting a push toward profitability.
- Nvidia and Apple announced significant hardware advancements, including Nvidia's future GPU plans and Apple's new Mac Studio offering that can run DeepSeek R1.
- DeepSeek employees are facing travel restrictions, suggesting China is treating its AI development with increased secrecy and urgency, emphasizing a wartime footing in AI competition.
Timestamps + Links:
- (00:00:00) Intro / Banter
- (00:01:36) News Preview
- Tools & Apps
- (00:02:50) Baidu launches two new versions of its AI model Ernie
- (00:10:46) OpenAI Unveils New Audio Models to Make AI Agents Sound More Human Than Ever
- (00:16:41) OpenAI’s o1-pro is the company’s most expensive AI model yet
- (00:20:53) Google brings a ‘canvas’ feature to Gemini, plus Audio Overview
- (00:22:18) Anthropic adds web search to its Claude chatbot
- (00:23:55) xAI launches an API for generating images
- Applications & Business
- (00:26:28) Nvidia announces Rubin GPUs in 2026, Rubin Ultra in 2027, Feynman also added to roadmap
- (00:36:25) M3 Ultra Runs DeepSeek R1 With 671 Billion Parameters Using 448GB Of Unified Memory, Delivering High Bandwidth Performance At Under 200W Power Consumption, With No Need For A Multi-GPU Setup
- (00:40:07) Intel reaches 'exciting milestone' for 18A 1.8nm-class wafers with first run at Arizona fab
- (00:42:45) Elon Musk’s AI company, xAI, acquires a generative AI video startup
- (00:44:44) Tencent Reportedly Makes Massive NVIDIA H20 Chip Purchase for WeChat’s DeepSeek Integration
- Projects & Open Source
- Research & Advancements
- (00:55:58) Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification
- (01:07:44) Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
- (01:12:27) Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo
- (01:18:46) Transformers without Normalization
- (01:19:52) Measuring AI Ability to Complete Long Tasks
- (01:26:12) HCAST: Human-Calibrated Autonomy Software Tasks
- Policy & Safety
- Synthetic Media & Art