#198 - DeepSeek R1 & Janus, Qwen2.5, OpenAI Agents

Feb 2, 2025

DeepSeek has launched R1, a competitive AI model causing a stir as tech stocks plummet, including a significant drop for NVIDIA. OpenAI's new tool, Operator, aims to enhance user experiences amidst rising competition. In a surprising move, President Trump has revoked the Biden administration's AI executive order, hinting at a shift in policy. Meanwhile, Taiwan's TSMC is permitted to produce advanced 2-nanometer chips abroad, highlighting the global semiconductor landscape and its geopolitical implications.

01:37:26

ANECDOTE

Listener Critique and DeepSeek Coverage

A listener criticized the podcast for being "behind the curve," citing the hardware episode before DeepSeek R1.
Andrey Kurenkov pointed out that the podcast had actually covered DeepSeek V3 and its significance.

INSIGHT

DeepSeek R1's Reasoning Optimization

DeepSeek R1, comparable to OpenAI's o1, uses reinforcement learning (RL) to optimize reasoning in LLMs.
It showcases RL's potential, achieving impressive results with relatively few resources.

INSIGHT

Reinforcement Learning's Power in DeepSeek R1

DeepSeek R1's success demonstrates the power of reinforcement learning (RL) for reasoning.
Simply rewarding correct answers organically encourages chain-of-thought-like reasoning.
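
As a rough illustration of what "simply rewarding correct answers" can mean, here is a minimal sketch of an outcome-only reward function. It is not DeepSeek's actual reward code. The `Answer:` convention and the names `extract_final_answer` and `outcome_reward` are assumptions made for this example.

```python
import re
from typing import Optional

# Hypothetical convention: the model states its result on a line "Answer: <value>".
ANSWER_PATTERN = re.compile(r"Answer:\s*(.+?)\s*$", re.IGNORECASE | re.MULTILINE)


def extract_final_answer(response: str) -> Optional[str]:
    """Return the last 'Answer: ...' value in the response, or None if absent."""
    matches = ANSWER_PATTERN.findall(response)
    return matches[-1].strip() if matches else None


def outcome_reward(response: str, reference: str) -> float:
    """Score only the final answer; the reasoning text itself is never inspected."""
    answer = extract_final_answer(response)
    if answer is None:
        return 0.0
    return 1.0 if answer == reference.strip() else 0.0


if __name__ == "__main__":
    sample = "6 times 7 is 42, so the result is 42.\nAnswer: 42"
    print(outcome_reward(sample, "42"))
```

Because the reward depends only on the final answer, any intermediate reasoning that makes correct answers more likely is reinforced indirectly, which is the effect the snip describes.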

Casual Reflections and Listener Feedback on AI Discussions
02:39 • 2min

AI Model Advancements and Hardware Implications
04:18 • 2min

Exploring DeepSeek R1: Reinforcement Learning and Enhanced Reasoning
06:21 • 3min

Reinforcement Learning and AI Optimization
09:10 • 31min

The Landscape of AI: Risks, Innovations, and Competition
39:58 • 20min

AI Race: Europe's Struggles and Global Dynamics
59:30 • 32min

The Geopolitical Landscape of Semiconductors
01:31:51 • 6min

Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

Janus-Pro is an advanced version of the previous Janus model, incorporating optimized training strategies, expanded training data, and a larger model size. It achieves significant advancements in multimodal understanding and text-to-image instruction-following capabilities, while enhancing the stability of text-to-image generation. The model uses a decoupled visual encoding architecture, separate pathways for visual understanding and generation, and leverages synthetic data to improve performance.

DeepSeek-V3

A Mixture-of-Experts Large Language Model

DeepSeek Team

DeepSeek-V3 is an open-source large language model that leverages a Mixture-of-Experts (MoE) architecture. It features 671 billion parameters, with 37 billion activated per token, and incorporates innovative techniques such as Multi-Head Latent Attention (MLA), auxiliary-loss-free load balancing, and a novel Multi-Token Prediction (MTP) objective. The model was pre-trained on 14.8 trillion tokens and outperforms other open-source models on various benchmarks, including coding and mathematics tasks. It also employs fine-grained quantization using FP8 and improved parallelism and cross-node communication for efficient training and inference.

Lucy

Jamaica Kincaid

In 'Lucy', Jamaica Kincaid tells the story of Lucy Josephine Potter, a nineteen-year-old from the West Indies who becomes an au pair for a wealthy white family in North America. The novel explores Lucy's journey as she navigates her new environment, scrutinizes the lives of her employers, and unravels the mysteries of her own sexuality. It is a portrait of a young woman's search for identity, independence, and self-discovery, set against the backdrop of cultural and societal contrasts.

Anthropic Computer Use Model

A New Capability for AI Agents

Anthropic Research Team

The Anthropic computer use model, part of Claude 3.5 Sonnet, enables AI agents to interpret and interact with computer screens, execute tasks, and follow user instructions. This model combines visual reasoning and natural language understanding to automate repetitive processes, build and test software, and conduct open-ended tasks like research. It is designed with constitutional AI for safer and more aligned task completion.

Qwen2.5

A Series of Large Language Models

Alibaba's Qwen Team

Qwen2.5 is the latest series of large language models from Alibaba's Qwen team, featuring significant improvements in knowledge, coding, mathematics, instruction following, and long-text generation. These models support context lengths of up to 128K tokens, can generate up to 8K tokens, and offer multilingual support for over 29 languages. Specialized models include Qwen2.5-Coder for coding and Qwen2.5-Math for mathematics, both of which have undergone substantial enhancements compared to their predecessors.

Janus Pro

DeepSeek's Revolutionary Multimodal AI Model

DeepSeek AI Lab

Janus Pro is a cutting-edge AI model developed by DeepSeek, distinguished by its unified transformer architecture and decoupled visual encoding. It separates the tasks of understanding and generating images into distinct pathways, enhancing performance in both image comprehension and creation. The model has been trained on an extensive dataset of over 90 million samples, including 72 million synthetic aesthetic data points, and it outperforms other models like DALL-E 3 and Stable Diffusion in various benchmarks.

Machines of Loving Grace: How AI Could Transform the World for the Better

Dario Amodei

In this comprehensive essay, Dario Amodei presents a detailed vision of how artificial intelligence could accelerate progress and transform the world for the better. He explores potential AI-driven advances in five key areas: biology and physical health, neuroscience and mental health, economic development and poverty, peace and governance, and work and meaning. Amodei discusses plausible futures where AI could lead to significant improvements, such as eliminating infectious diseases, doubling human lifespan, and enhancing mental health. He also addresses the need for societal adjustments and responsible AI development to navigate these transformative changes effectively.

Project Stargate and Remote Viewing Technology

The CIA's Files on Psychic Spying

Axel Balthazar

This book compiles the government's formerly classified documents pertaining to the Stargate Project, a thirty-year series of classified projects initiated by the CIA in response to Soviet research into psychotronics and remote viewing. The project involved investigations into remote viewing, psychic spying, The First Earth Battalion, psi phenomena, extrasensory perception, and psychokinesis. The book exposes a wide range of topics related to these psychic phenomena and their potential use for military and intelligence purposes.

Executive Order on the Safe, Secure, and Trustworthy Development and Use of AI

Protecting Americans from Potential Risks of AI Systems

Joe Biden

The executive order, issued on October 30, 2023, mandates the development of standards, tools, and tests to ensure AI systems are safe, secure, and trustworthy. It emphasizes the protection of Americans' privacy, advancement of equity and civil rights, and promotion of innovation and competition. The order requires developers to share safety test results with the U.S. government and highlights the importance of AI in bolstering cybersecurity, detecting AI-enabled fraud, and enhancing software and network security. It also champions international collaborations to set global standards for AI safety and cybersecurity.

Initial Rescissions of Harmful Executive Orders and Actions

Donald Trump

This executive order, issued by President Donald Trump, revokes numerous executive orders and actions implemented during the Biden Administration. It includes the rescission of orders related to racial equity, environmental justice, climate change, and other policy areas. The order also imposes a regulatory freeze and directs agencies to review and potentially rescind rules that impose undue burdens on domestic energy resources and other areas.

OpenAI o1 Model

Advanced Reasoning and Chain-of-Thought Processing

OpenAI

The OpenAI o1 models are trained using large-scale reinforcement learning to perform complex reasoning. These models 'think' before responding, breaking down problems into smaller steps and solving them iteratively. This approach enhances their performance in tasks requiring detailed reasoning, such as coding challenges, math problems, and scientific research. The models include o1 and o1-mini, with the latter being optimized for speed and efficiency, particularly in coding tasks. They are pre-trained on diverse datasets, including public, proprietary, and custom datasets, to ensure robust reasoning and conversational capabilities.

Game changers

What Leaders, Innovators, and Mavericks Do to Win at Life

Dave Asprey

Game Changers by Dave Asprey is a comprehensive guide that distills the wisdom from over 450 interviews with highly successful leaders, innovators, and mavericks. The book focuses on three main objectives: becoming smarter, faster, and happier. It offers 46 science-backed 'laws' that provide practical strategies for optimizing diet, exercise, sleep habits, and mental performance. Asprey combines insights from human biology and psychology with real-world examples to help readers upgrade their 'operating system' to better align with modern goals. The book covers a wide range of topics, including taming fear and anxiety, making better decisions, establishing high-performance habits, and practicing gratitude and mindfulness.

The bitter lesson

National Union of Teachers.

This publication by the National Union of Teachers focuses on the issues of teacher turnover and the effects of the London Allowance. It presents a sample survey and analysis aimed at understanding the factors influencing teacher retention and the financial incentives provided by the London Allowance.

Impromptu

Amelia Rosselli

Our 198th episode with a summary and discussion of last week's big AI news!
Recorded on 01/31/2025

Join our brand new Discord here! https://discord.gg/nTyezGSKwP

Hosted by Andrey Kurenkov and Jeremie Harris.
Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

Read our text newsletter and comment on the podcast at https://lastweekin.ai/.

In this episode:

- DeepSeek releases R1, a competitive AI model comparable to OpenAI’s o1, leading to market unrest and significant drops in tech stocks, including a 17% plunge in NVIDIA's stock.
- OpenAI launches Operator to facilitate agentic computer use, while facing competition from new releases by DeepSeek and Qwen, with applications seeing rapid adoption.
- President Trump revokes the Biden administration's executive order on AI, signaling a shift in AI policy and deregulation efforts.
- Taiwanese government clears TSMC to produce advanced 2-nanometer chip technology abroad, aiming to strengthen global semiconductor supply amidst geopolitical tensions.

If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

Timestamps + Links:

(00:00:00) Intro / Banter
(00:03:01) Response to listener comments
Projects & Open Source
- (00:06:26) DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- (00:30:25) Viral AI company DeepSeek releases new image model family
- (00:34:07) Qwen2.5-1M Technical Report
- (00:38:32) Alibaba’s Qwen team releases AI models that can control PCs and phones
Tools & Apps
- (00:42:09) OpenAI launches Operator, an AI agent that performs tasks autonomously
- (00:47:37) DeepSeek reaches No. 1 on US Play Store
- (00:52:17) Alibaba rolled out Qwen Chat v0.2 and Qwen2.5-1M model
- (00:53:50) Perplexity launches US-hosted DeepSeek R1, hints at EU hosting soon
- (00:55:31) Apple is pulling its AI-generated notifications for news after generating fake headlines
- (00:59:00) French AI ‘Lucie’ looks très chic, but keeps getting answers wrong
Applications & Business
Policy & Safety
(01:33:01) Outro
