Interconnects

Latest episodes

10 snips
Jan 22, 2025 • 1h 13min

Interviewing OLMo 2 leads: Open secrets of training language models

Luca Soldaini, the Data lead for the OLMo project at AI2, joins the discussion to unveil the intricacies of training language models. He shares tales of overcoming challenges in pretraining efficiency and the quest for stability, especially after a significant 70B model attempt. The conversation dives into the strategic decisions behind building effective language modeling teams, the intricate balance of deep versus wide network architectures, and the importance of community-driven advancements in AI.
7 snips
Jan 21, 2025 • 20min

DeepSeek R1's recipe to replicate o1 and the future of reasoning LMs

Discover the latest in AI with the launch of a groundbreaking reasoning language model, R1, featuring a unique four-stage reinforcement learning approach. The discussion dives into how this innovation could disrupt the market with competitive pricing and open-source implications. The conversation also touches on advancements in reasoning models and the fine-tuning processes that enhance their capabilities, hinting at exciting developments for researchers and companies alike.
Jan 15, 2025 • 10min

Let me use my local LMs on Meta Ray-Bans

Exploring the intersection of AI and wearable tech, the conversation highlights how new gadgets like Meta's Ray-Bans are transforming our interaction with technology. The role of local language models emerges as essential for enhanced privacy and efficiency. Insights reveal that while the current devices feel a step behind, their potential impact is likened to the revolutionary iPod and iPad. The excitement surrounding these innovations echoes the initial buzz of AirPods, signaling a shift in how we perceive and utilize AI in daily life.
10 snips
Jan 9, 2025 • 17min

(Voiceover) DeepSeek V3 and the actual cost of training frontier AI models

Discover the groundbreaking innovations behind DeepSeek V3 and its impressive learning efficiency. The discussion dives into the complex financial aspects of training frontier AI models, shedding light on the true costs involved. Get insights into how these advancements could shape the future of AI development and the importance of transparency in computational resources. It's a fascinating look at technology's evolution and its implications for the industry.
13 snips
Jan 8, 2025 • 54min

The state of post-training in 2025

Explore the exciting advancements in post-training for language models as experts discuss reinforcement learning from human feedback (RLHF) and preference tuning. Gain insights into the complexities of these techniques and the challenges of data acquisition and metric evaluation. The conversation highlights a promising future for open recipes and knowledge in the field in 2025. It's an optimistic take as the scientific community continues to push the boundaries of understanding and effective training methods.
25 snips
Jan 2, 2025 • 16min

Quick recap on the state of reasoning

The discussion dives into the intriguing intersection of reasoning, inference, and post-training in AI. It challenges the myth that language models lack reasoning capabilities, emphasizing their potential to manipulate tokens to draw conclusions. The speaker highlights advancements in reinforcement learning and how they enhance model performance. Future developments in Reasoning Language Models (RLMs) are also a hot topic, suggesting a shift in understanding AI capabilities is on the horizon.
6 snips
Dec 31, 2024 • 6min

(Voiceover) 2024 Interconnects year in review

The podcast reflects on the significant milestones in AI for 2024, featuring the launch of OpenAI's o1 model. It delves into the advancements in reinforcement learning and the impact of open-source policies. The discussion highlights how these developments have shaped the AI landscape, offering insights into future trends and growth in the field.
Dec 20, 2024 • 18min

(Voiceover) OpenAI's o3: The grand finale of AI in 2024

Original post: https://www.interconnects.ai/p/openais-o3-the-2024-finale-of-ai

Chapters
00:00 Introduction
02:51 o3 overview
05:57 Solving the Abstraction and Reasoning Corpus (ARC)
10:41 o3’s architecture, cost, and training (hint: still no tree search)
16:36 2024: RL returns

Figures
Fig 1, Frontier Math results
Fig 2, Coding results
Fig 3, ARC AGI results
Fig 4, ARC AGI result details
Fig 5, ARC AGI example 1
Fig 6, ARC AGI example in text
Fig 7, ARC AGI example “easy”

Get full access to Interconnects at www.interconnects.ai/subscribe
16 snips
Dec 18, 2024 • 11min

(Voiceover) The AI agent spectrum

Dive into the intriguing world of AI agents and their diverse applications. Explore how the categorization of these agents is evolving, with a focus on their complexities and future potential. Discover the dynamics of feedback in reinforcement learning, and the differences between closed and open-ended agents. The discussion also delves into regulation and societal impact, shedding light on user experiences and expectations for AI. Prepare for a thought-provoking look at the next frontier of artificial intelligence.
Dec 11, 2024 • 13min

(Voiceover) OpenAI's Reinforcement Finetuning and RL for the masses

Original post: https://www.interconnects.ai/p/openais-reinforcement-finetuning

Chapters
00:00 Introduction
04:19 The impact of reinforcement finetuning’s existence
07:29 Hypotheses on reinforcement finetuning’s implementation

Figures
Fig. 1, Yann’s Cake
Fig. 2, Grader config
Fig. 3, RLVR learning curves
