"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Titans: Neural Long-Term Memory for LLMs, with author Ali Behrouz

May 15, 2025
In this engaging discussion, Ali Behrouz, a PhD student from Cornell University, shares insights from his research on enhancing memory in large language models. He introduces the Titans architecture, which mimics human-like memory to tackle coherence challenges in AI. Key topics include the limitations of current models and innovative solutions to catastrophic forgetting. Behrouz also highlights the potential of specialized LLMs in corporate settings, revealing the necessity for improved memory mechanisms to unlock AI's full capabilities in the workplace.
INSIGHT

Neural Network as Memory Module

  • Titans introduces a neural network as the memory module for LLMs, one whose weights are updated by gradient descent at runtime (a minimal sketch follows this list).
  • This differs fundamentally from previous models, where memory is a mere vector or matrix.
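To make the contrast concrete, here is a minimal PyTorch sketch (not the authors' code; dimensions and layer sizes are illustrative) of a classical matrix memory next to an MLP whose weights serve as the memory:

```python
# Minimal sketch (illustrative, not the authors' code): a classical
# matrix memory vs. a neural-network memory whose weights store state.
import torch
import torch.nn as nn

d = 64  # hypothetical key/value dimension

# Classical memory: a single d x d matrix written via outer products,
# as in linear-attention-style recurrent models.
M = torch.zeros(d, d)
k, v = torch.randn(d), torch.randn(d)
M = M + torch.outer(v, k)   # write: bind value to key
recalled = M @ k            # read: query with the key

# Titans-style memory: a small MLP whose *parameters* are the memory;
# reads are forward passes, writes are runtime gradient steps (next snip).
memory_mlp = nn.Sequential(
    nn.Linear(d, 2 * d),
    nn.SiLU(),
    nn.Linear(2 * d, d),
)
recalled = memory_mlp(k)    # read: query the network with a key
```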
INSIGHT

Human Memory Inspires Neural Memory

  • Human memory likely operates as a network of interconnected neurons rather than as isolated units.
  • Viewing memory as a neural network motivates moving beyond vector/matrix memories toward richer, more dynamic long-term memory.
INSIGHT

Runtime Gradient Updates in Memory

  • Applied recurrently over the sequence, gradient-descent updates train the memory MLP to approximate attention's value vectors given the corresponding keys (see the sketch after this list).
  • Because only a fixed set of weights changes, this runtime update mirrors human-like continual learning within a finite memory size.
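A hedged sketch of that runtime write, assuming a squared-error associative loss ||memory(k) − v||² and a plain gradient step; the paper's momentum ("surprise") and forgetting terms are omitted, and the step size and dimensions are illustrative:

```python
# Hedged sketch of the runtime write: one gradient step on the
# associative loss ||memory(k) - v||^2, taken at inference time.
# Momentum and weight-decay forgetting from the paper are omitted;
# sizes and the step size below are illustrative.
import torch
import torch.nn as nn

d = 64
memory = nn.Sequential(nn.Linear(d, 2 * d), nn.SiLU(), nn.Linear(2 * d, d))
step_size = 1e-2  # hypothetical inner-loop learning rate

def write(memory: nn.Module, k: torch.Tensor, v: torch.Tensor) -> None:
    """One runtime write: nudge the weights so memory(k) moves toward v."""
    loss = (memory(k) - v).pow(2).sum()
    grads = torch.autograd.grad(loss, list(memory.parameters()))
    with torch.no_grad():
        for p, g in zip(memory.parameters(), grads):
            p -= step_size * g  # parameter count never grows

# Stream of (key, value) pairs projected from incoming tokens.
for _ in range(100):
    k, v = torch.randn(d), torch.randn(d)
    write(memory, k, v)
```

Every write reuses the same fixed set of weights, so capacity is finite by construction; in the paper, the momentum and decay terms then govern what gets reinforced and what gets forgotten.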