AI + a16z cover image

AI + a16z

Latest episodes

undefined
54 snips
Jun 27, 2025 • 47min

AI's Unsung Hero: Data Labeling and Expert Evals

Manu Sharma, CEO of Labelbox and an expert in data labeling for AI, joins to discuss the evolution of data labeling from supervised learning to advanced reinforcement learning. He highlights how the shift to foundation models and generative AI has transformed the industry, emphasizing the emerging role of 'aligners'—top professionals who ensure high-quality training data. Manu also touches on the competitive landscape of AI, particularly the impact of recent acquisitions like Meta's purchase of Scale AI, underscoring the growing importance of data and talent in the AGI race.
undefined
243 snips
Jun 20, 2025 • 35min

AI, Data Engineering, and the Modern Data Stack

Tristan Handy, co-founder and CEO of dbt Labs, dives into the evolving world of data engineering alongside Jennifer Li and Matt Bornstein. They discuss the pivotal role of AI in enhancing data workflows, stressing the importance of human oversight in validating outputs. Handy highlights the transformative power of automation and tools like SQL compilers in reshaping engineering tasks. They also touch on recent industry acquisitions and the future implications for data architecture, blending operational and analytical workloads.
undefined
88 snips
Jun 13, 2025 • 26min

Enabling Agents and Battling Bots on an AI-Centric Web

David Mytton, CEO of Arcjet, dives into the complexities of web access and bot detection. He discusses the growing need for advanced security measures in an AI-driven landscape, where distinguishing between beneficial bots and malicious ones is crucial. Mytton emphasizes the importance of low-latency, full-context security checks for effective fraud prevention. The conversation also highlights innovative methods for managing automated traffic and the evolving role of AI agents in the digital environment.
undefined
173 snips
Jun 6, 2025 • 36min

Giving New Life to Unstructured Data with LLMs and Agents

Anant Bhardwaj, Founder and CEO of Instabase, specializes in automating the management of unstructured data. In this engaging discussion, he delves into how large language models (LLMs) are transforming the processing of unstructured documents, enabling innovations like identity verification via WhatsApp. Bhardwaj shares insights on the limitations of traditional robotic process automation and the significance of predictability in AI solutions. He envisions a future where AI agents autonomously handle complex workflows, reshaping enterprise automation.
undefined
46 snips
May 30, 2025 • 1h 42min

Beyond Leaderboards: LMArena’s Mission to Make AI Reliable

Anastasios N. Angelopoulos, a UC Berkeley professor and AI researcher, along with LMArena cofounders Wei-Lin Chiang and Ion Stoica, delve into innovative AI evaluation methods. They discuss transitioning from static benchmarks to dynamic user feedback for better model reliability. Fresh data and community engagement are emphasized as essential for AI development. The conversation highlights personalized leaderboards, real-time testing challenges, and the importance of scaling their platform to meet diverse user needs and preferences, all while fostering an inclusive approach to AI.
undefined
359 snips
May 23, 2025 • 48min

Building AI Systems You Can Trust

Scott Clark, Cofounder and CEO of Distributional, and Matt Bornstein, a Partner at a16z, discuss the pivotal role of trust in AI systems, moving beyond just performance metrics. They delve into the hidden complexities of generative AI behaviors and the critical need for robust evaluation frameworks. Topics include the pitfalls of traditional testing methods, the rise of 'shadow AI,' and practical strategies for scaling AI from prototypes to real-world applications. Their insights shed light on managing reliability and addressing the challenges of enterprise AI adoption.
undefined
254 snips
May 16, 2025 • 45min

Who's Coding Now? AI and the Future of Software Development

Guido Appenzeller, an Infra partner at a16z and a computer science expert, joins fellow partner Matt Bornstein, an AI application specialist, to explore how generative AI is revolutionizing software development. They discuss the rise of 'prompt-based programming' and its impact on developer productivity. The duo delves into the permanence of formal programming languages amidst AI advancements and the complexities of integrating AI into enterprise systems. They also touch on how these changes will reshape coding education and the future landscape of technology.
undefined
480 snips
May 2, 2025 • 54min

MCP Co-Creator on the Next Wave of LLM Innovation

David Soria Parra, creator of the Model Context Protocol (MCP) at Anthropic, shares his insights on revolutionizing AI applications. He discusses how MCP enables seamless integration of AI with existing tools, likening it to the API ecosystem. The conversation highlights innovative uses of MCP in creative tools like Blender and Ableton, fostering new artistic expressions. Furthermore, they delve into how agents communicate, balancing natural and programming languages, and underscore the importance of community participation in evolving this groundbreaking protocol.
undefined
391 snips
Apr 28, 2025 • 36min

What Is an AI Agent?

In this episode of AI + a16z, a16z Infra partners Guido Appenzeller, Matt Bornstein, and Yoko Li discuss and debate one of the tech industry's buzziest words right now: AI agents. The trio digs into the topic from a number of angles, including:Whether a uniform definition of agent actually existsHow to distinguish between agents, LLMs, and functionsHow to think about pricing agentsWhether agents can actually replace humans, andThe effects of data siloes on agents that can access the web.They don't claim to have all the answers, but they raise many questions and insights that should interest anybody building, buying, and even marketing AI agents.Learn more:Benchmarking AI Agents on Full-Stack CodingAutomating Developer Email with MCP and Al AgentsA Deep Dive Into MCP and the Future of AI ToolingAgent Experience: Building an Open Web for the AI EraDeepSeek, Reasoning Models, and the Future of LLMsAgents, Lawyers, and LLMsReasoning Models Are Remaking Professional ServicesFrom NLP to LLMs: The Quest for a Reliable ChatbotCan AI Agents Finally Fix Customer Support?Follow everybody on X:Guido AppenzellerMatt BornsteinYoko Li Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.
undefined
158 snips
Mar 28, 2025 • 33min

Benchmarking AI Agents on Full-Stack Coding

Sujay Jayakar, co-founder and Chief Scientist at Convex, dives into the future of autonomous coding. He discusses the challenges AI agents face with full-stack development and the significance of robust evaluation methods like Fullstack Bench. Jayakar emphasizes how type safety can reduce errors and improve consistency. He shares insights on which AI models excel in real-world app-building, and why treating your toolchain as part of the prompt could transform development workflows. Perfect for developers looking to enhance their projects with AI!

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app