Chain of Thought cover image

Chain of Thought

Latest episodes

undefined
Feb 19, 2025 • 46min

Do I Need Agents? | Gartner’s Haritha Khandabattu

Haritha Khandabattu, Senior Director Analyst for AI at Gartner, dives into the fascinating world of AI agents. She untangles the hype around these technologies and offers real-world examples of successful implementations. Listen as she discusses critical considerations for engineering leaders, the challenge of building versus buying AI solutions, and the necessity of collaboration between data and software teams. Haritha also tackles ethical concerns and provides practical tips to enhance productivity while integrating AI into organizations.
undefined
Feb 12, 2025 • 41min

The Making of Gemini 2.0: DeepMind's Approach to AI Development and Deployment | Logan Kilpatrick

Logan Kilpatrick, Senior Product Manager at Google DeepMind, shares fascinating insights into the making of Gemini 2.0. He discusses Gemini's strength as a premier AI model, showcasing its multimodal capabilities and unique function calling approach. Logan highlights the role of Google's hardware in enhancing performance and long-context capabilities. The conversation also touches on the potential of vision-first AI agents and how Gemini is set to revolutionize developer experiences by integrating seamlessly into existing ecosystems.
undefined
Feb 5, 2025 • 33min

DeepSeek Fallout, Export Controls & Agentic Evals

Hosts dive into the significant impact of DeepSeek's latest R1 model on the open-source AI landscape. They discuss export controls and their mixed effects on global innovation, hinting at a shift towards "Agents as a Service." The necessity for robust evaluation frameworks for increasingly complex agentic systems takes center stage, revealing challenges in measuring performance. The launch of customizable evaluation tools is highlighted as a game-changer for developers, promising a safer trajectory for AI agents.
undefined
Jan 29, 2025 • 34min

AI, Open Source & Developer Safety | Block’s Rizel Scarlett

As DeepSeek so aptly demonstrated, AI doesn’t need to be closed source to be successful. This week, Rizel Scarlett, a Staff Developer Advocate at Block, joins Conor Bronsdon to discuss the intersections between AI, open source, and developer advocacy. Rizel shares her journey into the world of AI, her passion for empowering developers, and her work on Block's new AI initiative, Goose, an on-machine developer agent designed to automate engineering tasks and enhance productivity. Conor and Rizel also explore how AI can enable psychological safety, especially for junior developers. Building on this theme of community, they also dive into topics such as responsible AI development, ethical considerations in AI, and the impact of community involvement when building open source developer tools. Chapters: 00:00 Rizel's Role at Block 02:41 Introducing Goose: Block's AI Agent 06:30 Psychological Safety and AI for Developers 11:24 AI Tools and Team Dynamics 17:28 Open Source AI and Community Involvement 25:29 Future of AI in Developer Communities 27:47 Responsible and Ethical Use of AI 31:34 Conclusion Follow Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Rizel Scarlett LinkedIn: https://www.linkedin.com/in/rizel-bobb-semple/ Website: https://blackgirlbytes.dev/ Show Notes Learn more about Goose: https://block.github.io/goose/
undefined
Jan 15, 2025 • 33min

AI in 2025: Agents & The Rise of Evaluation Driven Development

"In the next three to five years, every piece of software that is built on this planet will have some sort of AI baked into it." - Atin Sanyal Chain of Thought is back for its second season, and this episode dives headfirst into the possibilities AI holds for 2025 and beyond. Join Conor Bronson as he chats with Galileo co-founders Yash Sheth (COO) and Atindriyo Sanyal (CTO) about major trends to look for this year. These include AI finding its product "tool stack" fit, generation latency decreasing, AI agents, their potential to revolutionize code generation and other industries, and the crucial role of robust evaluation tools in ensuring the responsible and effective deployment of these agents. Yash and Atin also highlight Galileo's focus on building trust and security in AI applications through scalable evaluation intelligence. They emphasize the importance of quantifying application behavior, enforcing metrics in production, and adapting to the evolving needs of AI development. Finally, they discuss Galileo's vision for the future and their active pursuit of partnerships in 2025 to contribute to a more reliable and trustworthy AI ecosystem. Chapters: 00:00 AI Trends and Predictions for 2025 02:55 Advancements in LLMs and Code Generation 05:16 Challenges and Opportunities in AI Development 10:40 Evaluating AI Agents and Applications 16:07 Building Evaluation Intelligence 23:41 Research Opportunities 29:50 Advice for Leveraging AI in 2025 32:00 Closing Remarks Show Notes: Check out Galileo⁠⁠⁠⁠⁠⁠⁠⁠⁠ Follow Yash Follow Atin Follow Conor
undefined
Jan 8, 2025 • 35min

Now is the Time to Build | Weaviate’s Bob van Luijt

Join Bob van Luijt, CEO and co-founder of Weaviate, an AI-native database innovator, as he dives into the future of AI infrastructure. He passionately asserts that now is the time to build and adapt to evolving tech. Bob discusses the importance of generative feedback loops and agent architectures, which could revolutionize data management. They also tackle the challenges of documentation and developer experience as key factors for successful AI implementation. Prepare for insights that inspire action and innovation in the AI landscape!
undefined
Dec 18, 2024 • 42min

How AI Assistants Can Enhance Human Connection | Twilio’s Vinnie Giarrusso

Can AI assistants actually enhance human connection? As Season 1 of Chain of Thought comes to a close, Conor Bronsdon and Vinnie Giarrusso (Twilio) explore the transformative potential of AI assistants in the workplace. Discover how these assistants function as "async junior digital employees," taking on specific tasks and contributing to the organizational structure. But will AI assistants ultimately replace human connection? Vinnie argues the opposite is true, suggesting that AI can liberate employees from mundane tasks, allowing them to focus on building meaningful relationships and providing personalized experiences. This thought-provoking conversation takes a philosophical turn as Vinnie explores how AI could revolutionize education while potentially disrupting traditional mentorship roles. He shares his vision for a future where AI democratizes information and empowers individuals to personalize their learning journey. Finally, learn how Twilio and Galileo are partnering to shape the future of AI and what this collaboration means for both companies. Chain of Thought will be taking a break for the holidays, but we'll see you back here on January 8th for the start of Season 2! Chapters: 00:00 Twilio's AI Agent Platform 06:34 Ensuring Accuracy and Trustworthiness 09:49 Challenges and Failure Modes 17:39 Future of Fully Autonomous Agents 22:18 Human-AI Collaboration and Mentorship 31:24 Education and Democratization of Information 32:58 Partnership with Galileo 39:54 Conclusion and Season Wrap-Up Follow: Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Vinnie Giarrusso: https://www.linkedin.com/in/vinniegiarrusso/ Show notes: Twilio Alpha: ⁠https://twilioalpha.com OWASP GenAI: https://genai.owasp.org
undefined
Dec 11, 2024 • 51min

Lessons from Deploying AI at Enterprise Scale | ServiceTitan, Indeed & Twilio

This week, a panel of experts (Mehmet Murat Ezbiderli, ServiceTitan; Grant Ledford, Indeed; and Vinnie Giarrusso, Twilio) join Atin Sanyal (CTO, Galileo) and Conor Bronsdon (Developer Awareness, Galileo) to explore the challenges and opportunities of deploying GenAI at enterprise scale in a conversation that's a wake-up call for any business leader looking to harness the power of AI. Together, Atin & Conor break down key considerations like performance, cost, and model selection, emphasizing the need for robust evaluation frameworks and a shift in developer mindset. Atin then sits down with our panel of AI engineering experts to discuss their firsthand experiences with enterprise AI, including the trade-offs of building AI systems, the evolving tools and frameworks available, and the impact these technologies are having on their organizations. Chapters: 00:00 Enterprise Scale Deployment 05:17 Cost, Performance, and Model Selection 08:59 Building and Integrating GenAI Systems 15:26 Emerging Enterprise Use Cases 18:12 Predictions for AI in 2025 27:28 Panel Discussion: Deploying AI at Enterprise Scale 31:19 Gen AI Solutions and Challenges 33:12 Building & Deploying Traditional Infrastructure vs GenAI Infrastructure 34:36 How to Assemble Your GenAI Stack 40:39 Today's Best GenAI Use Cases 48:15 Enterprise AI Trends for 2025 50:36 Closing Remarks and Future Outlook Follow: Atin Sanyal: ⁠⁠⁠https://www.linkedin.com/in/atinsanyal/⁠ Mehmet Murat Ezbiderli: https://www.linkedin.com/in/mehmet-murat-ezbiderli-b894a49/ Grant Ledford: https://www.linkedin.com/in/grant-ledford-36b146a5/ Vinnie Giarrusso: https://www.linkedin.com/in/vinniegiarrusso/ Show notes: Watch all of Productionize: https://www.galileo.ai/genai-productionize-2-0
undefined
Dec 4, 2024 • 48min

Practical Lessons for GenAI Evals | Chip Huyen & Vivienne Zhang

As AI agents and multimodal models become more prevalent, understanding how to evaluate GenAI is no longer optional – it's essential.  Generative AI introduces new complexities in assessment compared to traditional software, and this week on Chain of Thought we’re joined by Chip Huyen (Storyteller, Tép Studio), Vivienne Zhang (Senior Product Manager, Generative AI Software, Nvidia) for a discussion on AI evaluation best practices.  Before we hear from our guests, Vikram Chatterji (CEO, Galileo) and Conor Bronsdon (Developer Awareness, Galileo) give their takes on the complexities of AI evals and how to overcome them through the use of objective criteria in evaluating open-ended tasks, the role of hallucinations in AI models, and the importance of human-in-the-loop systems. Afterwards, Chip and Vivienne sit down with Atin Sanyal (Co-Founder & CTO, Galileo) to explore common evaluation approaches, best practices for building frameworks, and implementation lessons. They also discuss the nuances of evaluating AI coding assistants and agentic systems. Chapters: 00:00 Challenges in Evaluating Generative AI 05:45 Evaluating AI Agents 13:08 Are Hallucinations Bad? 17:12 Human in the Loop Systems 20:49 Panel discussion begins 22:57 Challenges in Evaluating Intelligent Systems 24:37 User Feedback and Iterative Improvement 26:47 Post-Deployment Evaluations and Common Mistakes 28:52 Hallucinations in AI: Definitions and Challenges 34:17 Evaluating AI Coding Assistants 38:15 Agentic Systems: Use Cases and Evaluations 43:00 Trends in AI Models and Hardware 45:42 Future of AI in Enterprises 47:16 Conclusion and Final Thoughts Follow: Vikram Chatterji: https://www.linkedin.com/in/vikram-chatterji/ Atin Sanyal: ⁠⁠https://www.linkedin.com/in/atinsanyal/ Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Chip Huyen: ⁠https://www.linkedin.com/in/chiphuyen/⁠ Vivienne Zhang: ⁠⁠https://www.linkedin.com/in/viviennejiaozhang/ Show notes: Watch all of Productionize 2.0: ⁠https://www.galileo.ai/genai-productionize-2-0⁠
undefined
Nov 27, 2024 • 41min

The Real ROI of Enterprise AI | HP, ServiceNow & Accenture

The “ROI of AI” has been marketed as a panacea, a near-magical solution to all business problems. Following that promise, many companies have invested heavily in AI over the past year and are now asking themselves, “What is the return on my AI investment?” This week on Chain of Thought, Galileo’s CEO, Vikram Chatterji joins Conor Bronsdon to discuss AI's value proposition, from the initial hype to the current search for tangible returns, offering insights into how businesses can identify the right AI use cases to maximize their investment. Next, we’re joined by a panel of AI experts to discuss the ROI of Enterprise AI, featuring Alex Klug, Head of Product, Data Science & AI at HP; Sriram Palapudi, Sr. Dir, ML Platform Engineering at ServiceNow; and Jay Subrahmonia, Global MD for AI Research & Products at Accenture. Together, they explore effective implementation strategies, how to measure the returns of AI adoption in the enterprise, and why AI's ROI isn't always just about the bottom line. Chapters: 00:00 Current State of AI Investments 03:59 Challenges and Solutions in AI Implementation 08:30 Identifying and Prioritizing AI Use Cases 10:53 Ensuring Trust and Explainability in AI 15:29 Measuring ROI and Efficiency Gains 21:10 Panel Discussion Begins 21:54 Trust and Risk Management at HP 23:27 Accenture's Approach to Operationalizing AI 26:06 ServiceNow's Trade-offs and Prioritization 31:17 Measuring the success of AI for customers 36:29 Frameworks and Best Practices 40:57 Conclusion and Final Thoughts Follow: Vikram Chatterji: ⁠https://www.linkedin.com/in/vikram-chatterji/ Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Alex Klug: https://www.linkedin.com/in/alex-klug-67ba3655/ Sriram Palapudi: https://www.linkedin.com/in/sriram-palapudi-11294b1/ Jay Subrahmonia: https://www.linkedin.com/in/jayashree-subrahmonia-99963a/ Show notes: Watch all of Productionize 2.0: ⁠⁠https://www.galileo.ai/genai-productionize-2-0⁠⁠

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode