AI Engineering Podcast

Tobias Macey

This show is your guidebook to building scalable and maintainable AI systems. You will learn how to architect AI applications, apply AI to your work, and the considerations involved in building or customizing new models. Everything that you need to know to deliver real impact and value with machine learning and artificial intelligence.

Episodes

Mentioned books

Apr 21, 2025 • 1h 12min

Understanding The Operational And Organizational Challenges Of Agentic AI

Julian LaNeve, CTO of Astronomer, shares his expertise on the transition from simple LLMs to complex agentic AI systems. He stresses the importance of starting with easy applications to build foundational knowledge. The discussion delves into orchestrating AI workflows using directed acyclic graphs and highlights the necessity of robust data management. Julian also addresses the challenges of reliability and observability in AI, urging teams to thoughtfully evaluate their operational readiness and investment decisions in this dynamic field.

Mar 16, 2025 • 56min

The Power of Community in AI Development with Oumi

Emmanouil (Manos) Koukoumidis, CEO of Oumi and former Google Cloud AI tech lead, talks about fostering community in AI development. He stresses the need for open-source models to promote collaboration and accessibility, likening Oumi's vision to 'the Linux of AI.' Manos shares insights on navigating the overwhelming choices in AI models and the importance of engaging a community for innovation. He also addresses gaps in AI accessibility and the need for standardization to empower both researchers and enterprises in their AI journeys.

Feb 26, 2025 • 31min

Arch Gateway: Add AI To Your Apps Without Custom Development

In this engaging discussion, Adil Hafiz, co-founder of Ardenimo and an expert with a rich engineering background at Microsoft and Amazon, sheds light on the Arch Gateway. This innovative tool simplifies AI integration for developers, allowing them to focus on core functions while bypassing complex AI specifics. He highlights the project's use of Rust and Envoy to enhance performance, discusses community feedback's crucial role, and outlines future aspirations for developing a leading planning model and improving AI agent interoperability.

Feb 16, 2025 • 54min

The Role Of Synthetic Data In Building Better AI Applications

Ali Golshan, Co-founder and CEO of Gretel.ai, dives into the fascinating world of synthetic data and its pivotal role in advancing AI applications. He discusses how synthetic data can enhance privacy while improving the quality and structural stability of datasets. The conversation highlights the shift from traditional data methods to the use of language models and the challenges of scaling synthetic data in production. Ali also explores its transformative applications in sectors like healthcare and finance, underscoring the importance of governance and ethical considerations.

Jan 22, 2025 • 1h 3min

Optimize Your AI Applications Automatically With The TensorZero LLM Gateway

Viraj Mehta, CTO and co-founder of TensorZero, shares insights on optimizing AI applications with their innovative LLM gateways. He discusses how these gateways standardize communication and manage interactions between applications and AI models. The conversation dives into sustainable AI optimization and the challenges of integrating structured data inputs. Viraj also highlights the role of user feedback in enhancing AI interactions, as well as the architectural innovations that improve efficiency and usability for developers.

Dec 16, 2024 • 55min

Harnessing The Engine Of AI

Ron Green, co-founder and CTO of Kung Fu AI, dives into the evolving AI landscape and the complexities of generative AI engines. He discusses the limitations of large language models and the critical need for human oversight and robust data management. Ron highlights innovative methods like Retrieval-Augmented Generation and the significance of targeted, domain-specific AI solutions. He expresses optimism for AI's future, predicting major advancements in the next 20 years that integrate seamlessly into everyday applications.

Dec 1, 2024 • 54min

The Complex World of Generative AI Governance

Jim Olson, CTO of ModelOp, specializes in generative AI governance and regulations. He discusses the importance of monitoring and inventory for compliance in high-risk areas like healthcare. Olson emphasizes the need for technical controls to manage data governance and the continuous monitoring of AI models to detect issues. He addresses the balance between innovation and regulation, particularly in light of evolving EU regulations, and highlights the necessity of building trust through effective governance solutions.

Nov 25, 2024 • 55min

Building Semantic Memory for AI With Cognee

Vasilije Markovich, a data engineer and AI specialist from Montenegro, discusses enhancing large language models with memory. He highlights the challenges of context window limitations and forgetting in LLMs, introducing hierarchical memory to improve performance. Vasilije dives into his creation, Cognee, which manages semantic memory, emphasizing its potential applications and the blend of cognitive science with data engineering. He shares insights from building an AI startup, the importance of user feedback, and future developments in open-source AI technology.

Nov 22, 2024 • 53min

The Impact of Generative AI on Software Development

Tanner Burson, VP of Engineering at Prismatic, dives into the transformative effects of generative AI on software development. He discusses how AI is reshaping developer roles and productivity, fueled by tools like GitHub's Copilot. Tanner outlines both the opportunities and challenges AI presents, emphasizing the crucial need for human oversight to ensure code quality. He also explores the microunits of AI integration in workflows, the growing importance of mentorship, and the balance between innovation and practical engineering skills in an AI-driven future.

Nov 11, 2024 • 1h 16min

ML Infrastructure Without The Ops: Simplifying The ML Developer Experience With Runhouse

Donnie Greenberg, Co-founder and CEO of Runhouse and former product lead for PyTorch at Meta, shares insights on simplifying machine learning infrastructure. He discusses the challenges of traditional MLOps tools and presents Runhouse's serverless approach that reduces complexity in moving from development to production. Greenberg emphasizes the importance of flexible, collaborative environments and innovative fault tolerance in ML workflows. He also touches on the need for integration with existing DevOps practices to meet the evolving demands of AI and ML.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner