AI Engineering Podcast

Tobias Macey
undefined
Feb 16, 2025 • 54min

The Role Of Synthetic Data In Building Better AI Applications

Ali Golshan, Co-founder and CEO of Gretel.ai, dives into the fascinating world of synthetic data and its pivotal role in advancing AI applications. He discusses how synthetic data can enhance privacy while improving the quality and structural stability of datasets. The conversation highlights the shift from traditional data methods to the use of language models and the challenges of scaling synthetic data in production. Ali also explores its transformative applications in sectors like healthcare and finance, underscoring the importance of governance and ethical considerations.
undefined
Jan 22, 2025 • 1h 3min

Optimize Your AI Applications Automatically With The TensorZero LLM Gateway

Viraj Mehta, CTO and co-founder of TensorZero, shares insights on optimizing AI applications with their innovative LLM gateways. He discusses how these gateways standardize communication and manage interactions between applications and AI models. The conversation dives into sustainable AI optimization and the challenges of integrating structured data inputs. Viraj also highlights the role of user feedback in enhancing AI interactions, as well as the architectural innovations that improve efficiency and usability for developers.
undefined
10 snips
Dec 16, 2024 • 55min

Harnessing The Engine Of AI

Ron Green, co-founder and CTO of Kung Fu AI, dives into the evolving AI landscape and the complexities of generative AI engines. He discusses the limitations of large language models and the critical need for human oversight and robust data management. Ron highlights innovative methods like Retrieval-Augmented Generation and the significance of targeted, domain-specific AI solutions. He expresses optimism for AI's future, predicting major advancements in the next 20 years that integrate seamlessly into everyday applications.
undefined
Dec 1, 2024 • 54min

The Complex World of Generative AI Governance

Jim Olson, CTO of ModelOp, specializes in generative AI governance and regulations. He discusses the importance of monitoring and inventory for compliance in high-risk areas like healthcare. Olson emphasizes the need for technical controls to manage data governance and the continuous monitoring of AI models to detect issues. He addresses the balance between innovation and regulation, particularly in light of evolving EU regulations, and highlights the necessity of building trust through effective governance solutions.
undefined
8 snips
Nov 25, 2024 • 55min

Building Semantic Memory for AI With Cognee

Vasilije Markovich, a data engineer and AI specialist from Montenegro, discusses enhancing large language models with memory. He highlights the challenges of context window limitations and forgetting in LLMs, introducing hierarchical memory to improve performance. Vasilije dives into his creation, Cognee, which manages semantic memory, emphasizing its potential applications and the blend of cognitive science with data engineering. He shares insights from building an AI startup, the importance of user feedback, and future developments in open-source AI technology.
undefined
19 snips
Nov 22, 2024 • 53min

The Impact of Generative AI on Software Development

Tanner Burson, VP of Engineering at Prismatic, dives into the transformative effects of generative AI on software development. He discusses how AI is reshaping developer roles and productivity, fueled by tools like GitHub's Copilot. Tanner outlines both the opportunities and challenges AI presents, emphasizing the crucial need for human oversight to ensure code quality. He also explores the microunits of AI integration in workflows, the growing importance of mentorship, and the balance between innovation and practical engineering skills in an AI-driven future.
undefined
9 snips
Nov 11, 2024 • 1h 16min

ML Infrastructure Without The Ops: Simplifying The ML Developer Experience With Runhouse

Donnie Greenberg, Co-founder and CEO of Runhouse and former product lead for PyTorch at Meta, shares insights on simplifying machine learning infrastructure. He discusses the challenges of traditional MLOps tools and presents Runhouse's serverless approach that reduces complexity in moving from development to production. Greenberg emphasizes the importance of flexible, collaborative environments and innovative fault tolerance in ML workflows. He also touches on the need for integration with existing DevOps practices to meet the evolving demands of AI and ML.
undefined
11 snips
Nov 11, 2024 • 54min

Building AI Systems on Postgres: An Inside Look at pgai Vectorizer

Avthar Sewrathan, Head of AI at Timescale and expert in database infrastructure, shares insights into the innovative pgai Vectorizer toolchain. He reveals how this tool enables seamless management of AI workflows in Postgres, emphasizing the importance of keeping vector data updated. The discussion covers optimizing embedding strategies, the balance between user-friendliness and customization for developers, and the future of AI integration within databases. Avthar also touches on challenges in content moderation and semantic search, highlighting the need for continuous improvement and collaboration in the open-source community.
undefined
31 snips
Oct 28, 2024 • 58min

Running Generative AI Models In Production

Philip Kiely, an AI infrastructure expert at BaseTen, dives into the complexities of running generative AI models in production. He shares insights on the importance of selecting the right model based on product requirements and discusses key deployment strategies, including architecture and performance monitoring. Challenges like model quantization and the balance between open-source and proprietary models are explored. Philip also highlights future trends such as local inference, emphasizing the need for compliance in sectors like healthcare.
undefined
95 snips
Sep 10, 2024 • 59min

Enhancing AI Retrieval with Knowledge Graphs: A Deep Dive into GraphRAG

Philip Rathle, CTO of Neo4J and an expert in knowledge graphs, dives deep into how GraphRAG revolutionizes AI retrieval systems. He explains how this innovative method blends knowledge graphs with vector similarity for clearer, more accurate AI outputs. Rathle discusses the technical aspects of data modeling and the importance of structured data in addressing traditional retrieval challenges. The conversation also touches on real-world applications of GraphRAG across various industries, highlighting its potential to transform AI interactions.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app