AI Engineering Podcast cover image

AI Engineering Podcast

Latest episodes

undefined
Mar 16, 2025 • 56min

The Power of Community in AI Development with Oumi

Emmanouil (Manos) Koukoumidis, CEO of Oumi and former Google Cloud AI tech lead, talks about fostering community in AI development. He stresses the need for open-source models to promote collaboration and accessibility, likening Oumi's vision to 'the Linux of AI.' Manos shares insights on navigating the overwhelming choices in AI models and the importance of engaging a community for innovation. He also addresses gaps in AI accessibility and the need for standardization to empower both researchers and enterprises in their AI journeys.
undefined
Feb 26, 2025 • 31min

Arch Gateway: Add AI To Your Apps Without Custom Development

In this engaging discussion, Adil Hafiz, co-founder of Ardenimo and an expert with a rich engineering background at Microsoft and Amazon, sheds light on the Arch Gateway. This innovative tool simplifies AI integration for developers, allowing them to focus on core functions while bypassing complex AI specifics. He highlights the project's use of Rust and Envoy to enhance performance, discusses community feedback's crucial role, and outlines future aspirations for developing a leading planning model and improving AI agent interoperability.
undefined
Feb 16, 2025 • 54min

The Role Of Synthetic Data In Building Better AI Applications

Ali Golshan, Co-founder and CEO of Gretel.ai, dives into the fascinating world of synthetic data and its pivotal role in advancing AI applications. He discusses how synthetic data can enhance privacy while improving the quality and structural stability of datasets. The conversation highlights the shift from traditional data methods to the use of language models and the challenges of scaling synthetic data in production. Ali also explores its transformative applications in sectors like healthcare and finance, underscoring the importance of governance and ethical considerations.
undefined
Jan 22, 2025 • 1h 3min

Optimize Your AI Applications Automatically With The TensorZero LLM Gateway

Viraj Mehta, CTO and co-founder of TensorZero, shares insights on optimizing AI applications with their innovative LLM gateways. He discusses how these gateways standardize communication and manage interactions between applications and AI models. The conversation dives into sustainable AI optimization and the challenges of integrating structured data inputs. Viraj also highlights the role of user feedback in enhancing AI interactions, as well as the architectural innovations that improve efficiency and usability for developers.
undefined
9 snips
Dec 16, 2024 • 55min

Harnessing The Engine Of AI

Ron Green, co-founder and CTO of Kung Fu AI, dives into the evolving AI landscape and the complexities of generative AI engines. He discusses the limitations of large language models and the critical need for human oversight and robust data management. Ron highlights innovative methods like Retrieval-Augmented Generation and the significance of targeted, domain-specific AI solutions. He expresses optimism for AI's future, predicting major advancements in the next 20 years that integrate seamlessly into everyday applications.
undefined
Dec 1, 2024 • 54min

The Complex World of Generative AI Governance

Jim Olson, CTO of ModelOp, specializes in generative AI governance and regulations. He discusses the importance of monitoring and inventory for compliance in high-risk areas like healthcare. Olson emphasizes the need for technical controls to manage data governance and the continuous monitoring of AI models to detect issues. He addresses the balance between innovation and regulation, particularly in light of evolving EU regulations, and highlights the necessity of building trust through effective governance solutions.
undefined
8 snips
Nov 25, 2024 • 55min

Building Semantic Memory for AI With Cognee

Vasilije Markovich, a data engineer and AI specialist from Montenegro, discusses enhancing large language models with memory. He highlights the challenges of context window limitations and forgetting in LLMs, introducing hierarchical memory to improve performance. Vasilije dives into his creation, Cognee, which manages semantic memory, emphasizing its potential applications and the blend of cognitive science with data engineering. He shares insights from building an AI startup, the importance of user feedback, and future developments in open-source AI technology.
undefined
17 snips
Nov 22, 2024 • 53min

The Impact of Generative AI on Software Development

Tanner Burson, VP of Engineering at Prismatic, dives into the transformative effects of generative AI on software development. He discusses how AI is reshaping developer roles and productivity, fueled by tools like GitHub's Copilot. Tanner outlines both the opportunities and challenges AI presents, emphasizing the crucial need for human oversight to ensure code quality. He also explores the microunits of AI integration in workflows, the growing importance of mentorship, and the balance between innovation and practical engineering skills in an AI-driven future.
undefined
9 snips
Nov 11, 2024 • 1h 16min

ML Infrastructure Without The Ops: Simplifying The ML Developer Experience With Runhouse

Donnie Greenberg, Co-founder and CEO of Runhouse and former product lead for PyTorch at Meta, shares insights on simplifying machine learning infrastructure. He discusses the challenges of traditional MLOps tools and presents Runhouse's serverless approach that reduces complexity in moving from development to production. Greenberg emphasizes the importance of flexible, collaborative environments and innovative fault tolerance in ML workflows. He also touches on the need for integration with existing DevOps practices to meet the evolving demands of AI and ML.
undefined
11 snips
Nov 11, 2024 • 54min

Building AI Systems on Postgres: An Inside Look at pgai Vectorizer

Avthar Sewrathan, Head of AI at Timescale and expert in database infrastructure, shares insights into the innovative pgai Vectorizer toolchain. He reveals how this tool enables seamless management of AI workflows in Postgres, emphasizing the importance of keeping vector data updated. The discussion covers optimizing embedding strategies, the balance between user-friendliness and customization for developers, and the future of AI integration within databases. Avthar also touches on challenges in content moderation and semantic search, highlighting the need for continuous improvement and collaboration in the open-source community.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode