MLOps.community

On-Device AI Agents in Production: Privacy, Performance, and Scale // Varun Khare & Neeraj Poddar // #340

Sep 30, 2025
Varun Khare, Founder and CEO of NimbleEdge, and Neeraj Poddar, Co-founder and CTO, dive into the revolution of on-device AI agents. They discuss why now is the perfect time for this technology, highlighting the hurdles of overmarketing and platform diversity. The duo also explores practical capabilities of on-device models, optimized through lightweight runtimes, and the evolution of personalized multi-agent systems. They delve into the privacy advantages of on-device computing and the potential of AI-first apps to transform user experiences.
INSIGHT

On-Device AI Is Finally Approaching Reality

  • On-device AI became viable as runtimes and models matured, though rapid progress in AI keeps reshaping the stack.
  • Varun Khare says the ecosystem has reached a point where production on-device AI is close.
INSIGHT

Device Diversity Is The Core Technical Challenge

  • Device diversity (flagship phones to old Androids) makes on-device deployment hard.
  • Varun Khare explains the need to build the full stack to support that heterogeneity.
ADVICE

Give App Developers Easy AI Tooling

  • Bridge the gap between ML teams (Python) and mobile devs (Kotlin/Swift) with developer-focused tooling.
  • Neeraj Poddar urges creating ecosystems so app developers can integrate AI without rewriting in Python.