MLOps.community  cover image

On-Device AI Agents in Production: Privacy, Performance, and Scale // Varun Khare & Neeraj Poddar // #340

MLOps.community

00:00

What can current on-device models do and how are they optimized?

Varun details feasible on-device capabilities and software optimizations like sparsity and faster runtimes.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app