Do All Your AI Workloads Actually Require Expensive GPUs?

The New Stack Podcast

Axion applicability at the edge

Pranay and Andrei discuss running models on ARM-based hardware at the edge, model quantization, and examples of local LLMs on Android and Pixel devices.

Play episode from 21:52