
LLMs on CPUs, Period
The Data Exchange with Ben Lorica
00:00
CPU Capability, AMD GPUs, and Multimodal Applications
This chapter covers the abundance of CPU capability in the cloud and its availability for inference, the increasing viability of AMD GPUs for inference, and potential interest in multimodal applications. It also discusses enterprises' focus on LLMs and the need to adapt the runtime to accommodate changes.