AWS Podcast cover image

#631: Amazon EC2 P5 instances, powered by the latest NVIDIA H100 Tensor Core GPUs

AWS Podcast

00:00

Engineering and Infrastructure of P5 Instances

This chapter discusses the engineering and infrastructure behind the P5 instances, powered by NVIDIA H100 Tensor Core GPUs, offered by Amazon EC2. They explain that the GPUs require careful engineering to ensure they work at scale and are consistent. P5 instances are specifically designed for training large language models or generative AI models, offering up to six times better performance and 40% cost savings compared to previous generations.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app