#631: Amazon EC2 P5 instances, powered by the latest NVIDIA H100 Tensor Core GPUs

AWS Podcast

CHAPTER

Engineering and Infrastructure of P5 Instances

This chapter discusses the engineering and infrastructure behind Amazon EC2 P5 instances, which are powered by NVIDIA H100 Tensor Core GPUs. The speakers explain that the GPUs require careful engineering to work consistently at scale. P5 instances are designed specifically for training large language models and other generative AI models, offering up to six times better performance and 40% cost savings compared to previous generations.

