How to Optimize the GPU for a Megatron-Style Workload?
GPUs have sprouted custom arithmetic hardware for deep learning. It's very arithmetically dense. What are the features of a future GPU that would be more optimal for a Megatron-style workload? Well, we're always working on the tensor cores themselves. So all the caches and various buffers inside of the chip that are moving data around have to work at peak efficiency in order to keep the tensor cores fed.
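To make "arithmetically dense" concrete, here is a rough sketch of how one might estimate the arithmetic intensity (FLOPs per byte moved) of a matrix multiply, the core operation in transformer training. It is not from the conversation. The function name `gemm_arithmetic_intensity` is hypothetical, and the sketch assumes an idealized case where each matrix crosses the memory interface exactly once and elements are 2 bytes (for example FP16).

```python
def gemm_arithmetic_intensity(m, n, k, bytes_per_element=2):
    """Estimate FLOPs per byte for C = A @ B, with A (m x k), B (k x n), C (m x n).

    Assumption: A and B are each read once and C is written once,
    i.e. perfect on-chip reuse. Real kernels move more data than this.
    """
    # Each of the m * n outputs needs k multiplies and k adds.
    flops = 2 * m * n * k
    # Idealized traffic: read A, read B, write C.
    bytes_moved = bytes_per_element * (m * k + k * n + m * n)
    return flops / bytes_moved


if __name__ == "__main__":
    # Large square matrix multiply: lots of math per byte moved.
    print(gemm_arithmetic_intensity(4096, 4096, 4096))
    # Matrix-vector product (n = 1): about one FLOP per byte moved.
    print(gemm_arithmetic_intensity(4096, 1, 4096))
```

Under these assumptions, a large square multiply performs on the order of a thousand FLOPs per byte, while a matrix-vector product performs about one. Achieving the high figure in practice depends on operands being reused out of on-chip storage, which is one way to read the remark about caches and buffers keeping the tensor cores fed.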