MLOps.community  cover image

Large Language Models in Production Round-table Conversation

MLOps.community

00:00

The Importance of Engineering Skills in MLP

Hard engineering skills are increasingly relevant to MLP and just to the productionization of AI going forward so yeah I just really want to drive that point how as well we need good engineers in this space on the inference side. Some of the inference being slow is indeed Diego it's the transformer's architecture, he says. So if you compare transformers to models for like recurrent neural networks like RNNs transformers are not naturally sequential so RNN decoding is going to be a lot faster than transformers which have multi-headed attention matrix computations.

Play episode from 35:57
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app