Latent Space: The AI Engineer Podcast cover image

MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML

Latent Space: The AI Engineer Podcast

00:00

Evaluating AI Models with Vibes Check and Human Eval Metrics

Using motion I train can help make big runs more predictable and give confidence in quality outcome/nVibes based eval is important and can't be underrated in monitoring and improving model training/nHuman eval is an automated evaluation metric and there are no humans involved despite the name/nOther metrics have confusing and strange names like 'hella swag'

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app