Interconnects cover image

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between

Interconnects

00:00

Evolving AI Evaluation Methods and User Interaction Insights

This chapter delves into the development of AI model evaluation techniques, focusing on the introduction of MTBench and its significance. It highlights the contrast between technical assessments and real-world applications, particularly in chatbot interactions within specific industries such as airline ticket sales.

Play episode from 36:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app