AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Exploring Trust, Performance, and Price in AI Evaluations
Exploring the interplay between trust, performance, and pricing in AI evaluations, discussing how trustable organizations differ from technically proficient ones, analyzing evaluation tools like LM's YS chatbot arena and Alpaca-Val, and highlighting the increasing costs and challenges faced by industry actors.