
OpenAI, AGI, LLMs Eval & Applied ML with Reah Miyara #47
AI Stories
Transitioning to Model Evaluation at OpenAI
This chapter explores the speaker's transition from a leadership role in machine learning observability to becoming part of OpenAI's LLM evaluation team. It discusses the complexities of evaluating large language models, the evolving benchmarks for performance, and the significance of actionable insights derived from evaluation metrics. The chapter also reflects on the future of Generative AI and the potential impact of Artificial General Intelligence in various fields.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.