AI Safety Newsletter cover image

AISN #48: Utility Engineering and EnigmaEval

AI Safety Newsletter

00:00

Exploring Utility Engineering and New Benchmarks in AI

This chapter explores the emerging field of utility engineering and introduces the Enigma Evil benchmark, which assesses the creative problem-solving capabilities of AI. It highlights recent findings that suggest large language models have structured preferences, challenging the traditional view of them as merely passive tools.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app