BlueDot Narrated

Constitutional AI: Harmlessness from AI Feedback

Jan 4, 2025

01:01:49

Audio versions of blogs and papers from BlueDot courses.

This paper explains Anthropic’s constitutional AI approach, which is largely an extension of RLHF in which AI models replace the human demonstrators and human evaluators.

A podcast by BlueDot Impact.
