AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Exploring Outer and Inner Misalignment in AI Systems
Exploring the impact of outer and inner misalignment on AI systems through examples like training language models for truthful answers and aligning AI goals with specified objectives to avoid scenarios of misalignment in navigation and safety prioritization.