The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Exploring Causal Reasoning through Task-Specific Reinforcement Learning

This chapter explores the use of Reinforcement Learning from Human Feedback to enhance models’ understanding of causal reasoning. It critiques standard RLHF methods and advocates for more tailored approaches that align with specific task requirements over mere imitation of human responses.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app