
Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Exploring Causal Reasoning through Task-Specific Reinforcement Learning
This chapter explores the use of Reinforcement Learning from Human Feedback (RLHF) to improve models’ causal reasoning. It critiques standard RLHF methods and argues for approaches tailored to specific task requirements rather than mere imitation of human responses.