5min chapter

Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

The Performance of Pairwise Causal Discovery Benchmarks

In several types of questions from these benchmarks that were posed to the models, they did really well. And so I think that- if you could look at activations and things like that, when these questions are being evaluated, you might get some hint as to how they're grounded in the model or something. Yeah. Using kind of probing procedures if the latent representations in the model align with some kind of causal model or causal abstractions or causal assumptions is very fresh research. But my personal belief is that that's the type of analysis you want to run to understand exactly how it's reasoning if causally, if that's what its doing.

00:00

Transcript

Episode notes

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

5min chapter

Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Get the Snipdpodcast app

AI-poweredpodcast player

Discoverhighlights

Save anymoment

Share& Export

AI-poweredpodcast player

Discoverhighlights

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights