AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Performance of Large Language Models for Casual Inference
The models are getting good accuracy on at least for the pairwise and full-grads discovery, those types of benchmarks. We see these kind of failures a lot because they can't do D and E. The next step I think in your kind of analysis is, you know, so therefore these models can be used in these ways by practitioners as effective tools.