AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Limits of Turbo GPD 3.5
It's important to notice the evolution of these benchmarks. I think we're getting better, but still observe their limitations when GPD-4 came out. It was tested on various AP exams and a lot of different licensing tests. There's some correlation there and there are some important correlations,. But certainly they are not as strong or reliable indicators as we might think.