
Episode 01: Kelvin Guu, Google AI, on language models & overlooked research problems
Generally Intelligent
00:00
Building Models That Don't Work?
The heuristic is that people seem to always be trying to build models that have a shorter path between the input and the output. There are papers on this, but personally, I still don't have that intuition. The heuristics that I have are maybe more of a procedure for building models than a static thing. So it's a process where you try to start with a large model and make sure you can overfit. You're always checking for spurious correlations early on, when you're setting up the data.
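As a rough illustration of the "start large and make sure you can overfit" step, here is a minimal sketch in Python with NumPy. It is not the speaker's own code. The function name `train_loss_after_fit`, the random-feature model, and the toy data are all assumptions made for the example. The idea is only that a model with enough capacity should drive training loss to near zero on a handful of examples, and if it cannot, something in the setup is likely broken.

```python
import numpy as np


def train_loss_after_fit(x, y, hidden_width, seed=0):
    """Fit a random-feature model and return its training MSE.

    Hypothetical helper for illustration: a fixed random tanh hidden
    layer followed by an output layer solved with least squares.
    """
    rng = np.random.default_rng(seed)
    # Fixed random hidden layer (never trained).
    w = rng.normal(size=(x.shape[1], hidden_width))
    b = rng.normal(size=hidden_width)
    h = np.tanh(x @ w + b)
    # Output weights that minimise squared error on the training set.
    coef, *_ = np.linalg.lstsq(h, y, rcond=None)
    pred = h @ coef
    return float(np.mean((pred - y) ** 2))


# A tiny toy dataset with random targets: nothing to generalise from,
# so the only way to reach low loss is to memorise it.
rng = np.random.default_rng(42)
x_small = rng.normal(size=(8, 4))
y_small = rng.normal(size=8)

# More hidden units than examples: should be able to overfit.
large_loss = train_loss_after_fit(x_small, y_small, hidden_width=64)
# Far fewer hidden units than examples: should not be able to.
small_loss = train_loss_after_fit(x_small, y_small, hidden_width=2)

print(f"large model training loss: {large_loss:.3e}")
print(f"small model training loss: {small_loss:.3e}")
```

If the large model's training loss does not collapse toward zero on a batch this small, that usually points at a problem in the data pipeline or the fitting code rather than at model capacity.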