Test-Time Adaptation: the key to reasoning with DL (Mohamed Osman)

Machine Learning Street Talk (MLST)

CHAPTER

Advancements in AI Benchmarking

This chapter covers research and development at a newly formed AI lab, with emphasis on progress on the ARC benchmark and the challenges of moving to ARC v2. The discussion covers approaches to improving compositionality in language models and the value of maintaining legacy benchmarks for evaluating intelligence.

