Papers Read on AI

ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code

Jul 8, 2024