

Asimov: Building An Omniscient RL Oracle with ReflectionAI’s Misha Laskin
117 snips Jul 17, 2025
Misha Laskin, Co-founder and CEO of ReflectionAI, is a former RL lead for Gemini and a passionate advocate for user-focused AI. He discusses their latest project, Asimov, a code comprehension tool designed around customer needs. Misha dives into the challenges of scaling reinforcement learning and the significance of collaborative memory in product development. He also shares insights on the future of artificial superintelligence and reflects on the implications of Google’s decision regarding Windsurf.
AI Snips
Chapters
Transcript
Episode notes
Focus Product and Eval on Users
- Win by tightly coupling product needs and model research in a focused domain rather than spreading efforts thin.
- Develop evaluations grounded in real customer problems to optimize specific capabilities that drive user value.
Integrate All System Components
- Couple product features, agent designs, and model training to create the best overall system.
- Avoid dependence on third party models by controlling multiple system components for competitive edge.
Complex Bugs Reveal Agent Strengths
- When some internal jobs ran five times slower, it was due to a subtle race condition between two teams' pull requests.
- Engineers often spend days diagnosing these complex, semantic issues across dispersed knowledge sources.