Join Noam Brown, an OpenAI expert in deep reinforcement learning known for his poker-playing AI, and Hunter Lightman, a developer of O1, as they dive into the groundbreaking O1 model. They discuss the blend of LLMs and reinforcement learning, revealing how O1 excels at math and coding challenges. Discover insights on problem-solving methods, iterative reasoning, and the surprising journey from doubt to confidence in AI. With exciting applications like the International Olympiad in Informatics and beyond, the future of reasoning in AI seems bright!