This chapter examines the capabilities and limitations of language models: they can generate detailed plans for high-level tasks such as travel, but they struggle with low-level physical actions. The conversation highlights the need for joint embedding spaces in robotics for interacting with the physical world, and emphasizes the role that learned plans play in human action.