
AI Agents and Long Context Windows with Mark Huang
Software Huddle
00:00
Extending Context Window in AI Models
The chapter delves into the process of extending context window in AI models using a curriculum learning approach, exploring theta scaling for positional interpolation. It discusses the development of high-parameter models to achieve context windows beyond 8,000 tokens and the challenges associated with increasing model sizes.
Transcript
Play full episode