
AI Agents and Long Context Windows with Mark Huang
Software Huddle
Extending Context Window in AI Models
This chapter covers extending the context window of AI models with a curriculum learning approach and explores theta scaling for positional interpolation. It discusses developing high-parameter models with context windows beyond 8,000 tokens and the challenges that come with larger model sizes.
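As a rough illustration of theta scaling, the sketch below assumes the term refers to raising the base (theta) of rotary position embeddings (RoPE) so that rotation angles grow more slowly with position. The function names, the head dimension of 128, and both theta values are hypothetical choices for illustration, not figures from the episode.

```python
import math


def rope_inv_freq(head_dim, theta=10000.0):
    """Per-pair inverse frequencies theta^(-2i/d) used by rotary embeddings."""
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]


def rope_angles(position, head_dim, theta=10000.0):
    """Rotation angle applied to each dimension pair at a given token position."""
    return [position * f for f in rope_inv_freq(head_dim, theta)]


# Hypothetical setup: compare angles at position 8192 for two base values.
HEAD_DIM = 128
base_angles = rope_angles(8192, HEAD_DIM, theta=10_000.0)
scaled_angles = rope_angles(8192, HEAD_DIM, theta=1_000_000.0)

# A larger theta lowers every frequency except the first pair,
# so distant positions rotate less than they would with the smaller base.
ratio = scaled_angles[-1] / base_angles[-1]
print(f"slowest pair rotates {ratio:.4f}x as far with the larger theta")
```

In a curriculum-style setup, one might raise theta in stages while training on progressively longer sequences; the schedule itself is not specified in the summary above.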