AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Discussion on Stochastic Gradient Descent, Putting Foundation Models into Production, and Infrastructure Serving
This chapter discusses the concept of stochastic gradient descent in machine learning and the challenges associated with putting foundation models into production, including context limits, privacy, and rate limits. It also emphasizes the importance of understanding GPU infrastructure, model swap utilization, and the ability to batch for efficient model serving.