Sam Partee on Retrieval Augmented Generation (RAG)

The InfoQ Podcast

Challenges and Recommendations for Running Large Language Models On-Prem

This chapter explores the difficulties of dealing with phone companies and the benefits of running large language models on-prem using tools like NVIDIA Triton. The episode also mentions various companies and individuals who have contributed to training models and providing cloud services.
