
SE Radio 661: Sunil Mallya on Small Language Models
Software Engineering Radio - the podcast for professional software developers
00:00
Navigating Language Model Deployment
This chapter explores the considerations enterprises must weigh when choosing between small and large language models, particularly in managing unique enterprise data and ensuring optimal performance through effective data curation. It discusses the balance between deployment accuracy and cost, alongside the challenges of on-premises versus SaaS solutions, emphasizing compliance and data governance. The chapter also highlights strategies for improving model performance, such as fine-tuning and retrieval augmented generation, to address evolving user needs.
Transcript
Play full episode