Creating a Self-Service Batch Job Execution Service with Flotilla API
This chapter covers the open-sourcing of Flotilla, an API for managing batch-oriented tasks in data science departments. It highlights how Stitch Fix runs batch jobs with Spark and Docker on ECS, doing data wrangling in Spark and model training in Docker containers, and how Flotilla abstracts over ECS to streamline job execution.