Ship It! Cloud, SRE, Platform Engineering

Latest episodes

Jan 11, 2023 • 1h 17min

The hard parts of platform engineering

Marcos Nils has been into platform engineering for the best part of the last decade. He helped architect & build developer platforms using VMs & OpenStack, containers with Docker, and even Kubernetes. He did this at startups with 10 people, as well as large, publicly traded companies with 1000+ software engineers. Today we talk with Marcos about the hard parts of platform engineering.
Join the discussion
Changelog++ members get a bonus 15 minutes at the end of this episode and zero ads. Join today!
Sponsors:
Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Fly.io – The home of Changelog.com. Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
Featuring:
Marcos Nils – GitHub, LinkedIn, Mastodon, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
Something missing or broken? PRs welcome!
Jan 5, 2023 • 1h 4min

Bare metal meets Talos Linux (the K8s OS)

Welcome to 2023! A new year is the perfect time to start with a fresh perspective. Given a few bare metal hosts with fast, local storage, how would you run your workloads on them? Would you cluster them for redundancy? What operating system would you choose? Steve Francis, CEO at Sidero Labs, and Andrew Rynhard, CTO at Sidero Labs, join us today to talk about running Talos Linux on bare metal.
Join the discussion
Changelog++ members save 2 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Fly.io – The home of Changelog.com. Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
Practical AI – Making artificial intelligence practical, productive & accessible to everyone!
Featuring:
Andrew Rynhard – GitHub, LinkedIn, X
Steve Francis – GitHub, LinkedIn
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
Talos Linux
🎬 Hack Sesh: K8s@Home Edition - April, 2022
🎬 DevOps Toolkit - Talos: Linux Designed For Kubernetes - March, 2022
🎬 Sidero Metal Webinar - January, 2022
🎬 I am Installing Talos OS (Sidero) at Home - October, 2021
Something missing or broken? PRs welcome!
Dec 15, 2022 • 1h 5min

🎄 Planning for failure to ship faster 🎁

Eight months ago, in 🎧 episode 49, Alex Sims (Solutions Architect & Senior Software Engineer at James & James) shared with us his ambition to help migrate a monolithic PHP app running on AWS EC2 to a more modern architecture. The idea was some serverless, some EKS, and many incremental improvements. So how did all of this work out in practice? How did the improved system cope with the Black Friday peak, as well as all the following Christmas orders? Thank you Alex for sharing with us your Ship It! inspired Kaizen story. It’s a wonderful Christmas present! 🎄🎁
Join the discussion
Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Sourcegraph – Transform your code into a queryable database to create customizable visual dashboards in seconds. Sourcegraph recently launched Code Insights; now you can track what really matters to you and your team in your codebase. See how other teams are using this awesome feature at about.sourcegraph.com/code-insights
Raygun – Never miss another mission-critical issue again: Raygun Alerting is now available for Crash Reporting and Real User Monitoring, to make sure you are quickly notified of the errors, crashes, and front-end performance issues that matter most to you and your business. Set thresholds for your alert based on an increase in error count, a spike in load time, or new issues introduced in the latest deployment. Start your free 14-day trial at Raygun.com
Featuring:
Alex Sims – LinkedIn, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
Something missing or broken? PRs welcome!
Dec 8, 2022 • 1h 8min

Red Hat's approach to SRE

Narayanan Raghavan leads the global SRE organization that runs Red Hat managed cloud services including OpenShift Dedicated, Azure Red Hat OpenShift, Red Hat OpenShift Service on AWS, and Red Hat OpenShift Data Science among others across the three major cloud providers: AWS, GCP & Azure. We start with a high-level discussion about DevOps, SRE & platform engineering, and then we dig into SRE specifics, including what it takes to safely roll out updates across many tens of thousands of OpenShift clusters.
Join the discussion
Changelog++ members save 6 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Sourcegraph – Transform your code into a queryable database to create customizable visual dashboards in seconds. Sourcegraph recently launched Code Insights; now you can track what really matters to you and your team in your codebase. See how other teams are using this awesome feature at about.sourcegraph.com/code-insights
Raygun – Never miss another mission-critical issue again: Raygun Alerting is now available for Crash Reporting and Real User Monitoring, to make sure you are quickly notified of the errors, crashes, and front-end performance issues that matter most to you and your business. Set thresholds for your alert based on an increase in error count, a spike in load time, or new issues introduced in the latest deployment. Start your free 14-day trial at Raygun.com
Retool – The low-code platform for developers to build internal tools. Some of the best teams out there trust Retool: Brex, Coinbase, Plaid, Doordash, LegalGenius, Amazon, Allbirds, Peloton, and so many more. The developers at these teams trust Retool as the platform to build their internal tools. Try it free at retool.com/changelog
Featuring:
Narayanan Raghavan – GitHub, LinkedIn, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
💡 Red Hat’s approach to site reliability engineering (SRE)
🎬 Kubernetes vs. OpenShift
🎬 Hands-on demo of Red Hat OpenShift Dedicated
Something missing or broken? PRs welcome!
Dec 1, 2022 • 1h 7min

Let's deploy straight to production!

In today’s episode, we have the pleasure of two guests: Whitney Lee, Staff Technical Advocate at VMware, the one behind the ⚡️ Enlightning episodes, and Mauricio Salatino, whom you already know from 🎧 shipit.show/41 on Continuous Delivery for Kubernetes. The two of them gave the most amazing KubeCon NA Keynote last month: What a RUSH! Let’s Deploy Straight to Production! So how do we create an Internal Development Platform that enables anyone on the team to deploy straight to production with the confidence that everything will just work?
Join the discussion
Changelog++ members save 6 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Sourcegraph – Transform your code into a queryable database to create customizable visual dashboards in seconds. Sourcegraph recently launched Code Insights; now you can track what really matters to you and your team in your codebase. See how other teams are using this awesome feature at about.sourcegraph.com/code-insights
FireHydrant – The reliability platform for every developer. Incidents impact everyone, not just SREs. FireHydrant gives teams the tools to maintain service catalogs, respond to incidents, communicate through status pages, and learn with retrospectives. Small teams up to 10 people can get started for free with all FireHydrant features included. No credit card required to sign up. Learn more at firehydrant.com/
Sentry – Working code means happy customers. That’s exactly why teams choose Sentry. From error tracking to performance monitoring, Sentry helps teams see what actually matters, resolve problems quicker, and learn continuously about their applications - from the frontend to the backend. Use the code CHANGELOG and get the team plan free for three months.
Featuring:
Mauricio Salatino – Website, GitHub, LinkedIn, X
Whitney Lee – GitHub, LinkedIn, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
🎬 What a RUSH! Let’s Deploy Straight to Production! + 🗺 SLIDES + 🐙 CODE - KubeCon NA 2022 Keynote
⚡️ Enlightning - learn cloud concepts with Whitney & her lightboard - every Tuesday at 11am ET
📃 Building platforms on top of Kubernetes: vcluster and Crossplane - Mauricio Salatino - August 2022
📖 Continuous Delivery for Kubernetes
Something missing or broken? PRs welcome!
Nov 23, 2022 • 1h 9min

Kaizen! 24 improvements & a lot more

For our last 2022 Kaizen episode, we went all out:
💪 @jerod outdid himself in the number of improvements shipped between Kaizens
🕺 A few of our listeners contributed → prompted us to create a new contributing guide
🗺 We now have a new infrastructure diagram
All of this, and a whole lot more, is captured as GitHub discussion 🐙 changelog.com#433. If you want to see everything that we improved, that is a great companion to this episode.
Join the discussion
Changelog++ members save 5 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Sentry – Working code means happy customers. That’s exactly why teams choose Sentry. From error tracking to performance monitoring, Sentry helps teams see what actually matters, resolve problems quicker, and learn continuously about their applications - from the frontend to the backend. Use the code CHANGELOG and get the team plan free for three months.
FireHydrant – The reliability platform for every developer. Incidents impact everyone, not just SREs. FireHydrant gives teams the tools to maintain service catalogs, respond to incidents, communicate through status pages, and learn with retrospectives. Small teams up to 10 people can get started for free with all FireHydrant features included. No credit card required to sign up. Learn more at firehydrant.com/
Sourcegraph – Transform your code into a queryable database to create customizable visual dashboards in seconds. Sourcegraph recently launched Code Insights; now you can track what really matters to you and your team in your codebase. See how other teams are using this awesome feature at about.sourcegraph.com/code-insights
Featuring:
Adam Stacoviak – Website, GitHub, LinkedIn, Mastodon, X
Jerod Santo – GitHub, LinkedIn, Mastodon, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
All episode notes are in GitHub discussion changelog.com#433. Feel free to add your thoughts / questions!
Something missing or broken? PRs welcome!
Nov 18, 2022 • 1h 5min

Developer Experience Infrastructure (DXI)

In your company, who designs the end-to-end developer experience? From design to implementation, what is the developer experience that you actually ship? Even though the average developer wastes almost half of their working hours because of bad DX, many of us don’t even know what that means, or how to improve it. Kenneth Auchenberg is working at Stripe, building economic infrastructure for the internet. Gerhard found his perspective on Developer Experience Infrastructure (DXI) refreshingly simple, as well as very useful.
Join the discussion
Changelog++ members save 2 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
FireHydrant – The reliability platform for every developer. Incidents impact everyone, not just SREs. FireHydrant gives teams the tools to maintain service catalogs, respond to incidents, communicate through status pages, and learn with retrospectives. Small teams up to 10 people can get started for free with all FireHydrant features included. No credit card required to sign up. Learn more at firehydrant.com/
Sentry – Working code means happy customers. That’s exactly why teams choose Sentry. From error tracking to performance monitoring, Sentry helps teams see what actually matters, resolve problems quicker, and learn continuously about their applications - from the frontend to the backend. Use the code CHANGELOG and get the team plan free for three months.
Featuring:
Kenneth Auchenberg – Website, GitHub, LinkedIn, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
The tweet that started this episode
📝 Developer Experience Infrastructure (DXI)
The Radiating Circles of DX Architecture - How to design an end to end developer journey - Shawn “swyx” Wang
Developer Experience is the new hot thing, and we now see many existing teams re-labeled as Dev Experience. What does DX really mean, and what is the relationship between developer relations, advocacy, DX, and product teams? @auchenberg Twitter 🧵
Something missing or broken? PRs welcome!
Nov 9, 2022 • 1h 2min

The system that runs Norway's welfare payments 🇳🇴

In today’s episode we have the pleasure of Audun Fauchald Strand, Principal Software Engineer at NAV.no, Norway’s Labour & Welfare Administration. We will be talking about NAIS.io, the application platform that runs on-prem, as well as on the public cloud. Imagine hundreds of developers shipping 300 changes on an average day into a system which processes $100,000,000 worth of transactions on a quiet week. If you think this is hard, consider the context: a government institution which must comply with all laws & regulations.
Join the discussion
Changelog++ members save 5 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Sourcegraph – Transform your code into a queryable database to create customizable visual dashboards in seconds. Sourcegraph recently launched Code Insights; now you can track what really matters to you and your team in your codebase. See how other teams are using this awesome feature at about.sourcegraph.com/code-insights
Raygun – Never miss another mission-critical issue again: Raygun Alerting is now available for Crash Reporting and Real User Monitoring, to make sure you are quickly notified of the errors, crashes, and front-end performance issues that matter most to you and your business. Set thresholds for your alert based on an increase in error count, a spike in load time, or new issues introduced in the latest deployment. Start your free 14-day trial at Raygun.com
Featuring:
Audun Fauchald Strand – Website, GitHub, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
NAIS.io - Application platform and DevEx toolbox for teams digitalizing NAV.no
🗂 docs.nais.io - references, step-by-step guides & some good YAML
📊 NAV.no deployment stats, 2009 - 2022
🗺 NAV Teknisk retning (technical direction)
Being NAIS at a distance - How we work in a hybrid world
Do we need an internal technology platform? - The case for platforms at NAV
Changing Service Mesh - How we swapped Istio with Linkerd with hardly any downtime
NAIS @ GitHub
📃 How to Optimize for Fast Flow Using Alignment and Autonomy
🐦 Do you know what’s cool? Keeping your #kubernetes clusters secure.
NAV.no average weekly service deployments, 2009 - 2022
Something missing or broken? PRs welcome!
Nov 3, 2022 • 59min

Seven shipping principles

David Heinemeier Hansson, creator of Ruby on Rails, discusses the seven shipping principles in software development. Topics include writing clarity, product refinement vs timely releases, managing email systems in Hey, advancements in computing technology, and the consequences of mistakes.
Oct 27, 2022 • 1h 3min

Container base images with glibc & musl

In today’s episode, we talk about distroless, ko, apko, melange, musl and glibc. The context is Wolfi OS, a community Linux OS designed for the container and cloud-native era. If you are looking for the lightest possible container base image with 0 CVEs and both glibc and musl support, Wolfi OS & the related chainguard-images are worth checking out. Ariadne Conill is an Alpine Linux TSC member & Software Engineer at Chainguard.
Join the discussion
Changelog++ members save 5 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Sentry – Working code means happy customers. That’s exactly why teams choose Sentry. From error tracking to performance monitoring, Sentry helps teams see what actually matters, resolve problems quicker, and learn continuously about their applications - from the frontend to the backend. Use the code CHANGELOG and get the team plan free for three months.
FireHydrant – The reliability platform for every developer. Incidents impact everyone, not just SREs. FireHydrant gives teams the tools to maintain service catalogs, respond to incidents, communicate through status pages, and learn with retrospectives. Small teams up to 10 people can get started for free with all FireHydrant features included. No credit card required to sign up. Learn more at firehydrant.com/
Sourcegraph – Transform your code into a queryable database to create customizable visual dashboards in seconds. Sourcegraph recently launched Code Insights; now you can track what really matters to you and your team in your codebase. See how other teams are using this awesome feature at about.sourcegraph.com/code-insights
Featuring:
Ariadne Conill – Website, GitHub, LinkedIn, X
Gerhard Lazu – Website, GitHub, LinkedIn, X
Show Notes:
Ariadne’s Twitter thread that kick-started this episode
Wolfi OS - a stripped-down distro designed for the cloud-native era
Minimal Container images from Chainguard
ko - build and deploy Go applications on Kubernetes
apko Overview - build OCI images using APK directly
melange - build APKs from source code
musl libc
glibc - GNU C library
“We discovered a bug in our repository management service”
Something missing or broken? PRs welcome!
