

PurePerformance
The brutal truth about digital performance engineering and operations.
Andreas (aka Andi) Grabner and Brian Wilson are veterans of the digital performance world. Combined, they have seen too many applications not scaling and performing up to expectations. With more rapid deployment models made possible through continuous delivery and a mentality shift sparked by DevOps, they feel it’s time to share their stories. In each episode, they and their guests discuss different topics concerning performance, ranging from common performance problems for specific technology platforms to best practices in developing, testing, deploying and monitoring software performance and user experience. Be prepared to learn a lot about metrics.
Andi & Brian both work at Dynatrace, where they get to witness more real world customer performance issues than they can TPS report at.
Episodes

Mar 27, 2023 • 54min
“You Build It, You Run It Doesn’t Scale!” with Luca Galante
The famous tagline from Werner Vogels in 2006 is still used in many presentations promoting DevOps and the autonomy of development teams. But how long does, and did, this really scale?
According to our guest Luca Galante, Head of Product at Humanitec, organizations that reach 50-100 engineers start experiencing the first bottlenecks. After initial workarounds, which sometimes lead to Shadow Ops, this is the point where organizations look into building Internal Developer Platforms (IDP). This is where Platform Engineering is born, providing “Golden Paths around DevOps & SRE” as a self-service to engineering teams.
Tune in and learn more about the emerging practice of platform engineering, why it has already attracted more than 11000 global community members, why it has an annual dedicated conference and why global analysts are putting Platform Engineering in the Top Trends of 2023!
We referenced a lot of material in our discussion. Here are all the promised links:
What is Platform Engineering: https://platformengineering.org/blog/what-is-platform-engineering
Platform Engineering Community: https://platformengineering.org
PlatformCon: https://platformcon.com/
Platform Weekly: https://platformweekly.com/
Follow Luca on Twitter: https://twitter.com/luca_cloud
Connect with Luca on LinkedIn: https://www.linkedin.com/in/luca-galante/

Mar 13, 2023 • 54min
Don’t look away from the next cyber security threat with Stefan Achleitner
While Spring4Shell, ransomware and attacks on critical infrastructure were the most severe attacks in 2022, the evolving trends in 2023 revolve around the rising power of AI, the complexity and therefore misconfiguration of cloud native stacks, and social engineering challenges as part of the post-pandemic shift back towards the office.
Tune in and learn from Stefan Achleitner, Lead Researcher Cloud Native Security at Dynatrace, about getting better at securing the software supply chain, understanding the impact of attacks and vulnerabilities, and why nobody should look away when it comes to detecting and preventing cyber security threats.

Feb 27, 2023 • 53min
Is The Practice of Practice the better Gameday with Matt Davis
How do you prepare yourself for the next incident? Not at all? Are you running game days where you simulate incidents? Or are you following in the steps of good musicians, who constantly practice with their band members to always be best prepared for the next big gig?
Tune in and hear from Matt Davis, Specialist in Learning from Incidents, how he runs weekly continuous practice and learning sessions with DevOps engineers, SREs, developers, marketers and technical writers, and what the outcomes are.
Matt is a regular presenter at conferences. You can meet him at SRECon Americas 2023, where he talks about “Human Observability of Incident Response”.
Here are the other links we discussed during the podcast:
Practice of Practice
Rivers of Opposites
Varieties of Work
Follow Matt on Twitter
Connect on LinkedIn

Feb 13, 2023 • 49min
OpenTelemetry for the Mainframe and more with Christian Schram
Did you know that almost 60 years after IBM presented the mainframe, 92 of the world’s top 100 banks run mainframes, handling 90% of all credit card transactions? We didn’t either until we recorded this episode with Christian Schram, Solutions Engineer at Dynatrace, who has spent the last 20+ years helping organizations optimize their mainframe environments.
Tune in and learn about the mainframe, how the cloud native project OpenTelemetry has made it to the mainframe and what the most common performance patterns on the mainframe are.
As discussed, check out the following links in case you want to learn more:
A Brief History of the Mainframe World (Blog)
Modernizing the Mainframe (YouTube)
Eliminating inefficiencies on IBM Z (Blog)
End-2-End IBM Z transactional visibility (Blog)

Jan 30, 2023 • 49min
How not to get Kubernetes cluster hijacked with Nico Meisenzahl
Did you know that 53% of security related issues on Kubernetes are caused by misconfiguration? Me neither!
To raise awareness of how to protect your Kubernetes cluster and workloads from being hijacked, we invited Nico Meisenzahl, Microsoft MVP and GitLab Hero, to walk us through a set of best practices that everyone in cloud native should know in order to contribute to a more secure cloud native environment. In our conversation we cover a lot of what Nico has shown in his recent talks at different container, cloud native and security related conferences.
Make sure you check out Nico’s slides, GitHub tutorials and recordings through these links:
Nico’s Website: https://meisenzahl.org/
Hijack a Kubernetes Cluster YouTube: https://www.youtube.com/watch?v=9wc34MozKok
Hijack a Kubernetes Cluster Slides: https://www.slideshare.net/nmeisenzahl/containerconf-2022-hijack-kubernetes
Hijack a Kubernetes Cluster GitHub Tutorial: https://github.com/nmeisenzahl/hijack-kubernetes
Connect with him on LinkedIn: https://www.linkedin.com/in/nicomeisenzahl/
Follow him on Twitter: https://twitter.com/nmeisenzahl
If you want to hear more from Nico, listen until the end and pick one of the suggested topics.

Jan 16, 2023 • 50min
Learning from Incidents is what good SREs do with Laura Nolan
Incidents happen! According to Laura Nolan, who was an SRE at Google and Slack, healthy organizations should take proper time to analyze and learn from them. This will improve future incident response as well as overall system resiliency.
Tune in to this episode and hear Laura’s tips & tricks on what makes a good SRE organization. It starts with doing good write-ups of incidents and doing your research on incident reports of software and services that you are looking into using. We also spent a good amount of time discussing root cause analysis, where she highlighted an incident that happened during her time at Google and what she learned about outdated alerting.
Thanks Laura for a great discussion and lots of insights.
Here are the additional links we discussed during the podcast:
Laura on LinkedIn: https://www.linkedin.com/in/laura-nolan-bb7429/
Laura on Twitter: https://twitter.com/lauralifts
Incident Template talk @ SRECon: https://www.usenix.org/conference/srecon22emea/presentation/nolan-break
What SRE could be talk @ SRECon: https://www.usenix.org/conference/srecon22emea/presentation/nolan-sre
Howie Post-Incident Guide: https://www.jeli.io/howie/welcome
My philosophy on Alerting article: https://docs.google.com/document/d/199PqyG3UsyXlwieHaqbGiWVa8eMWi8zzAn0YfcApr8Q/edit

Jan 1, 2023 • 42min
What happened in 2022 and where 2023 is taking us!
What a year 2022 was! We had 25! episodes with amazing guests from all over the world covering topics from Kubernetes, OpenTelemetry, DevOps, SRE, Cloud Migrations, DNS and Value Streams all the way to Persona Driven Engineering and drawing parallels with Digital Marketing. If you are new to our podcast, check out the playlist and listen to some of those we mentioned during our episode!
Now it’s time to say Thank You to our listeners for the continued support. After 5+ years of podcasting we still see rising numbers of downloads, which is the best motivation for us to keep going. Stay tuned as we are going to cover industry relevant topics going into 2023, or is it year 53? (Only those who listen to the full episode will know.)

Dec 19, 2022 • 53min
Building the right thing: Learning from digital marketing expert Bernhard Dominguez
“If I wouldn’t measure it I wouldn’t know it!” or “Build, Measure, Learn!”
These quotes could be from any engineer building new digital services, observing them in production and, based on that, learning how to improve their software.
They are, however, from Bernhard Dominguez, Digital Consultant at FACTOR, whom we invited to the show. Bernhard highlights a lot of parallels between his work planning and executing digital marketing strategies and the world we live in: designing, operating and optimizing complex software systems.
Tune in and learn how important it is to understand your real target groups (=end users), how to define clear goals (=SLOs), how to change from campaign to funnel activities (=User Journeys) and why it is so important to get an outsider’s opinion before implementing your next big project! (=We have always done it this way)
If you want to follow up with Bernhard and his work, check out the following links we discussed during the podcast:
Bernhard on LinkedIn
FACTOR
Podcast (German): Newsletter Marketing
Podcast (German): Build - Measure - Learn

Dec 5, 2022 • 53min
SRE for the non-unicorns (aka Enterprises) with James Brookbank
You have a CISO (Chief Information Security Officer) but no CRO (Chief Reliability Officer)? You blame people if systems crash? You scale your people at the rate you scale your infrastructure? If you answer any of those questions with YES, then you should tune into this podcast, as you probably struggle with adopting Site Reliability Engineering (SRE) in your organization.
James Brookbank, Cloud Solutions Architect, dealt with resiliency topics in a large enterprise prior to joining Google. In our conversation he shares the advice he gives enterprises to convert the excitement about SRE into actual implementation. James gave some good guidance on which projects are good, and which are not so good, to start with. He gives practical examples of what it means to change your company culture and why there doesn’t have to be an SRE for every service.
In our call we discussed the SRE in Enterprise talk at DevOpsDays Boston and SRECon EMEA as well as their recent book. Here are all the relevant links:
James Brookbank on LinkedIn: https://www.linkedin.com/in/jamesbrookbank/
SRECon EMEA Slides: https://www.usenix.org/system/files/srecon22_slides_mcghee.pdf
DevOpsDays Boston 2022 Session Recording: https://www.youtube.com/watch?v=__e7b25QOHc
Enterprise Roadmap to SRE Book: https://sre.google/resources/practices-and-processes/enterprise-roadmap-to-sre/

Nov 21, 2022 • 43min
What is Dynatrace Grail and Why should you care with Andreas Lehofer
Dynatrace recently announced Grail, promising boundless observability, security and business analytics in context.
You may think: that’s a lot of nice words that other solutions claim as well. So why should you care about Grail? What is the real problem it solves and how does it solve it?
Tune in and hear from Andreas Lehofer, Chief Product Officer at Dynatrace, as he boils it down to two critical issues:
* Cost vs Value of your data: Current approaches are expensive as you keep 95% of your data not knowing whether you will ever need it!
* Functional Limits of siloed observability data: When you need answers, the current siloed approach is slow and limited!
Thanks Andreas for the discussion, the insights on the hidden costs of current approaches, the technical explanation of our architecture, as well as a glimpse of what’s coming next.
Show Links:
Dynatrace Grail Announcement: https://www.dynatrace.com/platform/grail/
Andreas Lehofer on LinkedIn: https://www.linkedin.com/in/andreaslehofer/


