

PurePerformance
PurePerformance
The brutal truth about digital performance engineering and operations.Andreas (aka Andi) Grabner and Brian Wilson are veterans of the digital performance world. Combined they have seen too many applications not scaling and performing up to expectations. With more rapid deployment models made possible through continuous delivery and a mentality shift sparked by DevOps they feel it’s time to share their stories. In each episode, they and their guests discuss different topics concerning performance, ranging from common performance problems for specific technology platforms to best practices in development, testing, deploying and monitoring software performance and user experience. Be prepared to learn a lot about metrics.Andi & Brian both work at Dynatrace, where they get to witness more real world customer performance issues than they can TPS report at.
Episodes
Mentioned books

Jan 16, 2023 • 50min
Learning from Incidents is what good SREs do with Laura Nolan
Incidents happen! And when asking Laura Nolan who was an SRE at Google and Slack, healthy organizations should take proper time to analyze and learn from them. This will improve future incident response as well as overall system resiliency.Tune in to this episode and hear Laura’s tips & tricks what makes a good SRE organization. It starts with doing good write ups of incidents, doing your research on incident reports of software and services that you are looking into using. We also spent a good amount of time discussing root cause analysis where she highlighted an incident that happened at her time at Google and what she learned about outdated alerting.Thanks Laura for a great discussion and lots of insights.Here are the additional links we discussed during the podcastLaura on LinkedIn: https://www.linkedin.com/in/laura-nolan-bb7429/Laura on Twitter:https://twitter.com/lauraliftsIncident Template talk @ SRECon: https://www.usenix.org/conference/srecon22emea/presentation/nolan-breakWhat SRE could be talk @ SRECon: https://www.usenix.org/conference/srecon22emea/presentation/nolan-sreHowie Post-Incident Guide: https://www.jeli.io/howie/welcomeMy philosophy on Alerting article: https://docs.google.com/document/d/199PqyG3UsyXlwieHaqbGiWVa8eMWi8zzAn0YfcApr8Q/edit

Jan 1, 2023 • 42min
What happened in 2022 and where 2023 is taking us!
What a year 2022 was! We had 25! episodes with amazing guests from all over the world covering topics from Kubernetes, OpenTelemetry, DevOps, SRE, Cloud Migrations, DNS, Value Streams all the way to Persona Driven Engineering and drawing parallels with Digital Marketing. If you are new to our podcast check out the playlist and listen to some of those we mentioned during our episode!Now its time to say Thank You listeners for the continued support. After 5+ years of podcasting we still see rising numbers of downloads which is the best motivation for us to keep going. Stay tuned as we are going to cover industry relevant topics going into 2023 – or is it year 53? (only those will know that listen to the full episode)

Dec 19, 2022 • 53min
Building the right thing: Learning from digital marketing expert Bernhard Dominguez
“If I wouldn’t measure it I wouldn’t know it!” or “Build, Measure, Learn! ”These quotes could be from any engineer building new digital services, observing them in production and based on that learn how to improve their software.They are however from Bernhard Dominguez, Digital Consultant at FACTOR, who we invited to the show. Bernhard highlights a lot of parallels between his work planning and executing digital marketing strategies and the world we live in: designing, operating and optimizing complex software systems.Tune in and learn about how important it is to understand your real target groups (=end users), how to define clear goals (=SLOs), how to change from campaign to funnel activities (=User Journeys) and why it is so important to get an outsider’s opinion before implementing your next big project! (=We have always done it this way) If you want to follow up with Bernhard and his work check out the following links we discussed during the podcast:Bernhard on LinkedInFACTORPodcast (German): Newsletter MarketingPodcast (German): Build - Measure - Learn

Dec 5, 2022 • 53min
SRE for the non-unicorns (aka Enterprises) with James Brookbank
You have a CISO (Chief Security Information Officer) but no CRO (Chief Reliability Officer)? You blame people if systems crash? You scale your people in the rate of scaling your infrastructure? If you answer any of those questions with YES then you should tune into this podcast as you probably struggle adopting Site Reliability Engineering (SRE) in your organization.James Brookbank, Cloud Solutions Architect, has dealt with resiliency topics in a large enterprise prior to joining Google. In our conversation he shares advice he gives Enterprises to convert the excitement about SRE into actual implementation. James gave some good guidance on what good and not so good projects are to start with. He gives practical examples on what it means to change your company culture and why there doesn’t have to be an SRE for every service.In our call we discussed the SRE in Enterprise talk at DevOpsDays Boston and SRECon EMEA as well as their recent book. Here are all the relevant links:James Brookbank on Linkedin:https://www.linkedin.com/in/jamesbrookbank/SRECon EMEA Slides: https://www.usenix.org/system/files/srecon22_slides_mcghee.pdfDevOpsDays Boston 2022 Session Recording: https://www.youtube.com/watch?v=__e7b25QOHcEnterprise Roadmap to SRE Book: https://sre.google/resources/practices-and-processes/enterprise-roadmap-to-sre/

Nov 21, 2022 • 43min
What is Dynatrace Grail and Why should you care with Andreas Lehofer
Dynatrace recently announced Grail – promising boundless observability, security and business analytics in context.You may think: that’s a lot of nice words that other solutions claim as well. So why should you care about Grail? What is the real problem it solves and how does it solve it?Tune in and hear from Andreas Lehofer, Chief Product Officer at Dynatrace as he boils it down to two critical issues:* Cost vs Value of your data: Current approaches are expensive as you keep 95% of your data not knowing whether you ever need it!* Functional Limits with having siloed observability data: When you need answers the current siloed approach is slow and limited!Thanks Andreas for the discussion, the insights on the hidden costs of current approaches, the technical explanation on our architecture as well as giving us some glimpse on what’s coming next.Show Links:Dynatrace Grail Announcement:https://www.dynatrace.com/platform/grail/Andreas Lehofer on Linkedin:https://www.linkedin.com/in/andreaslehofer/

Nov 7, 2022 • 42min
How I became an SRE in FinTech and what this means with Diana Najda
“I was not that interested in coding but more in understanding the impact of software on human beings” says Diana Najda, SRE & Monitoring Lead, when we asked her how she ended up leading the efforts around Site Reliability Engineering.Tune in to our conversation and learn how Diana is bridging the gap between Dev, Ops and Business by ensuring that the right people get the right telemetry data from their observability platform. She gives us insights into her definition of DevOps and SRE, how she helps teams setting up SLOs (Service Level Objectives) and how she proves the ROI (Return On Investment) into the SRE practices!Last piece of advice Diana gives everyone interested: “SRE might be buzzword it loses the buzz the more you hear it – BUT - its really cool because SREs make the life of Dev and Ops easier every day”If you want to connect with Diana reach her on LinkedIn: https://www.linkedin.com/in/diannajda/

Oct 24, 2022 • 49min
How to fail at Serverless (without even trying) with Kam Lasater
Serverless and other emerging technologies hide the complexity of the underlying runtimes from developers. This is great for productivity but can make it really hard when troubleshooting behavior that needs deeper insight into those runtimes, platforms or frameworks.In this episode we hear from Kam Lasater, Founder of Cyclic Software. Kam has run into several walls while he was implementing solutions from scratch using Serverless technologies as well as other popular cloud services. He recently presented a handful of those scenarios at DevOpsDays Boston 2022.Tune in and learn from Kam as he walks us through two of those challenges he covered during his DevOpsDays talk. If you want to learn more make sure to watch the full talk on YouTube: https://www.youtube.com/watch?v=xB9vsSl93mE If you want to learn more from or about Kam check out the following links:YouTube video from DevOpsDays Boston: https://www.youtube.com/watch?v=xB9vsSl93mECyclic Website: https://www.cyclic.sh/Cyclic Blog: https://www.cyclic.sh/blog/Twitter: https://twitter.com/seekayelPersonal Website: https://kamlasater.com/LinkedIn: https://www.linkedin.com/in/kamlasater/

Oct 10, 2022 • 43min
How to optimize performance and cost of k8s workloads with Stefano Doni
Over the years we learned how to optimize the performance of our JVMs, our CLRs or our databases instances by tweaking settings around heap sizes, garbage collection behavior or connection and thread pools.As we move our workloads to k8s we need to adapt our optimization efforts as they are new nobs to turn. We need to factor in how resource and request limits on pods impact your application runtimes that run on your clusters. Out of memory problems are all of a sudden no longer just depending on the java heap size alone!To learn more about k8s optimization best practices we have invited Stefano Doni, CTO of Akamas. Stefano walks us through key learnings as the team at Akamas has helped organizations optimize the performance, resiliency and cost of their k8s workloads. You will learn about proper memory settings, CPU throttling and how to start saving costs as you move more workloads to k8s. To learn more about Akamas go here: https://www.akamas.io/If you happen to be at KubeCon 2022 in Detroit make sure to visit their boothShow Links:Stefano on Linkedin: https://www.linkedin.com/in/stefanodoni/A Guide to Autonomous Performance Optimization with Dynatrace and Akamas: https://www.youtube.com/watch?v=i7MuEjeOvX0

Sep 26, 2022 • 47min
Value Streams – Tying Business Results to your DevOps & Cloud Transformation with Adam Dahlgren
In economic turbulent times leaders get asked questions like: “What’s the return on investment of your DevOps or Cloud Transformation? Did we really get better and more efficient? Or did we just blow a lot of money out the window?”Connecting business results with your technical initiatives is what would answer those questions. To learn how this works we invited Adam Dahlgren, SVP Product at Allstacks. From Adam we learn about Value Stream Management, how to align with your top level OKRs and how to improve your DORA and SPACE metrics. Because as Adam says in the beginning: “Inspection is coming especially during turbulent economic times and they will question your investment in transformation projects!” If you want to follow up with Adam check out the following links we discussed:LinkedIn: https://www.linkedin.com/in/adam-dahlgren/What are DORA Metrics: https://www.allstacks.com/blog/dora-metrics/?hsLang=enWhat is the SPACE Framework: https://queue.acm.org/detail.cfm?id=3454124Allstack: https://www.allstacks.com/DevOps World sessions from Allstack: https://events.devopsworld.com/widget/cloudbees/devopsworld22/conferenceSessionDetails?tab.day=20220929&search=dora

Sep 12, 2022 • 48min
Why is it always DNS, TLS or Bad Config? This and many other learnings from Philipp Krenn
We all want to leverage technology to solve problems. New and shiny toys are appealing to look which sometimes means we loose the insights on the base technologies that powers most of our connected lives, such as DNS or TLS.In this podcast we invited Philipp Krenn (@xeraa), Dev Advocate Team Lead at Elastic, and learn about DNS, TLS and other bad config changes. We learn about Log4Shell, how the Java Security Manager was a big help in fighting Log4Shell, why its been deprecated and also get his thoughts into CDD (Conference Driven Development)And if you ever visit Vienna – chances are you meet Philipp dancing Waltz with tourists 😊Show Links:To learn more from Philipp start withHis personal website: https://xeraa.net/Twitter: https://twitter.com/xeraaLinkedIn: https://www.linkedin.com/in/philippkrennHis conference schedule (past & future): https://xeraa.net/events/