Learning Bayesian Statistics

Alexandre Andorra

Are you a researcher or data scientist / analyst / ninja? Do you want to learn Bayesian inference, stay up to date or simply want to understand what Bayesian inference is?

Then this podcast is for you! You'll hear from researchers and practitioners of all fields about how they use Bayesian statistics, and how in turn YOU can apply these methods in your modeling workflow.

When I started learning Bayesian methods, I really wished there were a podcast out there that could introduce me to the methods, the projects and the people who make all that possible.

So I created "Learning Bayesian Statistics", where you'll get to hear how Bayesian statistics are used to detect black matter in outer space, forecast elections or understand how diseases spread and can ultimately be stopped.

But this show is not only about successes -- it's also about failures, because that's how we learn best. So you'll often hear the guests talking about what *didn't* work in their projects, why, and how they overcame these challenges. Because, in the end, we're all lifelong learners!

My name is Alex Andorra by the way, and I live in Estonia. By day, I'm a data scientist and modeler at the https://www.pymc-labs.io/ (PyMC Labs) consultancy. By night, I don't (yet) fight crime, but I'm an open-source enthusiast and core contributor to the python packages https://docs.pymc.io/ (PyMC) and https://arviz-devs.github.io/arviz/ (ArviZ). I also love https://www.pollsposition.com/ (election forecasting) and, most importantly, Nutella. But I don't like talking about it – I prefer eating it.

So, whether you want to learn Bayesian statistics or hear about the latest libraries, books and applications, this podcast is for you -- just subscribe! You can also support the show and https://www.patreon.com/learnbayesstats (unlock exclusive Bayesian swag on Patreon)!

Episodes

Mentioned books

Mar 5, 2025 • 1h 4min

#127 Saving Sharks... with Python, Causal Inference and Aaron MacNeil

Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!Intro to Bayes Course (first 2 lessons free)Advanced Regression Course (first 2 lessons free)Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!Visit our Patreon page to unlock exclusive Bayesian swag ;)Thank you to my Patrons for making this episode possible!Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire, Mike Loncaric, David McCormick, Ronald Legere, Sergio Dolia and Michael Cao.Takeaways:Sharks play a crucial role in maintaining healthy ocean ecosystems.Bayesian statistics are particularly useful in data-poor environments like ecology.Teaching Bayesian statistics requires a shift in mindset from traditional statistical methods.The shark meat trade is significant and often overlooked.Ray meat trade is as large as shark meat trade, with specific markets dominating.Understanding the ecological roles of species is essential for effective conservation.Causal language is important in ecological research and should be encouraged.Evidence-driven decision-making is crucial in balancing human and ecological needs.Expert opinions are crucial for understanding species composition in landings.Trade dynamics are influenced by import preferences and species availability.Bayesian modeling allows for the incorporation of various data sources and expert knowledge.Field data collection is essential for validating model assumptions.The complexity of trade relationships necessitates a nuanced approach to modeling.Understanding the impact of management interventions on landings is critical.The role of scientists in informing policy is vital for effective conservation efforts.Chapters:00:00 Introduction to Marine Biology and Statistics04:33 The Role of Bayesian Statistics in Marine Research10:09 Challenges in Teaching Bayesian Statistics21:58 The Importance of Sharks in Ecosystems26:35 Understanding Shark Meat Trade and Conservation32:09 The Trade in Ray and Shark Meat36:18 Modeling Landings and Trade42:56 Challenges in Data Integration44:50 Running Complex Models51:57 Expert Elicitation and Prior Construction55:52 Future Directions and Research56:46 Reflections on Science and PolicyLinks from the show:Fisheries Lab: https://ifisheries.org/?page_id=83LBS #51 Bernoulli’s Fallacy & the Crisis of Modern Science, with Aubrey Clayton: https://learnbayesstats.com/episode/51-bernoullis-fallacy-crisis-modern-science-aubrey-claytonTranscriptThis is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

Feb 19, 2025 • 55min

#126 MMM, CLV & Bayesian Marketing Analytics, with Will Dean

Join Will Dean, a statistician and data scientist at PyMC Labs, as he dives into the world of Bayesian marketing analytics. He shares insights on leveraging customer lifetime value (CLV) and media mix modeling to optimize marketing strategies. The discussion highlights the significance of productionizing models and the challenges that come with it, including version control and model management. With a focus on open-source collaboration, Will emphasizes the importance of continuous learning and innovative approaches to empower marketers with actionable data.

Feb 5, 2025 • 58min

#125 Bayesian Sports Analytics & The Future of PyMC, with Chris Fonnesbeck

Chris Fonnesbeck, a trailblazer in sports analytics and a core developer of PyMC, shares insights from his unique journey that blends marine biology and statistical consulting with sports modeling. He discusses the evolution of sports analytics, particularly in baseball, and highlights the impact of Bayesian methods. Fonnesbeck dives into advanced modeling techniques using Gaussian processes and emphasizes community contributions to the future of PyMC. With humor, he relates the complexities of polling data to sports analytics, showing the fun side of data interpretation.

Jan 22, 2025 • 1h 36min

#124 State Space Models & Structural Time Series, with Jesse Grabowski

Jesse Grabowski, PhD candidate at Paris 1 Pantheon-Sorbonne and principal data scientist at PyMC Labs, dives into the intricate world of state space models in time series analysis. He discusses the powerful adaptability of Bayesian methods in econometrics, emphasizing how they enhance forecasting accuracy. Grabowski highlights the balance between model complexity and simplicity, the significance of understanding trends, and the practical applications of innovations and latent states. Plus, he unwraps the role of the Kalman filter in managing dynamic data.

Jan 10, 2025 • 1h 32min

#123 BART & The Future of Bayesian Tools, with Osvaldo Martin

Osvaldo Martin, a collaborator on various open-source Bayesian projects and educator at PyMC Labs, discusses the power of Bayesian Additive Regression Trees (BART). He explains how BART simplifies modeling for those lacking domain expertise. The conversation also highlights advancements in tools like PyMC-BART and PreliZ, emphasizing their contributions to prior elicitation. Osvaldo shares insights on integrating BART with Bambi and the importance of interactive learning in teaching Bayesian statistics. Additionally, he touches upon future enhancements for user experience in Bayesian analysis.

Dec 26, 2024 • 1h 23min

#122 Learning and Teaching in the Age of AI, with Hugo Bowne-Anderson

Hugo Bowne-Anderson is an independent data and AI consultant, known for his insights in education and podcasting. He discusses the challenges of building and deploying Large Language Model applications, stressing the importance of feedback in data science education. Hugo shares his journey in creating practical courses, highlighting the need for continuous learning in tech. He emphasizes collaboration between data scientists and software engineers and reflects on the evolving landscape of AI, calling for problem-solving over specific tools for aspiring data scientists.

Dec 11, 2024 • 1h 8min

#121 Exploring Bayesian Structural Equation Modeling, with Nathaniel Forde

Nathaniel Forde, a staff data scientist at Personio and a contributor to the PyMC ecosystem, shares insights on Bayesian structural equation modeling. He highlights the importance of confirmatory factor analysis in validating constructs and discusses the flexibility of Bayesian methods in analyzing complex relationships. Forde emphasizes the necessity of model validation and sensitivity analysis to ensure robust findings. He also reflects on his journey into data science, noting how early challenges shaped his approach to data quality and causal inference.

Nov 27, 2024 • 1h 2min

#120 Innovations in Infectious Disease Modeling, with Liza Semenova & Chris Wymant

Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!My Intuitive Bayes Online Courses1:1 Mentorship with meOur theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!Visit our Patreon page to unlock exclusive Bayesian swag ;)-------------------------Love the insights from this episode? Make sure you never miss a beat with Chatpods! Whether you're commuting, working out, or just on the go, Chatpods lets you capture and summarize key takeaways effortlessly.Save time, stay organized, and keep your thoughts at your fingertips.Download Chatpods directly from App Store or Google Play and use it to listen to this podcast today!https://www.chatpods.com/?fr=LearningBayesianStatistics-------------------------Takeaways:Epidemiology focuses on health at various scales, while biology often looks at micro-level details.Bayesian statistics helps connect models to data and quantify uncertainty.Recent advancements in data collection have improved the quality of epidemiological research.Collaboration between domain experts and statisticians is essential for effective research.The COVID-19 pandemic has led to increased data availability and international cooperation.Modeling infectious diseases requires understanding complex dynamics and statistical methods.Challenges in coding and communication between disciplines can hinder progress.Innovations in machine learning and neural networks are shaping the future of epidemiology.The importance of understanding the context and limitations of data in research. Chapters:00:00 Introduction to Bayesian Statistics and Epidemiology03:35 Guest Backgrounds and Their Journey10:04 Understanding Computational Biology vs. Epidemiology16:11 The Role of Bayesian Statistics in Epidemiology21:40 Recent Projects and Applications in Epidemiology31:30 Sampling Challenges in Health Surveys34:22 Model Development and Computational Challenges36:43 Navigating Different Jargons in Survey Design39:35 Post-COVID Trends in Epidemiology42:49 Funding and Data Availability in Epidemiology45:05 Collaboration Across Disciplines48:21 Using Neural Networks in Bayesian Modeling51:42 Model Diagnostics in Epidemiology55:38 Parameter Estimation in Compartmental ModelsThank you to my Patrons for making this episode possible!Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström and Stefan.Links from the show:LBS #21, Gaussian Processes, Bayesian Neural Nets & SIR Models, with Elizaveta Semenova: https://learnbayesstats.com/episode/21-gaussian-processes-bayesian-neural-nets-sir-models-with-elizaveta-semenova/Liza’s website: https://www.elizaveta-semenova.com/Liza on GitHub: https://github.com/elizavetasemenovaLiza on LinkedIn: https://www.linkedin.com/in/elizaveta-semenova/Liza on Google Scholar: https://scholar.google.com/citations?user=jqGIgFEAAAAJ&hl=enChris' page: https://www.bdi.ox.ac.uk/Team/c-wymantChris on GitHub: https://github.com/chrishivChris on LinkedIn: https://www.linkedin.com/in/chris-wymant-65661274/Chris on Blue Sky: https://bsky.app/profile/chriswymant.bsky.socialChris on Google Scholar: https://scholar.google.com/citations?user=OJ6t2UwAAAAJ&hl=enPriorVAE Paper: Explains how to build an emulator for a GP using a deep generative model (Variational Autoencoder, or VAE) and apply it within MCMC. Link to the paperPriorCVAE Paper: Builds on PriorVAE by encoding model parameters along with emulating stochastic process realisations. Includes examples for GPs, ODEs, and double-well models. Link to the paperStanCon 2024 Tutorial: A tutorial covering the basics of sequential decision-making, with a demo of Bayesian Optimization using Stan. Link to the tutorialNumpyro Course: Materials from a course Liza taught -- great for learning Numpyro. Link to the courseaggVAE Paper: An application of PriorVAE to the problem of changing boundaries. Link to the paperTranscriptThis is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

Nov 13, 2024 • 1h 25min

#119 Causal Inference, Fiction Writing and Career Changes, with Robert Kubinec

Robert Kubinec, an assistant professor of political science at the University of South Carolina, dives into the complexities of studying corruption and the innovative survey techniques that can aid in obtaining honest data. He shares insights on how Bayesian methods enhance research by estimating latent variables and uncertain outcomes. Additionally, Kubinec discusses his novel, 'The Bayesian Hitman,' highlighting how fiction writing can improve academic skills. The conversation emphasizes the importance of community in statistics and the potential of real-time surveys to transform social science research.

Oct 30, 2024 • 59min

#118 Exploring the Future of Stan, with Charles Margossian & Brian Ward

Charles Margossian, a research fellow at the Flatiron Institute, and Brian Ward, a core developer of Stan, dive into the future of the Stan programming language. They discuss recent innovations like the addition of tuples, which enhance data handling efficiency. The duo emphasizes the importance of improved error messages for beginners adjusting to Stan's complexities. They also highlight community engagement and the development of new samplers to enhance performance, paving the way for user-friendly features that make Bayesian statistics more accessible.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner