
Drill to Detail
Mark Rittman is joined each episode by a special guest from the world of business intelligence, analytics and big data.
Latest episodes

Mar 13, 2018 • 36min
Drill to Detail Ep.51 'Druid, Imply and OLAP Analysis on Event-Level Datasets' With Special Guest Fangjin Yang
Mark Rittman is joined by Special Guest Fangjin Yang to talk about the history of Druid, a high-performance, column-oriented, distributed data store originally developed by the team at Metamarkets to provide fast ad-hoc access to large amounts of event-level marketing data, and his work at Imply to commercialise Druid and build a suite of supporting query and data management tools.Druid project homepageDruid - A Real-Time Analytical Data Store (pdf)Druid - Learning about the Druid ArchitectureImply.io homepageDruid, Imply and Looker 5 bring OLAP Analysis to BigQuery’s Data Warehouse

Feb 27, 2018 • 1h 10min
Drill to Detail Ep.50 'Agile BI, Karl Marx and Our Man from Moscow' With Special Guests Stewart Bryson and Alex Gorbachev
Mark Rittman is joined in this 50th Episode Special by our original guest on the first episode of Drill to Detail, Stewart Bryson, to talk about developing agile BI applications using FiveTran, SnowflakeDB and Looker and his recent work developing a BI solution for Google Play Marketing using Google Data Studio and Google Cloud Platform. We're also joined later in the show by Alex Gorbachev from Pythian, our mystery guest who Stewart then interviews flawlessly armed only with a set of questions given to him as the guest was unveiled ... though be sure to listen past the final closing music for the bonus out-takes.#115 Google Play Marketing with Dom Elliott and Stewart BrysonThe Next-Generation Jump ProgramThe Data Sharehouse is HereFrom Data Warehouse to Data SharehouseAlex Gorbachev profile on Pythian.com

Feb 5, 2018 • 46min
Drill to Detail Ep.49 'Trifacta, Google Cloud Dataprep and Data Wranging for Data Engineers' With Special Guest Will Davis
Mark Rittman is joined by Will Davis from Trifacta to talk about the public beta of Google Cloud Dataprep, Trifacta's data wrangling platform and topics including metadata management, data quality and data management for big data and cloud data sources.Google Cloud Dataprep on Google Cloud Platform"Google Cloud Dataprep: Spreadsheet-Style Data Wrangling Powered by Google Cloud Dataflow""A New Cloud-Based Data Prep Solution from Google & Trifacta"Trifacta website"A Breakthrough Approach to Exploring and Preparing Data"Trifacta platform architecture"Garbage In, Garbage Out: Why Data Quality Matters""How to Put an Effective Metadata Strategy in Place"

Jan 23, 2018 • 1h 2min
Drill to Detail Ep.48 'Mondrian OLAP, Apache Calcite and Database Dis-Aggregation' With Special Guest Julian Hyde
- Oracle Designer page on Oracle.com- Bitmap Index page on Wikipedia- Mondrian project page on Github- Mondrian OLAP Server page on Wikipedia- MultiDimensional eXpressions (MDX) page on Wikipedia- Julian Hyde blog - Apache Calcite project homepage- Apache Calcite Introduction and Overview deck- Streaming SQL presentation at Apex Big Data World 2017, Mountain View, California

Dec 26, 2017 • 1h 45min
Drill to Detail Ep.47 'Business Analytics 2018 Predictive and Best-Practice Christmas & New Year Special' With Special Guest Christian Berg
Mark is joined by long-term industry veteran and friend Christian Berg to talk about surviving fifteen years as a contractor in analytics industry, changes he's seen in the market and in how project are approached, the value in getting involved in the community, and in a specially extended Christmas and New Year edition we look back at what was topical in 2017 and what are Christian's predictions for 2018 ... and appoint Christian as Head of our Best Practices Found on the Internet.

Dec 19, 2017 • 55min
Drill to Detail Ep.46 'Market Trends and Findings from the BI Survey 17' With Special Guest Dr. Carsten Bange
Mark Rittman is joined in this episode of Drill to Detail by Dr. Carsten Bange from BARC to talk about findings from the recently completed BI Survey 17 including the continuing move to modern BI platforms and self-service desktop tools, analytics adoption trends and the increasing incorporation of BI functionality within business applications, the surprising topicality of master data management and data governance ... and whatever happened to Nigel Pendse and his legendary OLAP Report?

Dec 13, 2017 • 58min
Drill to Detail Ep.45 'Tellius, YellowFin and the State of AI in Analytics Today' With Special Guest Jen Underwood
Mark Rittman is joined in this episode by returning special guest Jen Underwood to talk about what's new and innovative in the BI and analytics industry right now, and how AI and machine learning are this year's data discovery and data visualization.

Dec 8, 2017 • 47min
Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.

Nov 30, 2017 • 53min
Drill to Detail Ep.43 'Oracle Analytics, Data Visualization Desktop 4.0 and The Art of Product Management' with Special Guest Mike Durran
Mark is joined by Mike Durran from the Oracle Analytics Product Management team in this UKOUG Tech’17 special to talk about his route into product management via the Oracle Discoverer BI tool, Oracle’s latest product in this space Oracle Data Visualization Desktop 4 and its new features, and Mike’s upcoming sessions at the UK Oracle User Group’s Tech’17 event next week in Birmingham, UK.

Nov 12, 2017 • 44min
Drill to Detail Ep.42 'Evaluex, ML and Optimizing BigQuery & Athena' With Special Guest Avi Zloof
Mark is joined in this episode by Avi Zloof from Evaluex to talk about the new world of elastically-provisioned cloud-hosted analytic databases such as Google BigQuery and Amazon Athena, how their pricing model and vendor strategy differs from the traditional database vendors, and how machine learning can be used to automate performance tuning and optimize workloads in this new world of large-scale distributed query and storage.