Drill to Detail cover image

Drill to Detail

Latest episodes

undefined
Mar 13, 2018 • 36min

Drill to Detail Ep.51 'Druid, Imply and OLAP Analysis on Event-Level Datasets' With Special Guest Fangjin Yang

Mark Rittman is joined by Special Guest Fangjin Yang to talk about the history of Druid, a high-performance, column-oriented, distributed data store originally developed by the team at Metamarkets to provide fast ad-hoc access to large amounts of event-level marketing data, and his work at Imply to commercialise Druid and build a suite of supporting query and data management tools.Druid project homepageDruid - A Real-Time Analytical Data Store (pdf)Druid - Learning about the Druid ArchitectureImply.io homepageDruid, Imply and Looker 5 bring OLAP Analysis to BigQuery’s Data Warehouse
undefined
Feb 27, 2018 • 1h 10min

Drill to Detail Ep.50 'Agile BI, Karl Marx and Our Man from Moscow' With Special Guests Stewart Bryson and Alex Gorbachev

Mark Rittman is joined in this 50th Episode Special by our original guest on the first episode of Drill to Detail, Stewart Bryson, to talk about developing agile BI applications using FiveTran, SnowflakeDB and Looker and his recent work developing a BI solution for Google Play Marketing using Google Data Studio and Google Cloud Platform. We're also joined later in the show by Alex Gorbachev from Pythian, our mystery guest who Stewart then interviews flawlessly armed only with a set of questions given to him as the guest was unveiled ... though be sure to listen past the final closing music for the bonus out-takes.#115 Google Play Marketing with Dom Elliott and Stewart BrysonThe Next-Generation Jump ProgramThe Data Sharehouse is HereFrom Data Warehouse to Data SharehouseAlex Gorbachev profile on Pythian.com
undefined
Feb 5, 2018 • 46min

Drill to Detail Ep.49 'Trifacta, Google Cloud Dataprep and Data Wranging for Data Engineers' With Special Guest Will Davis

Mark Rittman is joined by Will Davis from Trifacta to talk about the public beta of Google Cloud Dataprep, Trifacta's data wrangling platform and topics including metadata management, data quality and data management for big data and cloud data sources.Google Cloud Dataprep on Google Cloud Platform"Google Cloud Dataprep: Spreadsheet-Style Data Wrangling Powered by Google Cloud Dataflow""A New Cloud-Based Data Prep Solution from Google & Trifacta"Trifacta website"A Breakthrough Approach to Exploring and Preparing Data"Trifacta platform architecture"Garbage In, Garbage Out: Why Data Quality Matters""How to Put an Effective Metadata Strategy in Place"
undefined
Jan 23, 2018 • 1h 2min

Drill to Detail Ep.48 'Mondrian OLAP, Apache Calcite and Database Dis-Aggregation' With Special Guest Julian Hyde

- Oracle Designer page on Oracle.com- Bitmap Index page on Wikipedia- Mondrian project page on Github- Mondrian OLAP Server page on Wikipedia- MultiDimensional eXpressions (MDX) page on Wikipedia- Julian Hyde blog - Apache Calcite project homepage- Apache Calcite Introduction and Overview deck- Streaming SQL presentation at Apex Big Data World 2017, Mountain View, California
undefined
Dec 26, 2017 • 1h 45min

Drill to Detail Ep.47 'Business Analytics 2018 Predictive and Best-Practice Christmas & New Year Special' With Special Guest Christian Berg

Mark is joined by long-term industry veteran and friend Christian Berg to talk about surviving fifteen years as a contractor in analytics industry, changes he's seen in the market and in how project are approached, the value in getting involved in the community, and in a specially extended Christmas and New Year edition we look back at what was topical in 2017 and what are Christian's predictions for 2018 ... and appoint Christian as Head of our Best Practices Found on the Internet.
undefined
Dec 19, 2017 • 55min

Drill to Detail Ep.46 'Market Trends and Findings from the BI Survey 17' With Special Guest Dr. Carsten Bange

Mark Rittman is joined in this episode of Drill to Detail by Dr. Carsten Bange from BARC to talk about findings from the recently completed BI Survey 17 including the continuing move to modern BI platforms and self-service desktop tools, analytics adoption trends and the increasing incorporation of BI functionality within business applications, the surprising topicality of master data management and data governance ... and whatever happened to Nigel Pendse and his legendary OLAP Report?
undefined
Dec 13, 2017 • 58min

Drill to Detail Ep.45 'Tellius, YellowFin and the State of AI in Analytics Today' With Special Guest Jen Underwood

Mark Rittman is joined in this episode by returning special guest Jen Underwood to talk about what's new and innovative in the BI and analytics industry right now, and how AI and machine learning are this year's data discovery and data visualization.
undefined
Dec 8, 2017 • 47min

Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney

Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
undefined
Nov 30, 2017 • 53min

Drill to Detail Ep.43 'Oracle Analytics, Data Visualization Desktop 4.0 and The Art of Product Management' with Special Guest Mike Durran

Mark is joined by Mike Durran from the Oracle Analytics Product Management team in this UKOUG Tech’17 special to talk about his route into product management via the Oracle Discoverer BI tool, Oracle’s latest product in this space Oracle Data Visualization Desktop 4 and its new features, and Mike’s upcoming sessions at the UK Oracle User Group’s Tech’17 event next week in Birmingham, UK.
undefined
Nov 12, 2017 • 44min

Drill to Detail Ep.42 'Evaluex, ML and Optimizing BigQuery & Athena' With Special Guest Avi Zloof

Mark is joined in this episode by Avi Zloof from Evaluex to talk about the new world of elastically-provisioned cloud-hosted analytic databases such as Google BigQuery and Amazon Athena, how their pricing model and vendor strategy differs from the traditional database vendors, and how machine learning can be used to automate performance tuning and optimize workloads in this new world of large-scale distributed query and storage.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app