
What's New In Data

Latest episodes

Apr 1, 2025 • 1h 17min

Database Kernel Development, Streaming PII Obfuscation, and Change Data Capture with Alok Pareek

Alok Pareek, Co-founder and EVP of Products at Striim, joins What’s New in Data to dive into the game-changing innovations in Striim’s latest release. We explore how real-time data streaming is transforming analytics, operations, and decision-making across industries. Alok breaks down the challenges of building reliable, low-latency data pipelines and shares how Striim’s newest advancements help businesses process and act on data faster than ever. From cloud adoption to AI-driven insights, we discuss what’s next for streaming-first architectures and why the shift to real-time data is more critical than ever.

Learn more about our latest release on Striim's Release Highlight page.

What's New In Data is a data thought leadership series hosted by John Kutay, who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss the latest trends, common real-world data patterns, and analytics success stories.
Mar 25, 2025 • 53min

Building for Scale: AWS’s Marc Brooker on Distributed SQL

Marc Brooker, VP and Distinguished Engineer for Databases at AWS, dives into the world of Distributed SQL (DSQL) and its innovative serverless architecture. He explains how DSQL is reshaping cloud databases by balancing consistency, availability, and scalability, eliminating traditional hassles. Brooker shares insights on enhancing distributed systems with EC2 clock synchronization, achieving consistency in multi-region databases, and the exciting integration of AI with serverless technologies, all while setting new standards in high-performance cloud computing.
Mar 18, 2025 • 1h 11min

Scaling Databases in the AI Era: Insights from Andy Pavlo (Carnegie Mellon University)

In this engaging discussion, Andy Pavlo, an Associate Professor at Carnegie Mellon University, explores the dynamic landscape of databases. He delves into the distinctions between OLTP and OLAP systems and discusses the unique challenges of distributed databases. Pavlo highlights the innovative rise of vector databases and how they integrate with AI, emphasizing their capabilities for similarity searches. The conversation also touches on the evolution of data formats and the importance of clean data in modern analytics, making it a must-listen for data enthusiasts.
Nov 21, 2024 • 40min

From the Marines to Data Engineering with Alexander Noonan (Dagster Labs)

Alex Noonan, a developer advocate at Dagster, shares his fascinating journey from Marine aircraft mechanic to data engineering guru. He discusses the challenges of social media shifts post-Elon Musk's Twitter takeover, exploring how platforms like Bluesky are changing community dynamics. Alex offers insights on networking strategies across LinkedIn, Reddit, and more, emphasizing the relevance of tailored content. He also highlights AI's transformative role in streamlining data processes, showcasing innovations like Dagster that enhance the day-to-day workflow of data professionals.
Oct 31, 2024 • 40min

Leveraging Data and AI in your Go-to-Market Strategy with Everett Berry from Clay

Everett Berry joins the show again to share fresh perspectives on applying data in your sales and go-to-market strategy. We talk about his journey to a key role at Clay, where he leads Go-To-Market Engineering to tackle the complexities of serving and enhancing data for sales. Everett discusses how Clay's tools improve data accuracy and reach, helping businesses streamline revenue operations with smarter data use.

In this episode, we also dive into AI's impact on sales operations and revenue processes. Everett and John explore how AI agents and human teams interact, the integration of customer data platforms with CRMs, and the merging of RevOps and data teams. Picture a future where AI autonomously handles data tasks, changing how teams collaborate and redefining organizational roles. For anyone following the fast-paced world of sales tech, this conversation offers a forward-looking view on autonomous data management and its potential to transform business practices.
Oct 25, 2024 • 50min

From Apache Kafka to PostgreSQL, PostgreSQL maturity and extensions, and building on PostgreSQL with Gwen Shapira (CPO at Nile)

Gwen Shapira, co-founder and CPO of Nile, is a notable force in the PostgreSQL world after leading Kafka development at Confluent. She shares her journey from cloud-native technologies to the PostgreSQL community, highlighting its vibrant evolution. Interesting discussions include PostgreSQL 17's new features, the integration of vector embeddings for AI applications, and the importance of SSL for secure connections. Gwen also explores how PostgreSQL supports diverse SaaS applications, emphasizing its flexibility and scalability.
Oct 10, 2024 • 14min

Shifting Data Quality Left, New O'Reilly Book, and Data Contracts with Chad Sanderson and Mark Freeman from Gable

Join us as we catch up with Chad Sanderson and Mark Freeman from Gable, live from Big Data London. Discover Chad's insights from his well-attended talk and why the data scene in London has everyone buzzing. We're diving deep into the concept of shifting data quality left, ensuring upstream data producers are as invested in data governance, privacy, and quality as their downstream counterparts. Chad and Mark also give us a sneak peek into their upcoming O'Reilly book on Data Contracts, complete with the charming Algerian racer lizard as its symbolic mascot.

In this engaging conversation, Chad and Mark offer practical advice for data operators ready to embark on the journey of data contracts. They emphasize the importance of starting small and nurturing a strong cultural initiative to ensure success. Listen as they share strategies on engaging leadership and fostering a collaborative environment, providing a framework not just for implementation but also for securing leadership buy-in. This episode is packed with expert advice and real-world experiences that are a must-listen for anyone in the data field.

John Kutay chimes in with examples of innovative data operators such as George Tedstone deploying Data Contracts at National Grid. Data Contracts and shifting data quality left will certainly be an area that many data teams prioritize as their workloads become increasingly operational.

Download a preview of 'Data Contracts' here. Learn more about Gable. Follow Chad Sanderson on LinkedIn. Follow Mark Freeman on LinkedIn.
Oct 4, 2024 • 24min

Joe Reis at Big Data LDN

Join us as we sit down with Joe Reis, live at Big Data LDN (London) 2024. Joe shares his partnership with DeepLearning.ai and AWS through his new course on Data Engineering. Joe's new course promises to elevate your data skills with hands-on exercises that marry foundational knowledge with cutting-edge practices. We dive into how this course complements his seminal book, "Fundamentals of Data Engineering," and why certification is valuable for those looking for the foundational, hands-on knowledge needed to be a data practitioner.

But that's not all; we also dissect the hurdles of adopting modern data architectures like data mesh in traditionally siloed companies. Using Conway's Law as a lens, Joe discusses why businesses struggle to transition from outdated infrastructures to decentralized systems and why cross-disciplinary skills are crucial in this endeavor, a concept inspired by mixed martial arts that he cleverly calls 'Mixed Model Arts'.

Check out Joe's work: the Fundamentals of Data Engineering book and the new Coursera courses by Joe Reis.
Sep 27, 2024 • 36min

Is Text-to-SQL Ready for Prime Time? Insights from Ethan Ding, CEO of TextQL

Ethan Ding, co-founder and CEO of TextQL, dives into the revolutionary text-to-SQL technology transforming data analysis. He shares how natural language queries empower users, eliminating the need for coding skills. The conversation highlights the challenges of data management and the crucial role of high-quality data for decision-making. Ethan draws interesting parallels between AI in self-driving cars and data querying, showcasing the future of self-service analytics and how TextQL seamlessly integrates with existing BI tools to boost productivity.
Sep 19, 2024 • 42min

Small Data, Big Impact: Insights from MotherDuck's Jacob Matson

What makes MotherDuck and DuckDB game-changers for data analytics? Join us as we sit down with Jacob Matson, a renowned expert in SQL Server, dbt, and Excel, who recently became a developer advocate at MotherDuck. During this episode, Jacob shares his compelling journey to MotherDuck, driven by his frequent use of DuckDB for solving data challenges. We explore the unique attributes of DuckDB, comparing it to SQLite for analytics, and uncover its architectural benefits, such as utilizing multi-core machines for parallel query execution. Jacob also sheds light on how MotherDuck is pushing the envelope with their innovative concept of multiplayer analytics.

Our discussion takes a deep dive into MotherDuck's innovative tenancy model and how it impacts database workloads, highlighting the use of DuckDB format in Wasm for enhanced data visualization. Jacob explains how this approach offers significant compression and faster query performance, making data visualization more interactive. We also touch on the potential and limitations of replacing traditional BI tools with Mosaic, and where MotherDuck stands in the modern data stack landscape, especially for organizations that don't require the scale of BigQuery or Snowflake. Plus, get a sneak peek into the upcoming Small Data Conference in San Francisco on September 23rd, where we'll explore how small data solutions can address significant problems without relying on big data. Don't miss this episode packed with insights on DuckDB and MotherDuck innovations!

Small Data SF Signup Discount Code: MATSON100
