What's New In Data

Striim
undefined
Mar 18, 2025 • 1h 11min

Scaling Databases in the AI Era: Insights from Andy Pavlo (Carnegie Mellon University)

In this engaging discussion, Andy Pavlo, an Associate Professor at Carnegie Mellon University, explores the dynamic landscape of databases. He delves into the distinctions between OLTP and OLAP systems and discusses the unique challenges of distributed databases. Pavlo highlights the innovative rise of vector databases and how they integrate with AI, emphasizing their capabilities for similarity searches. The conversation also touches on the evolution of data formats and the importance of clean data in modern analytics, making it a must-listen for data enthusiasts.
undefined
Nov 21, 2024 • 40min

From the Marines to Data Engineering with Alexander Noonan (Dagster Labs)

Alex Noonan, a developer advocate at Dagster, shares his fascinating journey from Marine aircraft mechanic to data engineering guru. He discusses the challenges of social media shifts post-Elon Musk's Twitter takeover, exploring how platforms like Blue Sky are changing community dynamics. Alex offers insights on networking strategies across LinkedIn, Reddit, and more, emphasizing the relevance of tailored content. He also highlights AI's transformative role in streamlining data processes, showcasing innovations like Dagster that enhance the day-to-day workflow of data professionals.
undefined
Oct 31, 2024 • 40min

Leveraging Data and AI in your Go-to-Market Strategy with Everett Berry from Clay

Everett Berry joins the show again to share fresh perspectives on applying data in your sales and go-to-market strategy. We talk about his journey to a key role at Clay, where he leads Go-To-Market Engineering to tackle the complexities of serving and enhancing data for sales. Everett discusses how Clay's tools improve data accuracy and reach, helping businesses streamline revenue operations with smarter data use.In this episode, we also dive into AI's impact on sales operations and revenue processes. Everett and John explore how AI agents and human teams interact, the integration of customer data platforms with CRMs, and the merging of RevOps and data teams. Picture a future where AI autonomously handles data tasks, changing how teams collaborate and redefining organizational roles. For anyone following the fast-paced world of sales tech, this conversation offers a forward-looking view on autonomous data management and its potential to transform business practices.What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Oct 25, 2024 • 50min

From Apache Kafka to PostgreSQL, PostgreSQL maturity and extensions, and building on PostgreSQL with Gwen Shapira (CPO at Nile)

Gwen Shapira, co-founder and CPO of Nile, is a notable force in the PostgreSQL world after leading Kafka development at Confluent. She shares her journey from cloud-native technologies to the PostgreSQL community, highlighting its vibrant evolution. Interesting discussions include PostgreSQL 17's new features, the integration of vector embeddings for AI applications, and the importance of SSL for secure connections. Gwen also explores how PostgreSQL supports diverse SaaS applications, emphasizing its flexibility and scalability.
undefined
Oct 10, 2024 • 14min

Shifting Data Quality Left, New O'Reilly Book, and Data Contracts with Chad Sanderson and Mark Freeman from Gable

Join us as we catch up with Chad Sanderson and Mark Freeman from Gable, live from Big Data London.  Discover Chad's insights from his well-attended talk and why the data scene in London has everyone buzzing. We're diving deep into the  concept of shifting data quality left, ensuring upstream data producers are as invested in data governance, privacy, and quality as their downstream counterparts. Chad and Mark also give us a sneak peek into their upcoming O'Reilly book on Data Contracts, complete with the charming Algerian racer lizard as its symbolic mascot. In this engaging conversation, Chad and Mark offer practical advice for data operators ready to embark on the journey of data contracts. They emphasize the importance of starting small and nurturing a strong cultural initiative to ensure success. Listen as they share strategies on engaging leadership and fostering a collaborative environment, providing a framework not just for implementation but also for securing leadership buy-in. This episode is packed with expert advice and real-world experiences that are a must-listen for anyone in the data field.John Kutay chimes in with examples of innovative data operators such as George Tedstone deploying Data Contracts at National Grid. Data Contracts and shifting data quality left will certainly be an area that many data teams prioritize as their workloads become increasingly operational. Download a preview of 'Data Contracts' here.Learn more about Gable.Follow Chad Sanderson on LinkedIn.Follow Mark Freeman on LinkedIn.What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Oct 4, 2024 • 24min

Joe Reis at Big Data LDN

Join us as we sit down with Joe Reis, live at Big Data LDN (London) 2024. Joe shares his partnership with DeepLearning.ai and AWS through his new course on Data Engineering. Joe's new course promises to elevate your data skills with hands-on exercises that marry foundational knowledge with cutting-edge practices. We dive into how this course complements his seminal book, "Fundamentals of Data Engineering," and why certification is valuable for those looking for foundational, hands-on knowledge to be a data practitioner. But that's not all; we also dissect the hurdles of adopting modern data architectures like data mesh in traditionally siloed companies. Using Conway's Law as a lens, Joe discuss why businesses struggle to transition from outdated infrastructures to decentralized systems and how cross-disciplinary skills—a concept inspired by mixed martial arts—are crucial in this endeavor as he cleverly calls it 'Mixed Model Arts'. Check out Joe's Work: Fundamentals of Data Engineering bookNew Coursera courses by Joe ReisWhat's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Sep 27, 2024 • 36min

Is Text-to-SQL Ready for Prime Time? Insights from Ethan Ding, CEO of TextQL

Ethan Ding, co-founder and CEO of TextQL, dives into the revolutionary text-to-SQL technology transforming data analysis. He shares how natural language queries empower users, eliminating the need for coding skills. The conversation highlights the challenges of data management and the crucial role of high-quality data for decision-making. Ethan draws interesting parallels between AI in self-driving cars and data querying, showcasing the future of self-service analytics and how TextQL seamlessly integrates with existing BI tools to boost productivity.
undefined
Sep 19, 2024 • 42min

Small Data, Big Impact: Insights from MotherDuck's Jacob Matson

What makes MotherDuck and DuckDB a game-changer for data analytics? Join us as we sit down with Jacob Matson, a renowned expert in SQL Server, dbt, and Excel, who recently became a developer advocate at MotherDuck. During this episode, Jacob shares his compelling journey to MotherDuck, driven by his frequent use of DuckDB for solving data challenges. We explore the unique attributes of DuckDB, comparing it to SQLite for analytics, and uncover its architectural benefits, such as utilizing multi-core machines for parallel query execution. Jacob also sheds light on how MotherDuck is pushing the envelope with their innovative concept of multiplayer analytics.Our discussion takes a deep dive into MotherDuck's innovative tenancy model and how it impacts database workloads, highlighting the use of DuckDB format in Wasm for enhanced data visualization. Jacob explains how this approach offers significant compression and faster query performance, making data visualization more interactive. We also touch on the potential and limitations of replacing traditional BI tools with Mosaic, and where MotherDuck stands in the modern data stack landscape, especially for organizations that don't require the scale of BigQuery or Snowflake. Plus, get a sneak peek into the upcoming Small Data Conference in San Francisco on September 23rd, where we'll explore how small data solutions can address significant problems without relying on big data. Don't miss this episode packed with insights on DuckDB and MotherDuck innovations!Small Data SF Signup  Discount Code: MATSON100What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Aug 2, 2024 • 45min

Sovereign AI, Redpanda vs Apache Kafka, The Future of Data Streaming with Alex Gallego (CEO of Redpanda)

In this engaging discussion, Alex Gallego, CEO of Redpanda and former motorcycle and tattoo machine builder, shares his journey from childhood inventions to tech innovations. He dives into the revolutionary shift from batch processing to real-time data streaming with Redpanda, highlighting its cost and performance benefits over Apache Kafka. Alex emphasizes the importance of data sovereignty and the 'Bring Your Own Cloud' approach. He also discusses emerging trends like Sovereign AI, which prioritize data control for businesses and developers, reshaping the future of data infrastructure.
undefined
Jul 12, 2024 • 26min

Secrets Management Simplified: Insights from Doppler's Brian Vallelunga

Imagine losing your most important digital keys and leaving your entire kingdom vulnerable to attacks. In this episode, we promise to equip you with the knowledge to prevent such disasters. Join us as we sit down with Brian Vallelunga, the CEO and founder of Doppler, to unravel the critical importance of secrets management in software development. Brian shares his deep expertise on what secrets are—those crucial digital keys that unlock access to sensitive data—and illustrates through a personal story the severe consequences of failing to protect them. Discover how data breaches can wreak havoc, leading to brand reputation damage, customer churn, legal battles, and even personal distress.But it’s not all doom and gloom. Brian introduces us to Doppler, a game-changing tool that simplifies the tedious process of secrets management, making it an integral part of the modern development workflow. Learn how Doppler empowers developers to secure sensitive data efficiently, eliminating common headaches like managing environment files and manual secret updates. We also delve into practical implementation timelines, showing that effective secrets management is achievable for companies of all sizes with the right tools. Brian provides actionable advice for engineering teams on securing secrets within applications and highlights valuable resources for further learning. Tune in to safeguard your company’s digital assets and fortify your secrets management strategy.Follow Brian on:doppler.comX (Twitter) - @vallelungabrianWhat's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app