What's New In Data cover image

What's New In Data

Latest episodes

undefined
Nov 21, 2024 • 40min

From the Marines to Data Engineering with Alexander Noonan (Dagster Labs)

Alex Noonan, a developer advocate at Dagster, shares his fascinating journey from Marine aircraft mechanic to data engineering guru. He discusses the challenges of social media shifts post-Elon Musk's Twitter takeover, exploring how platforms like Blue Sky are changing community dynamics. Alex offers insights on networking strategies across LinkedIn, Reddit, and more, emphasizing the relevance of tailored content. He also highlights AI's transformative role in streamlining data processes, showcasing innovations like Dagster that enhance the day-to-day workflow of data professionals.
undefined
Oct 31, 2024 • 40min

Leveraging Data and AI in your Go-to-Market Strategy with Everett Berry from Clay

Everett Berry joins the show again to share fresh perspectives on applying data in your sales and go-to-market strategy. We talk about his journey to a key role at Clay, where he leads Go-To-Market Engineering to tackle the complexities of serving and enhancing data for sales. Everett discusses how Clay's tools improve data accuracy and reach, helping businesses streamline revenue operations with smarter data use.In this episode, we also dive into AI's impact on sales operations and revenue processes. Everett and John explore how AI agents and human teams interact, the integration of customer data platforms with CRMs, and the merging of RevOps and data teams. Picture a future where AI autonomously handles data tasks, changing how teams collaborate and redefining organizational roles. For anyone following the fast-paced world of sales tech, this conversation offers a forward-looking view on autonomous data management and its potential to transform business practices.What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Oct 25, 2024 • 50min

From Apache Kafka to PostgreSQL, PostgreSQL maturity and extensions, and building on PostgreSQL with Gwen Shapira (CPO at Nile)

What does it take to go from leading Kafka development at Confluent to becoming a key figure in the PostgreSQL world? Join us as we talk with Gwen Shapira, co-founder and chief product officer at Nile, about her transition from cloud-native technologies to the vibrant PostgreSQL community. Gwen shares her journey, including the shift from conferences like O'Reilly Strata to PostgresConf and JavaScript events, and how the Postgres community is evolving with tools like Discord that keep it both grounded and dynamic.We dive into the latest developments in PostgreSQL, like hypothetical indexes that enable performance tuning without affecting live environments, and the growing importance of SSL for secure database connections in cloud settings. Plus, we explore the potential of integrating PostgreSQL with Apache Arrow and Parquet, signaling new possibilities for data processing and storage.At the intersection of AI and PostgreSQL, we examine how companies are using vector embeddings in Postgres to meet modern AI demands, balancing specialized vector stores with integrated solutions. Gwen also shares insights from her work at Nile, highlighting how PostgreSQL’s flexibility supports SaaS applications across diverse customer needs, making it a top choice for enterprises of all sizes.Follow Gwen on:Nile BlogX (Twitter)LinkedInNile DiscordWhat's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Oct 10, 2024 • 14min

Shifting Data Quality Left, New O'Reilly Book, and Data Contracts with Chad Sanderson and Mark Freeman from Gable

Join us as we catch up with Chad Sanderson and Mark Freeman from Gable, live from Big Data London.  Discover Chad's insights from his well-attended talk and why the data scene in London has everyone buzzing. We're diving deep into the  concept of shifting data quality left, ensuring upstream data producers are as invested in data governance, privacy, and quality as their downstream counterparts. Chad and Mark also give us a sneak peek into their upcoming O'Reilly book on Data Contracts, complete with the charming Algerian racer lizard as its symbolic mascot. In this engaging conversation, Chad and Mark offer practical advice for data operators ready to embark on the journey of data contracts. They emphasize the importance of starting small and nurturing a strong cultural initiative to ensure success. Listen as they share strategies on engaging leadership and fostering a collaborative environment, providing a framework not just for implementation but also for securing leadership buy-in. This episode is packed with expert advice and real-world experiences that are a must-listen for anyone in the data field.John Kutay chimes in with examples of innovative data operators such as George Tedstone deploying Data Contracts at National Grid. Data Contracts and shifting data quality left will certainly be an area that many data teams prioritize as their workloads become increasingly operational. Download a preview of 'Data Contracts' here.Learn more about Gable.Follow Chad Sanderson on LinkedIn.Follow Mark Freeman on LinkedIn.What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Oct 4, 2024 • 24min

Joe Reis at Big Data LDN

Join us as we sit down with Joe Reis, live at Big Data LDN (London) 2024. Joe shares his partnership with DeepLearning.ai and AWS through his new course on Data Engineering. Joe's new course promises to elevate your data skills with hands-on exercises that marry foundational knowledge with cutting-edge practices. We dive into how this course complements his seminal book, "Fundamentals of Data Engineering," and why certification is valuable for those looking for foundational, hands-on knowledge to be a data practitioner. But that's not all; we also dissect the hurdles of adopting modern data architectures like data mesh in traditionally siloed companies. Using Conway's Law as a lens, Joe discuss why businesses struggle to transition from outdated infrastructures to decentralized systems and how cross-disciplinary skills—a concept inspired by mixed martial arts—are crucial in this endeavor as he cleverly calls it 'Mixed Model Arts'. Check out Joe's Work: Fundamentals of Data Engineering bookNew Coursera courses by Joe ReisWhat's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Sep 27, 2024 • 36min

Is Text-to-SQL Ready for Prime Time? Insights from Ethan Ding, CEO of TextQL

Can AI really make your data analysis as easy as talking to a friend? Join us for an enlightening conversation with Ethan Ding, the co-founder and CEO of TextQL, as he shares his journey from Berkeley graduate to pioneering the text-to-SQL technology that's transforming how businesses interact with their data. Discover how natural language queries are breaking down barriers, making data analysis accessible to everyone, regardless of technical skill. Ethan delves into the historical hurdles and the game-changing advancements that are pushing the boundaries of AI and large language models in data querying.Ever wondered how the quest for full autonomy in self-driving cars relates to data? We draw fascinating parallels between these two cutting-edge fields, emphasizing the importance of structured systems over chaotic, AI-driven approaches. This chapter reveals the often-overlooked limitations of current data management practices and underscores the critical need for high-quality data and robust modeling. Through a comparison of traditional business intelligence tools and advanced AI-driven solutions, we explore what truly makes data querying effective and insightful.Hear from Ethan Deng, co-founder and CEO of TextQL, as he explains how their innovative tool integrates seamlessly with existing BI infrastructures, boosting productivity without the need for disruptive overhauls. Tune in to find out how TextQL is making data-driven decisions faster and smarter, paving the way for a future where data is everyone's best friend.Follow Ethan Ding and TextQL at:  Ethan's LinkedIn: @TheEthanDingEthan's Twitter: @TheEthanDingTextQL's LinkedIn: @TextQL TextQL's Twitter: @TextQL TextQL.comWhat's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Sep 19, 2024 • 42min

Small Data, Big Impact: Insights from MotherDuck's Jacob Matson

What makes MotherDuck and DuckDB a game-changer for data analytics? Join us as we sit down with Jacob Matson, a renowned expert in SQL Server, dbt, and Excel, who recently became a developer advocate at MotherDuck. During this episode, Jacob shares his compelling journey to MotherDuck, driven by his frequent use of DuckDB for solving data challenges. We explore the unique attributes of DuckDB, comparing it to SQLite for analytics, and uncover its architectural benefits, such as utilizing multi-core machines for parallel query execution. Jacob also sheds light on how MotherDuck is pushing the envelope with their innovative concept of multiplayer analytics.Our discussion takes a deep dive into MotherDuck's innovative tenancy model and how it impacts database workloads, highlighting the use of DuckDB format in Wasm for enhanced data visualization. Jacob explains how this approach offers significant compression and faster query performance, making data visualization more interactive. We also touch on the potential and limitations of replacing traditional BI tools with Mosaic, and where MotherDuck stands in the modern data stack landscape, especially for organizations that don't require the scale of BigQuery or Snowflake. Plus, get a sneak peek into the upcoming Small Data Conference in San Francisco on September 23rd, where we'll explore how small data solutions can address significant problems without relying on big data. Don't miss this episode packed with insights on DuckDB and MotherDuck innovations!Small Data SF Signup  Discount Code: MATSON100What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Aug 2, 2024 • 45min

Sovereign AI, Redpanda vs Apache Kafka, The Future of Data Streaming with Alex Gallego (CEO of Redpanda)

In this engaging discussion, Alex Gallego, CEO of Redpanda and former motorcycle and tattoo machine builder, shares his journey from childhood inventions to tech innovations. He dives into the revolutionary shift from batch processing to real-time data streaming with Redpanda, highlighting its cost and performance benefits over Apache Kafka. Alex emphasizes the importance of data sovereignty and the 'Bring Your Own Cloud' approach. He also discusses emerging trends like Sovereign AI, which prioritize data control for businesses and developers, reshaping the future of data infrastructure.
undefined
Jul 12, 2024 • 26min

Secrets Management Simplified: Insights from Doppler's Brian Vallelunga

Imagine losing your most important digital keys and leaving your entire kingdom vulnerable to attacks. In this episode, we promise to equip you with the knowledge to prevent such disasters. Join us as we sit down with Brian Vallelunga, the CEO and founder of Doppler, to unravel the critical importance of secrets management in software development. Brian shares his deep expertise on what secrets are—those crucial digital keys that unlock access to sensitive data—and illustrates through a personal story the severe consequences of failing to protect them. Discover how data breaches can wreak havoc, leading to brand reputation damage, customer churn, legal battles, and even personal distress.But it’s not all doom and gloom. Brian introduces us to Doppler, a game-changing tool that simplifies the tedious process of secrets management, making it an integral part of the modern development workflow. Learn how Doppler empowers developers to secure sensitive data efficiently, eliminating common headaches like managing environment files and manual secret updates. We also delve into practical implementation timelines, showing that effective secrets management is achievable for companies of all sizes with the right tools. Brian provides actionable advice for engineering teams on securing secrets within applications and highlights valuable resources for further learning. Tune in to safeguard your company’s digital assets and fortify your secrets management strategy.Follow Brian on:doppler.comX (Twitter) - @vallelungabrianWhat's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.
undefined
Jun 28, 2024 • 1h 13min

Live from Snowflake Summit: Transforming Data Management Insights with Sanjeev Mohan

What's New in Data's Live Recording from the Salesforce Tower during Snowflake Summit Imagine a world where real-time data processing is the norm, not the exception. In this episode, we bring you a fascinating conversation with Sanjeev Mohan, former VP at Gartner, who unpacks the seismic shifts in the data processing landscape. You'll learn about the convergence of structured and unstructured data, driven by Generative AI, and why streaming is becoming the default method for data processing. Sanjeev highlights the significance of innovations like Iceberg, which create a common table format essential for decision-making across a variety of applications.We then traverse the cutting-edge realm of real-time data streaming platforms, spotlighting technologies and companies such as Materialize and Apache Grid Gain. Sanjeev explains the essential design criteria for these platforms, including scalability, cost performance, and fault tolerance. He also discusses the pivotal role of Kafka and its implementations across major cloud providers. This episode is a treasure trove of insights into how platforms like Snowflake are being utilized beyond their traditional roles to act as streaming databases, redefining the boundaries of data management.In our final segments, we accelerate into the future, examining the rapid advancements in streaming technology and its interplay with AI. Sanjeev reflects on how applications like Tesla and Uber are driving innovation and demonstrates the complexities of handling real-time data replication with tools like Snowpipe Streaming. We also explore the potential for real-time training of Large Language Models (LLMs) and the ever-evolving landscape of data management. Packed with expert analysis and future-forward thinking, this episode is your guide to understanding the groundbreaking technologies shaping the world of data.What's New In Data is a data thought leadership series hosted by John Kutay who leads data and products at Striim. What's New In Data hosts industry practitioners to discuss latest trends, common patterns for real world data patterns, and analytics success stories.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode