a16z Podcast

Reining in Complexity: Data Science & Future of AI/ML Businesses

Aug 21, 2020
In this discussion, Martin Casado, a general partner at Andreessen Horowitz and expert in AI/ML economics, and Peter Wang, co-founder of Anaconda, dig into the complexities of data science. They explore the notion that data is a fluid concept, akin to metaphysics. The conversation covers the difficulties of data management, the shift from traditional systems to newer methodologies, and the implications for organizational structures and software business models.
INSIGHT

Data Landscape Diversity

  • The data landscape has two extremes: data warehouse maximalists (SQL can do everything) and Hadoop refugees (complex computations require Python/R).
  • Peter Wang argues for a heterogeneous approach, acknowledging SQL's role while emphasizing the diversity of data and tools.
ANECDOTE

Shadow Data Management

  • Peter Wang describes "shadow data management," where companies rely on unofficial copies of data because their databases are too slow.
  • As an example, at one bank a million-dollar Oracle database was dumped into a CSV file so it could be analyzed faster in Python or Java.
INSIGHT

Information Systems Deconstruction

  • Peter Wang argues against the rigid division of information systems into hardware, software, and data.
  • He points out that this separation arose from differing innovation costs rather than any fundamental law, and that it shapes which tools businesses choose.