Invest Like the Best with Patrick O'Shaughnessy

Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]

12 snips
Jan 28, 2021
Ali Ghodsi, Founder and CEO of Databricks and expert in big data, dives into the evolution of data infrastructures and its transformative impact on businesses. He shares insights on the creation of Apache Spark, discussing its role in solving data processing challenges. Ghodsi emphasizes the importance of leveraging vast datasets for predictive analytics and the collaboration behind groundbreaking innovations at Berkeley's AMP Labs. He also reflects on the future of AI and data management, particularly in healthcare, underscoring its potential to revolutionize early cancer detection.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Moore's Wall and Big Data

  • Around 2000, "Moore's Wall" was hit, as computer speeds stagnated, leading to a shift towards data centers.
  • This, combined with cheaper storage and increased internet users, fueled the first phase of the "big data" revolution.
INSIGHT

Moving Compute to Data

  • Early big data processing relied on moving computation to data due to network limitations.
  • This was solved by innovations like MapReduce, which processed data locally and then aggregated results.
INSIGHT

Network Virtualization

  • Network technology advancements and techniques like those developed at UCSD virtualized the network.
  • This eliminated the need to move code close to the data, enabling faster processing.
Get the Snipd Podcast app to discover more snips from this episode
Get the app