The Data Exchange with Ben Lorica cover image

The Data Exchange with Ben Lorica

Latest episodes

undefined
May 11, 2023 • 36min

Boosting Perception With Synthetic Data

Omar Maher is Director of Product Marketing at Parallel Domain, a startup that is advancing machine perception capabilities by harnessing the power of synthetic data. We delve into the growing adoption of synthetic data and the factors driving its use. We discuss major developments in synthetic data generation and its overlap with Generative AI. The conversation also covers data privacy, intellectual property, the generation of structured data like LiDAR, the current state of adoption, and key research directions to overcome existing challenges.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
20 snips
May 4, 2023 • 43min

Revolutionizing B2B: Unleashing the Power of AI and Data

Simon Chan is the General Partner at Firsthand Alliance, a venture capital fund focused on the future of B2B and enterprise software. We explore the evolution of AI, cloud computing, and business collaboration tools, revealing how a new generation of generative AI technologies is enabling applications to generate content and drive transformative innovation across various industries.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Apr 27, 2023 • 32min

AI Metadata

Gev Sogomonian is co-author of AimStack, an open-source, self-hosted AI metadata tracker that logs all your AI metadata, such as experiments and prompts, and provides a user-friendly UI for comparing and observing them. It also offers an SDK for programmatically querying tracked metadata.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Apr 20, 2023 • 44min

The 2023 AI Index

Raymond Perrault is a Distinguished Computer Scientist at SRI International, and Co-Director of the Steering Committee for the AI Index, an annual report that tracks, collates, distills, and visualizes data relating to AI, to help inform decision-makers and teams to take meaningful action for responsible and ethical AI. Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Apr 13, 2023 • 38min

Custom Foundation Models

Hagay Lupesko, is VP Engineering at MosaicML, a startup that enables teams to easily train large AI models on their data and in their own secure environment. We discuss the the evolution of cloud based machine learning (from “traditional” ML through LLMs), his experience building machine learning applications at leading technology companies, and the need for companies to build their own custom foundation models.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Apr 6, 2023 • 49min

Uncovering and Highlighting AI Trends

Jakub Zavrel is the Founder and CEO at Zeta Alpha, a premier Neural Discovery Platform that utilizes cutting-edge Neural Search technology to enhance the way you and your team uncover, arrange, and disseminate knowledge. Our conversation focuses on the latest developments in artificial intelligence, taking inspiration from their recent viral article featuring the top the 100 most cited AI papers of 2022.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Mar 30, 2023 • 49min

How Data and AI Happened

Chris Wiggins is a Professor at Columbia University and the Chief Data Scientist at the NYTimes.  He is also co-author of a fascinating new historical exploration of how data has been used as a tool in shaping society, from the census to eugenics to Google search. How Data Happened traces the trajectory of data and explores new mathematical and computational techniques that serve to shape people, ideas, society, and economies.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Mar 23, 2023 • 31min

Blazing fast bulk data transfers between any cloud

Paras Jain and Sarah Wooders are graduate students at UC Berkeley’s Sky Computing Lab. They are part of the team behind Skyplane, and open source project that accelerates wide-area transfers in the cloud via overlay routing and parallelism. Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Mar 16, 2023 • 33min

Exhaustion of High-Quality Data Could Slow Down AI Progress in Coming Decades

Pablo Villalobos is a Staff Researcher at  Epoch, and lead author of the recent paper “Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning”.  We discuss the key findings in this paper, as well as a related study Pablo conducted on scaling laws. Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.
undefined
Mar 9, 2023 • 36min

Generating high-fidelity and privacy-preserving synthetic data

Jinsung Yoon (Senior Research Scientist) and Sercan Arik (Staff Research Scientist and Manager) are part of the Google team behind EHR-Safe,  a set of tools for generating highly realistic and privacy-preserving synthetic Electronic Health Records.Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode