
Data Engineering Podcast

Data Sharing Across Business And Platform Boundaries

Feb 11, 2024
Data sharing across business and platform boundaries is complex due to business rules, regulations, and technical considerations. Andrew Jefferson discusses building a robust system for data sharing, the techno-social considerations, and the Bobsled platform that aims to simplify the process. Topics include challenges of data sharing across cloud platforms, boundaries in data transfer systems, innovative applications of data sharing, shift left and shift right mentality, and the lack of AI and vector database solutions.
59:56

Podcast summary created with Snipd AI

Quick takeaways

  • Building a unified data sharing solution across different cloud platforms is complex due to their unique abstractions and limitations.
  • Careful evaluation of the need for data sharing is important, and data clean rooms or native tools can be more appropriate in certain scenarios.

Deep dives

Data Sharing Challenges and Abstractions

Building an abstraction over different cloud systems is a major challenge in data sharing. Each platform has its own abstractions and limitations, which makes a unified solution difficult to create. The devil is in the details: the clouds look alike at first glance, but their differences emerge once they have to be managed side by side. For example, AWS offers access points for S3, a feature with no counterpart in Google Cloud Storage. Executing serverless functions likewise requires its own abstraction across clouds. These nuanced differences in how shared data is stored and accessed make it hard to build a cohesive cross-platform solution.
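To make the abstraction problem concrete, here is a minimal Python sketch of a platform-neutral sharing interface. It is illustrative only and does not describe how Bobsled or any cloud SDK works: `Grant`, `ShareBackend`, `S3Backend`, `GcsBackend`, and `share_everywhere` are hypothetical names, and the backends only record which mechanism they would use rather than calling any cloud API.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass(frozen=True)
class Grant:
    """Platform-neutral record of a read grant (hypothetical type)."""
    platform: str
    mechanism: str
    resource: str
    consumer: str


class ShareBackend(ABC):
    """Interface each platform adapter must satisfy (hypothetical)."""

    @abstractmethod
    def grant_read(self, bucket: str, prefix: str, consumer: str) -> Grant:
        """Describe how `consumer` would get read access to `bucket/prefix`."""


class S3Backend(ShareBackend):
    def grant_read(self, bucket: str, prefix: str, consumer: str) -> Grant:
        # Models a per-consumer access point; no AWS call is made here.
        return Grant("aws", "access_point", f"{bucket}/{prefix}", consumer)


class GcsBackend(ShareBackend):
    def grant_read(self, bucket: str, prefix: str, consumer: str) -> Grant:
        # Assumption: with no access points available, fall back to a
        # bucket-level permission. No Google Cloud call is made here.
        return Grant("gcp", "bucket_permission", f"{bucket}/{prefix}", consumer)


def share_everywhere(backends, bucket, prefix, consumer):
    """Apply the same logical share to every backend, one grant per platform."""
    return [b.grant_read(bucket, prefix, consumer) for b in backends]


grants = share_everywhere([S3Backend(), GcsBackend()], "sales", "2024/", "partner-a")
for g in grants:
    print(g.platform, g.mechanism, g.resource, g.consumer)
```

The caller asks for one logical share, and each adapter translates it into whatever mechanism its platform offers. The per-platform details stay in the adapters, which is where the differences described above would pile up in a real system.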
