Data Engineering Podcast cover image

Supporting And Expanding The Arrow Ecosystem For Fast And Efficient Data Processing At Voltron Data

Data Engineering Podcast

00:00

The Impact of Spark on Aero on Engineering Productivity and Computer Efficiency

Aero project aims to make Python on Spark a lot faster. One of the initial motivations for aero was to cut down on some of the inefficiencies of that data interchange. By defining a column oriented data format, which could be constructed on the JVM side inside the Spark runtime and then sent over to the Python side,. we were able to make custom code running in PySpark 10 to 100 times faster in some cases.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Game Changer
Gpeeps78
App Store
I cannot recommend this app enough. It belongs in my top three AI apps. It’s that good!
The game changer for learning from podcasts!
Nelson
App Store
I used to use a different app that was able to save excerpts from podcast and really enjoyed it. I could listen to the podcast and quickly save things that I wanted to come back to later. Snipd take this to a whole new level with AI integration, creating summaries of podcasts and summarizing the main takeaways from what I’ve saved and snipped. I really love how it helps me prioritize what podcast to listen to with it summaries & deep dives.