
Episode 549: William Falcon Optimizing Deep Learning Models
Software Engineering Radio - the podcast for professional software developers
Lightning and Data Versioning
In certain use cases though, like computer vision, the pre-processing steps are to randomize the image. And you can basically cache that, but some people use this as a way to augment a data set. So if I have 10 images, for example, and I apply 10 random augmentations to each image, I suddenly have a data set of 100 images, which is cool. But in that case, you could keep generating infinite versions of those images, in which case you may not be able to cache them.

Okay. And how about data versioning? What are the challenges, operational or otherwise, related to data versioning?

It's really hard and it just really depends on the industry that
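
To make the augmentation arithmetic above concrete, here is a minimal sketch in plain Python. It is not code from the episode or from any library discussed in it: the function names (`random_augment`, `build_cached_dataset`, `stream_augmented`), the toy 4x4 grayscale "images", and the flip and brightness transforms are all assumptions chosen for illustration. It contrasts a fixed, cacheable set of augmented images with an endless on-the-fly stream.

```python
import random


def random_augment(image, rng):
    """Return a randomly altered copy of a 2-D grayscale image (list of rows)."""
    out = [row[:] for row in image]
    if rng.random() < 0.5:
        out = [row[::-1] for row in out]  # random horizontal flip
    shift = rng.randint(-20, 20)  # random brightness shift
    # Clamp every pixel to the valid 0..255 range after shifting.
    return [[min(255, max(0, p + shift)) for p in row] for row in out]


def build_cached_dataset(images, per_image, seed=0):
    """Apply a fixed number of augmentations up front, so the result can be cached."""
    rng = random.Random(seed)
    return [random_augment(img, rng) for img in images for _ in range(per_image)]


def stream_augmented(images, seed=0):
    """Yield freshly augmented images forever; there is no finite set to cache."""
    rng = random.Random(seed)
    while True:
        yield random_augment(rng.choice(images), rng)


# Ten toy 4x4 "images" with made-up pixel values (hypothetical data).
images = [
    [[(i * 16 + r * 4 + c) % 256 for c in range(4)] for r in range(4)]
    for i in range(10)
]

cached = build_cached_dataset(images, per_image=10)
print(len(cached))  # 10 images x 10 augmentations each
```

The cached variant trades variety for reproducibility: with a fixed seed it always yields the same 100 samples. The streaming variant never repeats on purpose, which is why caching it is not practical.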