One of the things that I encourage people to do is get the data in as raw a format as possible. So maybe it's Excel, maybe it's CSV, but you store it somewhere and do not touch it at all. And then if you do need to manipulate it, you have an intermediate directory structure where you store that manipulative file. Then another directory structure Where you store the output file and start getting in that discipline process of managing those input and output files.