Hey Zach! This is awesome! I just subscribed. I took math classes with you at Weber State, but I majored in physics, went into systems engineering, now technical product management, but I keep finding myself going into data engineering, now here I am, excited to learn! Your roadmaps are great and straightforward.
I’m just starting a portfolio project at a venture studio. Open-source data, 18 CSV sheets with a million rows each, dozen columns, needing to join to another 8.7 million rows and dozen columns, and quickly realized I need to learn how to handle and use that much data, and remembered you haha
With machine learning being adopted across business functions, data quality assessment and improvement will be one of the most important tools in the developers toolkit.
Plus reverse scd2 backfill which I did recently.Something like prod scd2 daily run will be done daily and then rest of the time older files in reverse order will be loaded
I was just recently promoted to a Jr Data Engineer position and will definitely be utilizing the info given here. Amazing work 🙌
Great for new comers and beginners.
I wish substack can add code block support soon.
Yeah the code block is really weak right now!
Hey Zach! This is awesome! I just subscribed. I took math classes with you at Weber State, but I majored in physics, went into systems engineering, now technical product management, but I keep finding myself going into data engineering, now here I am, excited to learn! Your roadmaps are great and straightforward.
I’m just starting a portfolio project at a venture studio. Open-source data, 18 CSV sheets with a million rows each, dozen columns, needing to join to another 8.7 million rows and dozen columns, and quickly realized I need to learn how to handle and use that much data, and remembered you haha
Amazing content. Thank you so much! You haven't mentioned anything about cloud services here. Can you please tell us why?
Because cloud services aren't fundamental to data engineering. If you learn these skills, the cloud services are EXTREMELY easy to add.
amazing work
Material excelente e direcionador para quem está iniciando!
Thanks for the detailed information
Thanks for the write up.
With machine learning being adopted across business functions, data quality assessment and improvement will be one of the most important tools in the developers toolkit.
Great roundup to jump start on DE roles for new beginners!
Plus reverse scd2 backfill which I did recently.Something like prod scd2 daily run will be done daily and then rest of the time older files in reverse order will be loaded