Learning Journal

Here is your roadmap for mastering modern data engineering and big data stack with a step-by-step curriculum.





1. Learn Python programming basics. I have a quick jumpstart Python programming course available on Udemy. Take up this course to learn just enough Python for PySpark developers. You can finish it in a weekend with a bunch of coding practices and hands-on exercises. I also have a small project included in this course.





www.udemy.com/course/python-foundation-course/?cou…





2. Once you know Python, you can start learning PySpark programming. I have a separate Usemy course covering PySpark programming from beginners to advanced. This course also includes a small project to help you learn a typical project cycle.





www.udemy.com/course/apache-spark-programming-in-p…





3. The next step is to learn advanced Spark concepts, architecture and critical topics asked in the interviews. This course covers most of the Spark internals and complex interview topics.





www.udemy.com/course/apache-spark-3-beyond-basics/…





4. No data engineering project is complete without incremental processing and learning how to do stream processing. So the next obvious topic is learning how to do stream and incremental data processing. Take this Spark streaming course to master that skill.





www.udemy.com/course/spark-streaming-using-python/…





5. Finally, you must understand the lakehouse architecture and how Databricks allows you to design, develop and deliver lakehouse projects in the cloud. This Udemy course





www.udemy.com/course/master-azure-databricks-for-d…





You can finish all this in a month and start appearing for job interviews.





The above list in the given order is a complete package for modern data engineering using Spark and Databricks stack. It is available to you at less than 2000/- cost on the Udemy platform or free of cost if you have an Udemy subscription.





Last but not the least.

We have a high-potential course for you If you have more than 7-8 years of experience and want to answer performance-tuning questions in your interviews.



Many of my students reported that they could manage all the performance-tuning questions asked by Databricks for their Solution architecture positions. This is the best course to handle the most difficult interview questions and scenarios.





www.scholarnest.in/courses/apache-spark-performanc…



Keep learning and Keep growing.

1 year ago | [YT] | 8