Seattle Data Guy

Data engineering isn't easy. It's not just about writing SQL queries and calling it a day.

There is...

- Constantly changing data infrastructure, like Hadoop, Flink, Snowflake and Databricks
- Data pipelines that break not because of their design but because the source systems change without telling you
- Data governance and compliance, ensuring that data handling meets stringent legal and security standards
- A crazy number of orchestration and ETL tools to choose from
- Mastering query optimization, indexing strategies, and data partitioning to efficiently handle large volumes of data
- Understanding multiple disciplines and technologies, including networking, system architecture, SFTP, etc.
- Trying to implement business logic in the pipeline to fix issues that come from the application

This barely scratches the surface.

I'm not saying this to complain, by the way. I love data engineering.

If you'd like to learn more about data engineering and infra you can check out my newsletter here - seattledataguy.substack.com/
