Note: if you’ve seen the list elsewhere, it was probably me. I first posted this list on Data Engineering Discord and Data Engineer Cafe.
Books
Data fundamentals (good entrypoint)
- Fundamentals of Data Engineering - Joe Reis & Matt Housley
- Seven Databases in Seven Weeks - Luc Perkins & Eric Redmond & Jim Wilson
- Designing Data-Intensive Applications - Martin Kleppmann
- The Data Warehouse Toolkit - Ralph Kimball & Margy Ross
- Data Science for Business - Foster Provost & Tom Fawcett
- Practical Statistics for Data Scientists - Peter Gedeck & Peter Bruce & Andrew Bruce
Software engineering
- Python Crash Course - Eric Matthes
- The Pragmatic Programmer - Andrew Hunt & David Thomas
Platform
- Terraform: Up & Running - Yevgeniy Brikman
Management
- Team Topologies - Matthew Skelton & Manuel Pais
- Radical Candor - Kim Scott
- Data Teams - Jesse Anderson
- Practical DataOps - Harvinder Atwal
Resources
- https://brendanthompson.com/posts/2021/11/my-terraform-development-workflow
- https://www.terraform-best-practices.com/
- https://github.com/open-guides/og-aws
- https://awesomedataengineering.com/
- https://github.com/opendatadiscovery/awesome-data-catalogs
- https://github.com/datastacktv/data-engineer-roadmap
- https://www.moderndatastack.xyz/stacks
- https://www.secoda.co/glossary
- https://www.gentlydownthe.stream/
- https://b-greve.gitbook.io/beginners-guide-to-clean-data/