Note: if you’ve seen the list elsewhere, it was probably me. I first posted this list on Data Engineering Discord and Data Engineer Cafe.

Books

Data fundamentals (good entrypoint)

  • Fundamentals of Data Engineering - Joe Reis & Matt Housley
  • Seven Databases in Seven Weeks - Luc Perkins & Eric Redmond & Jim Wilson
  • Designing Data-Intensive Applications - Martin Kleppmann
  • The Data Warehouse Toolkit - Ralph Kimball & Margy Ross
  • Data Science for Business - Foster Provost & Tom Fawcett
  • Practical Statistics for Data Scientists - Peter Gedeck & Peter Bruce & Andrew Bruce

Software engineering

  • Python Crash Course - Eric Matthes
  • The Pragmatic Programmer - Andrew Hunt & David Thomas

Platform

  • Terraform: Up & Running - Yevgeniy Brikman

Management

  • Team Topologies - Matthew Skelton & Manuel Pais
  • Radical Candor - Kim Scott
  • Data Teams - Jesse Anderson
  • Practical DataOps - Harvinder Atwal

Resources