As you can see, this is a fork of this repository, the code repository for Data Engineering with Python, published by Packt.
One of the first chapters in the book covers how to set up your work environment to follow the rest of the book, code along as you do so and to perhaps try some things for yourself.
Here you can find a docker-compose.yaml to set up the entire environment as described in the book but in Docker containers. This saves you the hassle of doing everything yourself.
I generally liked reading the book and it has certainly helped me, but without mentioning it anywhere it implicitly assumes that you are working on a Debian based Unix system. That's the only setting in which the instructions on setting up the environment work.
Even if I personally work with such an environment, it still took me some time to do everything as described in the book. And indeed I ended up with a working environment (after googling around in some cases, since some information in the book is outdated and this caused some errors).
Since I wanted to get a better understanding of Docker as well after reading the book, I found this to be a rewarding excercise.
tbd
For more content on data engineering and related projects, visit http://www.dominikschauer.com.
