What I will remember from Fundamentals of Data Engineering book
Recently I got the chance to read the Fundamentals of Data Engineering book . There are several things that I want to remember and I think the easiest way to do that is to write them down here.
The book centers around, so called big idea of Data Engineering Lifecycle . Below is the schema of the lifecycle that this book presented.
The entire book is focused around the key points from the screenshot.
- Generation
- Ingestion
- Transformation
- Serving
- Storage
All of this is well described in each chapter. Also each chapter always mentioned and the undercurrents for each key stage of the lifecycle.
Let’s explore what I will remember for each point. If you want to know everything with all small details, I’m recommending buying the book or if you use O’reilly books subscription give it a try.
Generation
Generation is about how the data is created. What is provided in the book is that we have analog data creation ( vocal speech, writing on papper , etc. ) and digital data , which can be the conversion of the analog data to digital or it is native data from digital service.