Like Lamport’s Clock, Vector Clock is also a logical clock, which is used to assign timestamps for events in a distributed system…
KeyBy is one of the mostly used transformation operator for data streams. It is used to partition the data…
This is the first of a two-part series about getting started with Luigi. The second…
Windowing is a key feature in stream processing systems such as Apache Flink. Windowing splits the continuous…
Flink has a powerful functional streaming API which let application developer specify high-level functions for data transformations. Applications developers can choose different transformations.
The Clock is an important building block in cloud computing systems and distributed systems. Clock is important to maintain…
These were the top 10 stories published by Big Data Processing; you can also dive into yearly archives: 2020, 2021, 2022, and 2023.