Pinned · Sruthi Sree Kumar in Big Data Processing · Apache Kafka · Apache Kafka is an open-source, distributed event streaming platform. Apache Kafka allows the decoupling of data streams and processing… · Jul 30, 2021
Sruthi Sree Kumar in Big Data Processing · Introduction to Big Data · Big Data is data characterized by four key attributes, also known as the 4 V’s. · Sep 26, 2022
Sruthi Sree Kumar in Big Data Processing · Observability in Distributed Systems: Logs, Metrics, and Traces · Observability is the ability to measure the internal states of a system by examining its outputs. Logs, Metrics, and Traces are considered… · Sep 15, 2022
Sruthi Sree Kumar · Managed Key State in Flink · State is an important concept in Apache Flink. Flink supports both stateful and stateless computation. Two basic types of states in… · Nov 21, 2021
Sruthi Sree Kumar in Big Data Processing · CAP Theorem · The CAP (Consistency, Availability, Partition Tolerance) theorem is one of the fundamental theorems in distributed systems. CAP theorem, also… · Aug 3, 2021
Sruthi Sree Kumar in Big Data Processing · Time Attributes in Apache Flink · One of the major differences between stream and batch processing is the need to explicitly handle time in stream processing. In a stream… · Jul 9, 2020
Sruthi Sree Kumar in Big Data Processing · Windowing in Apache Flink · Windowing is a key feature in stream processing systems such as Apache Flink. Windowing splits the continuous stream into finite batches… · Jul 8, 2020
Sruthi Sree Kumar in Big Data Processing · Vector Clocks · Like Lamport’s clock, a vector clock is a logical clock used to assign timestamps to events in a distributed system. Vector… · May 17, 2020
Sruthi Sree Kumar in Big Data Processing · Global Snapshot, Chandy Lamport Algorithm & Consistent Cut · A global snapshot, or global state, consists of the local state of each process in the distributed system along with the in-transit messages… · May 17, 2020
Sruthi Sree Kumar in Big Data Processing · Lamport Timestamps · The clock is an important building block in cloud computing and distributed systems. A clock is needed to maintain the order of… · May 13, 2020