Sliding Window Algorithm Approach

Linux School Tech
8 min readJul 12, 2024

--

The sliding window algorithm is a technique used to solve problems by defining a window or frame on the input data (such as arrays or strings) and then moving this window over the data to perform operations within the window. This technique is commonly used in algorithms such as finding subarrays with a specific sum, finding the longest substring with unique characters, or solving problems that require a fixed-size window to efficiently process elements.

What other types of problems can they use, including the sliding window algorithm?

The sliding window algorithm is a versatile and powerful technique for solving problems that involve examining and processing consecutive segments of data. It is particularly useful in scenarios where specific conditions need to be evaluated over sliding windows of data. This algorithm has found widespread applications in various domains, including:

Array and String Processing

  • Subarray/Substring Identification: The sliding window algorithm can be employed to identify subarrays or substrings that satisfy certain criteria, such as a specific sum or a unique character count.
  • Finding Long or Short Sequences: This algorithm is effective in locating long or short sequences that meet specific conditions, such as the maximum or minimum sum within a given window.

Data Compression

  • Window-based Data Compression: Data compression algorithms like LZ77 and its variants utilize a sliding window to detect recurring patterns in the input data and replace them with references to previous occurrences.

Image Processing

  • Feature Extraction, Object Detection, and Image Segmentation: In image processing, the sliding window technique can be applied for tasks like feature extraction, object detection, and image segmentation.

Signal Processing

  • Time Series Data Analysis: Time series data can be analyzed using the sliding window approach to identify patterns, trends, or local anomalies.

Network Protocols

  • Window-based Protocols in Computer Networks: Window-based protocols are employed for reliable and efficient data transmission in computer networks. The sender and receiver maintain a window of allowed sequence numbers to manage the data flow.

These diverse applications demonstrate the versatility and power of the sliding window algorithm in addressing a wide range of problems that require examining and processing sequential segments of data.

Key Features of the Sliding Window Algorithm

  • Resource Management: The sliding window algorithm is specifically designed for resource management while processing data streams. It involves controlling the amount of data held in memory at any given time to prevent system overload.
  • Data Stream Processing: This algorithm operates on data streams, meaning data arrives sequentially and continuously, eliminating the need to store the entire dataset in memory. This reduces memory consumption and improves system performance.
  • Sliding Window Mechanism: In this model, the function of interest is applied to a fixed-size window of data in the stream. As the data stream progresses, one item is removed from the end of the window and a new item is added. This mechanism facilitates better resource management and enables real-time data processing.

Practical Applications

  • Network Traffic Control: In computer networks, the sliding window algorithm is employed for traffic control and congestion management between the sender and receiver. This is particularly important in data transfer protocols like TCP, which use congestion windows to determine the number of bytes sent at a time.
  • Big Data Processing: In the realm of big data processing, such as analyzing internet-scale data (Big Data), the sliding window algorithm has emerged as an effective method for processing data in real-time without the need to store the entire dataset in memory.

Practical Example

Finding the Maximum Sum of Five Consecutive Numbers

To illustrate the sliding window algorithm with a practical example, consider the problem of finding the maximum sum of five consecutive numbers in a list. This problem can be applied in various programming scenarios, such as data analysis, game development, and even smart device programming.

Algorithm

  1. Check for Insufficient Elements: If the list contains fewer than five elements, return an appropriate message indicating that the maximum sum cannot be determined due to the lack of five consecutive numbers.
  2. Outer Loop for Window Movement: Initiate an outer loop that iterates from the beginning of the list until three elements before the end. This ensures that the window of size five can move across the entire list.
  3. Inner Loop for Sum Calculation: Within the outer loop, implement an inner loop that iterates from the current position of the outer loop to the fourth element ahead. This calculates the sum of the five elements within the current window.
  4. Update Maximum Sum: Compare the current window sum to the previously recorded maximum sum. If the current window sum is greater, update the maximum sum variable with the new value.
  5. Continue Window Movement: The outer loop continues iterating, moving the window one position forward with each iteration. This process continues until the entire list has been traversed.
def max_sum_of_five_elements(lst):
n = len(lst)
if n < 5:
return "List has fewer than 5 elements"

max_sum = float('-inf') # Initialize maximum sum with negative infinity

for i in range(0, n - 4): # Iterate through the list starting from index 0 up to n-5
current_sum = sum(lst[i:i+5]) # Calculate the sum of the current window
max_sum = max(max_sum, current_sum) # Update the maximum sum if necessary

return max_sum

# Example usage
lst = [5, 7, 1, 4, 3, 6, 2, 9, 2]
print(max_sum_of_five_elements(lst)) # Output: 21

max_sum = float(‘-inf’) : This line initializes a variable named max_sum with the value negative infinity (float('-inf')). This value will be used to store the maximum sum found so far.

In Python, float(‘-inf’) represents negative infinity on the floating-point number scale. It’s a special constant value used in calculations involving floating-point numbers.

Figure 1

The function aims to find the maximum sum of five consecutive elements in a list. To identify the maximum, it needs a starting point for comparison. Numbers themselves can’t be the starting point because you need to compare elements within the list.

Negative infinity acts as a baseline value that is guaranteed to be less than any valid sum of elements in the list. By initializing max_sum with float(‘-inf’), you ensure that any sum encountered during the loop will be greater than the starting point.

max_sum = float('-inf')  # Initialize maximum sum with negative infinity

for i in range(0, n - 4):: This line initiates a for loop that iterates through the list lst using a range from 0 to n - 4. This ensures the loop doesn't go out of bounds when accessing elements within the window of size 5.

Inside the loop, the code calculates the sum for each window of five elements using current_sum The max function is then used to compare current_sum with the current value of max_sum always holds the largest sum encountered so far.

for i in range(0, n - 4):  # Iterate through the list starting from index 0 up to n-5
current_sum = sum(lst[i:i+5]) # Calculate the sum of the current window
max_sum = max(max_sum, current_sum) # Update the maximum sum if necessary

a step-by-step illustration of the loop for

Initialization (Outside the Loop)

Figure 2
  • n = len(lst): This line calculates the length of the list lst which is 9.
  • Since n is 9, the loop will iterate from i = 0 to i = n - 4 = 5 (exclusive of 5).
  • max_sum = float('-inf'): This variable is initialized with negative infinity to ensure any sum encountered in the loop will be greater for comparison.

Loop Iteration 1 (i = 0)

  • current_sum = sum(lst[i:i+5]): This calculates the sum of the first window of elements, which is lst[0:0+5] = [5, 7, 1, 4, 3]. The sum is current_sum = 20.
Figure 3
  • max_sum = max(max_sum, current_sum): Since max_sum is negative infinity initially, max will update max_sum to 20.

Loop Iteration 2 (i = 1)

  • current_sum = sum(lst[i:i+5]): This calculates the sum of the second window, which is lst[1:1+5] = [7, 1, 4, 3, 6]. The sum is current_sum = 21.
Figure 4
  • max_sum = max(max_sum, current_sum): Since max_sum is already 20 from the previous iteration, max will keep max_sum as 20 because 21 is greater but not an update.

Loop Iteration 3 (i = 2)

  • current_sum = sum(lst[i:i+5]): This calculates the sum of the third window, which is lst[2:2+5] = [1, 4, 3, 6, 2]. The sum is current_sum = 16.
Figure 5
  • max_sum = max(max_sum, current_sum): Since max_sum is still 20, max will keep max_sum as 20 because 16 is smaller.

Loop Iteration 4 (i = 3)

  • current_sum = sum(lst[i:i+5]): This calculates the sum of the fourth window, which is lst[3:3+5] = [4, 3, 6, 2, 9]. The sum is current_sum = 24.
Figure 6
  • max_sum = max(max_sum, current_sum): Since max_sum is 20, max will update max_sum to 24 because 24 is greater.

Loop Iteration 5 and Beyond

The loop continues iterating for i = 4 (up to i = n-4 = 5 before stopping). However, since the window size is fixed at 5 and the list ends at index 8, these remaining iterations will not be able to create a complete window of size 5. Therefore, they won't affect the max_sum value which is already 24 (the maximum sum found so far).

Figure 7

After the Loop

The loop finishes iterating, and the final value of max_sum is returned, which is 24 (the maximum sum of five consecutive elements in the list).

In this example, the sliding window algorithm is used to find the largest sum of five consecutive numbers in a list. Using this method, we can efficiently solve this problem with O(n) time complexity, which may be faster and more efficient compared to other methods.

Figure 8

Read More

My YouTube Channel

More shell script videos and linux tutorials on my YouTube Channel.

--

--