Efficient algorithm to count contiguous subarrays that can form arithmetic progressions

Question

I'm working on a problem where I need to count, for each possible common difference D, the number of contiguous subarrays whose elements can be rearranged to form an arithmetic progression with common difference D.

Problem Description

Given an array arr[1..n] which is a permutation of 1..n, for each D from 1 to n, count the number of contiguous subarrays where the set of elements can be rearranged into an arithmetic progression with common difference D.

Example:

text

arr = [5, 1, 3, 2, 4]  (permutation of 1..5)

Expected results:

D = 1: 5 subarrays ([3,2], [1,3,2], [3,2,4], [1,3,2,4], [5, 1, 3, 2, 4])
D = 2: 3 subarrays ([5,1,3], [1,3], [2,4])
D = 3: 0 subarrays
D = 4: 1 subarray ([5,1])
D = 5: 0 subarrays

I've tried an O(n²) solution that checks all contiguous subarrays, but it's too slow for large n .

You could analyse the prime factor decomposition of differences between subsequent array elements. If the difference between two elements is 12, for example, then they MUST be part of some subsequence with stride 12 and they MAY be part of subsequences with strides 2, 3, 4, and 6. Basically an indexing exercise. — DarthGizka
– DarthGizka, Commented Nov 21 at 7:01
n ≤ 10⁵, and the time limit is 2 seconds. The array is a permutation of 1..n, so all values are distinct and consecutive integers. — YAR
– YAR, Commented Nov 21 at 13:42
The expected result for D = 2 seems to be missing the subsequence [1, 3]. — DarthGizka
– DarthGizka, Commented Nov 22 at 18:36

DarthGizka · Accepted Answer · 2025-11-29 09:41:51Z

TL;DR an efficient algorithm based on factor decomposition of differences that relies on an O(n²) component for counting subsequences, just barely avoiding the timeout for worst-case input (full PoC available as C# fiddle)

Each pair of consecutive values in the sequence has a certain absolute difference d (non-zero because all values are distinct). This means that the pair MUST be part of a contiguous subsequence with stride d and it MAY be part of contiguous subsequences whose strides are equal to the other factors of d. It also means that candidate subsequences with other strides cannot continue across the current pair and must necessarily terminate at the first value of the pair.

Ergo you can sweep across the input sequence looking at each pair of consecutive values in order to collect candidate subsequences as follows:

for each factor f of the absolute difference d between a_i and a_i+i:
- if there is no open candidate sequence with stride f, create one with the tuple (a_i, i)
- append the tuple (a_i+1, i+1) to the open subsequence with stride f
for each open candidate sequence whose stride is not a factor of d:
- count the number of contiguous subsequences that form progressions
- throw the candidate sequence away

Here is the example sequence with the difference factors indicated below the gaps between the values, in separate rows per factor. This shows how the candidate sequences come into being:

The tricky bit is the counting of the contiguous subsequences contained in a collected candidate sequence. This sounds a lot like the overall problem description again, but at this point we are in a position to mine the raw number ore efficiently.

Here is the first of the two candidate sequences for factor/stride 2. It contains tuples (value, index), but values and indices are shown in separate rows for clarity:

5 1 3
0 1 2

If the tuples of the candidate sequence are not sorted by value already (i.e. if they are not collected with the aid of a structure that orders them by value), sort them.

1 3 5
1 2 0

In this form it becomes possible to isolate unbroken runs of values and then mine those for compliant subsequences. The example sequence does not generate candidate sequences with separate runs but some other sequence might generate a stride 2 candidate sequence like this, with a gap between 3 and 7:

1 3 7 9
1 2 0 3

Candidate sequence processing

Remember your current position as base position. Step through the tuples, tracking index minima and maxima. If the value chain is unbroken and the index extrema indicate a compact range of the same size then you have found a compliant subsequence; add 1 to your count and step forward. If the value chain is broken, remember the current position for later (as the potential start of a new unbroken subsequence) and repeat the process starting at base position + 1. When you are done with this unbroken run of values, continue with the next one whose beginning you stored earlier.

This caterpillar movement is square in nature, and a test implementation clocked in at 1.6 seconds for the worst-case input (ordered sequence 1 .. 50000 with N * (N - 1) / 2 = 1,249,975,000 subsequences for stride 1).

That is roughly 1 nanosecond per subsequence, meaning there is little speed-up potential left in this approach. Or any other approach that is based on finding/counting all compliant subsequences individually, for that matter, because this necessarily leads to the big bad O(n²).

However, all is not lost. Perhaps we have reduced the original, complex problem to a smaller one. The crucial - and so far O(n²) - operation is this:

given a sequence of integers, count all the pairings where the difference in position plus 1 is equal to the difference in between minimal and maximal value between them (indicating that they represent a contiguous subrange)

Note that 'count' here means 'perform some sub-square algorithm that effectively computes the count without counting each pairing individually'.

As illustration, here's the candidate sequence for stride 1 that is generated by the example sequence, sorted by value and separated into unbroken value runs (of which stride 1 only ever has exactly one that covers the whole input).

1 2 3 4 5 <- value components of the tuples
1 3 2 4 0 <- index components of the tuples, input for above subproblem

I do not have a solution at the moment, but this reduced problem may be easier to solve than the original one.

I have looked at the displacement of inversions as a promising avenue of research, but I quickly shelved that (consider 4 3 2 1 0, which loses no subsequences at all). Still in the running is looking at the absolute values of gaps in the index sequence (other than ±1). Each of those gaps causes a certain number of sequences to be lost and the reach of its effect grows with its size.

Last but not least, it may be possible to curb the worst quadratic excesses by finding unbroken runs - regardless of whether descending or ascending - and treating them wholesale, like a single hop that contributes run_length * (run_length - 1) / 2 counts instead of one. This would make the worst-case input of the original algorithm the easiest input of all,, and so this little trick may just be enough to beat the time-out. ;-) Caveat: an index sequence like 0 3 1 4 2 5 ... does not contain any runs and so the trick cannot work in this case.

גלעד ברקן · Accepted Answer · 2025-12-01 19:46:21Z

Please see Python code below. The algorithm is mine. I got help coding it from Gemini.

from collections import defaultdict
from bisect import bisect_left
from typing import List, Dict, Set, Tuple, DefaultDict
import math
import random


def get_perm(n: int) -> list[int]:
    s = list(range(1, n + 1))
    random.shuffle(s)
    return s


def brute_force(A: list[int]) -> int:
    result = 0
    n = len(A)
    for i in range(n):
        for j in range(i, n):
            subarray = A[i:j+1]
            
            if len(subarray) < 2:
                continue

            sorted_subarray = sorted(subarray)

            diff = sorted_subarray[1] - sorted_subarray[0]
            is_arithmetic = True
            
            for k in range(2, len(sorted_subarray)):
                if sorted_subarray[k] - sorted_subarray[k-1] != diff:
                    is_arithmetic = False
                    break

            if is_arithmetic:
                result += 1

    return result


def get_all_divisors_up_to_n(n: int) -> List[List[int]]:
    """
    Pre-calculates all divisors for every number from 1 up to n using a sieve-like method.
    Returns a list where index i contains a list of all divisors of i.
    """
    divisors = [[] for _ in range(n + 1)]
    for d in range(1, n + 1):
        for multiple in range(d, n + 1, d):
            divisors[multiple].append(d)
    return divisors


def map_values_to_indices(A: List[int]) -> Dict[int, int]:
    """
    Creates a map from the value of an element in array A to its index.
    
    Args:
        A: The input array of integers.

    Returns:
        A dictionary where keys are the values from A and values are their indices.
    """
    return {val: i for i, val in enumerate(A)}


def right_sweep(A: List[int], D: int, l: int, r: int, val_to_idx: Dict[int, int]) -> DefaultDict[int, List[int]]:
    """
    Performs a sweep from the middle index M to the right boundary r to find 
    the rightmost R that, when paired with the middle index M, forms an AP.

    Args:
        A: The input array.
        D: The arithmetic difference (stride).
        l: The left boundary of the current D&C segment.
        r: The right boundary of the current D&C segment.
        val_to_idx: Map of array values to their indices.

    Returns:
        A map where key L is the start index of a crossing AP (L=M), and 
        the value is a list of valid right boundary indices R.
    """
    M = (l + r) // 2
    res: DefaultDict[int, List[int]] = defaultdict(list)
    
    L = M
    R = M
    min_val = A[M]
    max_val = A[M]
    missing: Set[int] = set()
    
    curr = M + 1
    while curr <= r:
        if curr > R:
            # Step 1: Expand R by one index
            R = curr
            val = A[R]
            
            # Check difference congruence for adjacent elements
            if abs(val - A[R-1]) % D != 0:
                return res
            
            # Step 2: Update min/max and the missing set based on the new element
            if val < min_val:
                for v in range(val + D, min_val, D):
                    missing.add(v)
                min_val = val
            elif val > max_val:
                for v in range(max_val + D, val, D):
                    missing.add(v)
                max_val = val
            else:
                # Check if the new element fits the AP structure
                if (val - min_val) % D != 0:
                    return res

            if val in missing:
                missing.remove(val)
                
            # Step 3: Use the missing set to expand the AP range [L, R]
            while missing:
                target = next(iter(missing))
                if target not in val_to_idx:
                    return res
                
                idx = val_to_idx[target]
                
                # Element must be within the current D&C segment
                if idx > r or idx < l:
                    return res
                
                # Expand R to include the missing element at idx > R
                if idx > R:
                    # Fill the gap by checking all intermediate elements
                    for k in range(R + 1, idx + 1):
                        v_k = A[k]
                        if abs(v_k - A[k-1]) % D != 0:
                            return res
                        
                        # Update bounds and check for new missing elements
                        if v_k < min_val:
                            for v in range(v_k + D, min_val, D):
                                missing.add(v)
                            min_val = v_k
                        elif v_k > max_val:
                            for v in range(max_val + D, v_k, D):
                                missing.add(v)
                            max_val = v_k
                        else:
                            if (v_k - min_val) % D != 0:
                                return res

                        if v_k in missing:
                            missing.remove(v_k)
                    R = idx
                # Expand L to include the missing element at idx < L
                elif idx < L:
                    # Fill the gap by checking all intermediate elements
                    for k in range(L - 1, idx - 1, -1):
                        v_k = A[k]
                        if abs(v_k - A[k+1]) % D != 0:
                            return res
                            
                        # Update bounds and check for new missing elements
                        if v_k < min_val:
                            for v in range(v_k + D, min_val, D):
                                missing.add(v)
                            min_val = v_k
                        elif v_k > max_val:
                            for v in range(max_val + D, v_k, D):
                                missing.add(v)
                            max_val = v_k
                        else:
                            if (v_k - min_val) % D != 0:
                                return res

                        if v_k in missing:
                            missing.remove(v_k)
                    L = idx
                else:
                    # This should not happen if logic is correct, but included for safety
                    return res

        # Step 4: Record the result if the AP is complete and unique
        if not res[L] or res[L][-1] != R:
            if not missing:
                res[L].append(R)
        
        curr += 1
        
    return res


def left_sweep(A: List[int], D: int, l: int, r: int, val_to_idx: Dict[int, int], rs_map: DefaultDict[int, List[int]]) -> int:
    """
    Performs a sweep from the middle index M to the left boundary l.
    It counts:
    1. Newly found cross-boundary APs [L, R].
    2. Extensions of those APs using rs_map (APs [L', R'] where L' < L and R' > R).
    
    Args:
        A: The input array.
        D: The arithmetic difference (stride).
        l: The left boundary of the current D&C segment.
        r: The right boundary of the current D&C segment.
        val_to_idx: Map of array values to their indices.
        rs_map: The result of right_sweep used for extensions.

    Returns:
        The total count of cross-boundary APs in the segment [l, r].
    """
    M = (l + r) // 2
    total_count = 0
    valid_L_keys: Set[int] = set()
    
    if M + 1 > r:
        return 0
        
    L = M + 1 # Left boundary of the current sweep range [L, R]
    R = M + 1 # Right boundary of the current sweep range [L, R]
    min_val = float('inf')
    max_val = float('-inf')
    missing: Set[int] = set()

    curr = M # Index currently being added to the left side
    while curr >= l:
        L_before = L # Record L before expansion
        
        if curr < L:
            L = curr
            val = A[L]
            
            # Initialization case: [L, R] is [M, M+1]
            if L + 1 == R:
                val_R = A[R]
                
                if abs(val - val_R) % D != 0:
                    break
                    
                min_val = min(val, val_R)
                max_val = max(val, val_R)

                for v in range(min_val + D, max_val, D):
                    missing.add(v)
            
            # Expansion case: L is moving further left
            else:
                if abs(val - A[L+1]) % D != 0:
                    break

                if val < min_val:
                    for v in range(val + D, min_val, D):
                        missing.add(v)
                    min_val = val
                elif val > max_val:
                    for v in range(max_val + D, val, D):
                        missing.add(v)
                    max_val = val
                else:
                    if (val - min_val) % D != 0:
                        break

                if val in missing:
                    missing.remove(val)
                
            # Expand AP range [L, R] using the missing set (similar to right_sweep)
            while missing:
                target = next(iter(missing))
                if target not in val_to_idx:
                    break
                
                idx = val_to_idx[target]
                
                if idx > r or idx < l:
                    break
                
                # Expand L to include the missing element at idx < L
                if idx < L:
                    for k in range(L - 1, idx - 1, -1):
                        v_k = A[k]
                        if abs(v_k - A[k+1]) % D != 0:
                            break
                        
                        if v_k < min_val:
                            for v in range(v_k + D, min_val, D):
                                missing.add(v)
                            min_val = v_k
                        elif v_k > max_val:
                            for v in range(max_val + D, v_k, D):
                                missing.add(v)
                            max_val = v_k
                        else:
                            if (v_k - min_val) % D != 0:
                                break

                        if v_k in missing:
                            missing.remove(v_k)
                    L = idx
                # Expand R to include the missing element at idx > R
                elif idx > R:
                    for k in range(R + 1, idx + 1):
                        v_k = A[k]
                        if abs(v_k - A[k-1]) % D != 0:
                            break
                            
                        if v_k < min_val:
                            for v in range(v_k + D, min_val, D):
                                missing.add(v)
                            min_val = v_k
                        elif v_k > max_val:
                            for v in range(max_val + D, v_k, D):
                                missing.add(v)
                            max_val = v_k
                        else:
                            if (v_k - min_val) % D != 0:
                                break

                        if v_k in missing:
                            missing.remove(v_k)
                    R = idx
                else:
                    break

        # Record L if it's a valid starting point for an AP found by right_sweep
        if L in rs_map:
            valid_L_keys.add(L)

        # Counting step: only happens if the range [L, R] is a complete AP
        if not missing:
            # Count 1: The AP [L, R] itself is a valid cross-boundary subarray
            total_count += 1
            
            # Count 2: Extensions from all valid L keys found so far
            for L_key in valid_L_keys:
                if L_key in rs_map:
                    R_list = rs_map[L_key]
                    
                    # Use binary search to find the count of R' in R_list such that R' > R
                    start_index = bisect_left(R_list, R + 1)
                    count_greater_R = len(R_list) - start_index
                    
                    total_count += count_greater_R
        
        # Move curr to the next potential starting point L:
        # If L was successfully expanded (L < L_before), jump to L-1 to skip already checked indices.
        # Otherwise, move one step left (curr -= 1).
        if L < L_before:
            curr = L - 1
        else:
            curr -= 1
        
    return total_count


def count_arithmetic_progressions_iterative(A: List[int], D: int, val_to_idx: Dict[int, int], l_start: int, r_end: int) -> int:
    """
    Implements the core divide-and-conquer logic iteratively using a stack.

    Args:
        A: The input array.
        D: The arithmetic difference (stride).
        val_to_idx: Map of array values to their indices.
        l_start: The starting index of the segment.
        r_end: The ending index of the segment.

    Returns:
        The total count of APs within the segment [l_start, r_end].
    """
    if l_start >= r_end:
        return 0
        
    total_count = 0
    stack: List[Tuple[int, int]] = [(l_start, r_end)]
    
    while stack:
        l, r = stack.pop()
        
        if l >= r:
            continue

        M = (l + r) // 2

        # 1. Calculate the map of valid right extensions for APs starting at M (right half)
        rs_map = right_sweep(A, D, l, r, val_to_idx)
        
        # 2. Calculate the count of APs that cross the boundary M/M+1 (combination step)
        cross_count = left_sweep(A, D, l, r, val_to_idx, rs_map)
        
        total_count += cross_count
        
        # 3. Add subproblems to the stack
        stack.append((l, M))
        stack.append((M + 1, r))
        
    return total_count


def f(A: List[int]) -> int:
    n = len(A)

    # Pre-calculate value-to-index map once
    val_to_idx = map_values_to_indices(A)
 
    total_count = 0

    divisor_table = get_all_divisors_up_to_n(n)

    ranges: DefaultDict[int, int] = defaultdict(int)

    for i in range(1, n):
        diff = abs(A[i] - A[i-1])
        current_divisors = divisor_table[diff]

        # Close any ranges whose stride no longer divides the current diff
        for stride in list(ranges.keys()):
            if stride not in current_divisors:
                start = ranges[stride]
                total_count += count_arithmetic_progressions_iterative(A, stride, val_to_idx, start, i - 1)
                del ranges[stride]

        # Open new ranges for each divisor D
        for D in current_divisors:
            if not D in ranges:
                ranges[D] = i - 1

    # Close all still-open ranges
    for D in list(ranges.keys()):
        start = ranges[D]
        total_count += count_arithmetic_progressions_iterative(A, D, val_to_idx, start, n - 1)

    return total_count


num_tests = 100
n = 10

for _ in range(num_tests):
  A = get_perm(n)
  brute = brute_force(A)
  ff = f(A)
  if brute != ff:
    print(brute, ff, A)
    break

print("Done.")

This looks really sophisticated, but trying to understand it makes my head swim. ;-) What are the timings for random and worst-case inputs of length 5, 50, 500, 5000 and 50000?

Collectives™ on Stack Overflow

Efficient algorithm to count contiguous subarrays that can form arithmetic progressions

2 Answers 2

Candidate sequence processing

1 Comment

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Candidate sequence processing

1 Comment

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related