Module 0079: Non-recursive merge sort

Module 0077 walked through the non-recursive merge sort using an example to show how it works. Here, we begin with the algorithm itself, then we analyze it.

2.1 Split

1  define sub split
2    by reference in : FIFO
3    by value out1 : pointer to FIFO
4    by value out2 : pointer to FIFO
5    local lastValue : integer
6    in.rewind()
7    out1->rewrite()
8    out2->rewrite()
9    lastValue ← −∞
10   while ¬(in.isempty()) do
11     if in.head() < lastValue then
12       // cannot continue the run
13       local tmp : pointer to FIFO
14       tmp ← out1
15       out1 ← out2
16       out2 ← tmp
17     end if
18     lastValue ← in.remove()
19     out1->append(lastValue)
20   end while
21   in.close()
22   out1->close()
23   out2->close()
24 end define sub

Listing 1 relies on a FIFO data structure. The FIFO methods used in the listings of this module are rewind, rewrite, isempty, head, remove, append and close.

The split operation reads from the input FIFO in, then outputs consecutive runs to the two output FIFOs in an interleaved way. Whenever a run is broken, as detected on line 11, the pointers to the output FIFOs are swapped. This way, the loop only ever needs to append to the FIFO pointed to by out1.
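The split logic can be tried out with a short Python sketch. This is only an illustration under stated assumptions: a `collections.deque` stands in for the FIFO type, the function returns the two outputs instead of taking pointers, and the names `split`, `src`, `first` and `second` are not from the listing. Unlike a file-backed FIFO, the input deque is consumed by the call.

```python
from collections import deque


def split(src):
    """Distribute the consecutive runs of `src` alternately over two deques."""
    out1, out2 = deque(), deque()
    first, second = out1, out2        # remember which deque is which for the caller
    last_value = float("-inf")
    while src:
        if src[0] < last_value:
            # The run is broken: swap the roles of the two outputs.
            out1, out2 = out2, out1
        last_value = src.popleft()
        out1.append(last_value)       # always append through out1
    return first, second


# The input has three runs: (3, 5), (1, 4) and (2, 6).
a, b = split(deque([3, 5, 1, 4, 2, 6]))
print(list(a))   # [3, 5, 2, 6]
print(list(b))   # [1, 4]
```

The first and third runs land in one output and the second run in the other, which is the interleaving described above.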

2.2 Merge

1  define sub merge
2    by reference out : FIFO
3    by reference in : array [2] of FIFO
4    local lastValue : integer
5    local sorted : boolean
6    local op : integer
7    static lookup : array [boolean][boolean][boolean] of integer
8      initialized to
9      //   lv ≤ in0  lv ≤ in1  in0 ≤ in1
10     lookup[true ] [true ] [true ] = 0,
11     lookup[true ] [true ] [false] = 1,
12     lookup[true ] [false] [true ] = 0,
13     lookup[true ] [false] [false] = 0,
14     lookup[false] [true ] [true ] = 1,
15     lookup[false] [true ] [false] = 1,
16     lookup[false] [false] [true ] = 0,
17     lookup[false] [false] [false] = 1
18   sorted ← true
19   out.rewrite()
20   in[0].rewind()
21   in[1].rewind()
22   lastValue ← −∞
23   while ¬(in[0].isempty() ∧ in[1].isempty()) do
24     if in[0].isempty() then
25       op ← 1
26     else if in[1].isempty() then
27       op ← 0
28     else
29       op ← lookup[lastValue ≤ in[0].head()]
30                  [lastValue ≤ in[1].head()]
31                  [in[0].head() ≤ in[1].head()]
32     end if
33     sorted ← sorted ∧ lastValue ≤ (in[op].head())
34     lastValue ← in[op].head()
35     out.append(in[op].remove())
36   end while
37   out.close()
38   in[0].close()
39   in[1].close()
40   return sorted
41 end define sub

The merge algorithm is a little more complicated than the split algorithm. Much of the logic is captured by the lookup array. This array is three-dimensional, but each dimension is only a Boolean.

The first index indicates whether lastValue is less than or equal to the head of the first input FIFO. If this is true, the head of the first input FIFO is a candidate for the next value to append to the output FIFO.

The second index indicates whether lastValue is less than or equal to the head of the second input FIFO. Again, if this is true, the head of the second input FIFO is a candidate for the next value to append to the output FIFO.

The third index indicates whether the head of the first input FIFO is less than or equal to that of the second input FIFO. This is useful when the heads of both input FIFOs are candidates, or when neither of them is.

The values of the lookup array simply indicate the index of the input FIFO to use.

The conditions on lines 24 and 26 check whether a FIFO is empty. If so, we use the other FIFO. Because of the loop condition on line 23, we are guaranteed that at least one FIFO is not empty.

The actual operations are specified on lines 33 to 35. First, we maintain the sorted status of the output FIFO. This can be done by comparing what we are about to append to the last value appended. Next, we remember the value to be appended as the “last value”. Then, we append the value to the output FIFO.

Because we keep track of whether the output FIFO is sorted, we can return the value of the local variable sorted to indicate whether the output FIFO is sorted.
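The merge logic, including the lookup table, can be sketched in Python as follows. The sketch assumes `collections.deque` objects in place of FIFOs and returns the output together with the sorted flag; the names `LOOKUP`, `merge`, `in0`, `in1` and `is_sorted` are illustrative and not part of the listing.

```python
from collections import deque

# LOOKUP[(lv <= in0, lv <= in1, in0 <= in1)] gives the index of the input to use.
LOOKUP = {
    (True, True, True): 0,
    (True, True, False): 1,
    (True, False, True): 0,
    (True, False, False): 0,
    (False, True, True): 1,
    (False, True, False): 1,
    (False, False, True): 0,
    (False, False, False): 1,
}


def merge(in0, in1):
    """Merge two deques of runs; return (output, whether the output is sorted)."""
    ins = (in0, in1)
    out = deque()
    is_sorted = True
    last_value = float("-inf")
    while in0 or in1:
        if not in0:
            op = 1
        elif not in1:
            op = 0
        else:
            op = LOOKUP[(last_value <= in0[0],
                         last_value <= in1[0],
                         in0[0] <= in1[0])]
        # Track whether the value about to be appended keeps the output sorted.
        is_sorted = is_sorted and last_value <= ins[op][0]
        last_value = ins[op].popleft()
        out.append(last_value)
    return out, is_sorted


out, ok = merge(deque([3, 5, 2, 6]), deque([1, 4]))
print(list(out), ok)   # [1, 3, 4, 5, 2, 6] False
```

The output contains two runs, (1, 3, 4, 5) and (2, 6), so the flag is false and another split-merge pass is needed.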

2.3 Mergesort

Once the merge and split operations are defined, we can define the merge sort algorithm itself in listing 3.

define sub mergesort
  by reference toBeSorted : FIFO
  local tmpFIFO : array [2] of FIFO
  repeat
    split(toBeSorted, address of tmpFIFO[0], address of tmpFIFO[1])
  until merge(toBeSorted, tmpFIFO)
end define sub

This algorithm is rather simple. All we have to do is to keep split-merging until the merged result is sorted.
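To see the repeat-until loop in action, here is a compact, self-contained Python sketch. It is an illustration only: deques stand in for FIFOs, the lookup table is replaced by an equivalent pair of comparisons, and the function returns the number of split-merge passes, which the listing does not do. All names are assumptions made for this example.

```python
from collections import deque

NEG_INF = float("-inf")


def split(src):
    """Move the runs of `src` alternately into two deques (empties `src`)."""
    outs = (deque(), deque())
    cur, last = 0, NEG_INF
    while src:
        if src[0] < last:
            cur = 1 - cur                 # run broken: switch output
        last = src.popleft()
        outs[cur].append(last)
    return outs


def merge(out, ins):
    """Merge ins[0] and ins[1] into `out`; return True if `out` is sorted."""
    is_sorted, last = True, NEG_INF
    while ins[0] or ins[1]:
        if not ins[0]:
            op = 1
        elif not ins[1]:
            op = 0
        else:
            h0, h1 = ins[0][0], ins[1][0]
            if (last <= h0) != (last <= h1):
                op = 0 if last <= h0 else 1   # exactly one head continues the run
            else:
                op = 0 if h0 <= h1 else 1     # both or neither: take the smaller
        is_sorted = is_sorted and last <= ins[op][0]
        last = ins[op].popleft()
        out.append(last)
    return is_sorted


def mergesort(fifo):
    """Sort `fifo` in place; return the number of split-merge passes used."""
    passes = 0
    while True:
        passes += 1
        ins = split(fifo)                 # fifo is empty after this call
        if merge(fifo, ins):
            return passes


data = deque([3, 5, 1, 4, 2, 6])
passes = mergesort(data)
print(list(data), passes)   # [1, 2, 3, 4, 5, 6] 2
```

The example input has three runs, so two passes are enough: the first pass leaves two runs and the second pass leaves one.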

3 Data structure

3.1 What is it?

The FIFO type is an abstraction of data types that have a first-in-first-out nature. The queue abstract data type fits in this category. However, so does a sequential file: when a sequential file is opened for reading, we can only read in one direction, and when it is opened for writing, we can only write in one direction (the same direction as reading).

3.2 Linked list queue

A queue abstract data type is a FIFO. The translation of a queue ADT to a FIFO is straightforward. Some FIFO methods are not applicable to queues.

3.3 File

3.4 Buffered read

The head method allows the algorithm to “peek” at the first item of a FIFO without removing it. Neither the ADT of a queue nor file operations implement this kind of operation.

As a result, the FIFO implementation needs some additional data members. First of all, when we rewind a FIFO, we need to follow the buffered read logic in listing 4 after performing the implementation-dependent rewind operation (reopening a file, for example).

empty ← implementation.isempty()
if ¬empty then
  buffer ← implementation.remove()
end if

The FIFO remove method needs to use buffered read. However, it must first remember the value to return; otherwise the buffered read operation overwrites the value to be returned. The logic of FIFO remove is shown in listing 5.

tmp ← buffer
bufferread()
return tmp

The FIFO isempty is not a wrapper of the isempty method of the implementation. This is because we are performing buffered read. The actual implementation may be empty, but the FIFO itself may not be empty yet, because the last item in the actual implementation is actually stored in the data member buffer. In other words, FIFO isempty is always behind the implementation isempty by one read.
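The buffered read idea can be sketched as a small Python class. This is a minimal illustration, not the module's implementation: a Python iterator stands in for the underlying sequential source, `StopIteration` plays the role of the implementation's isempty test, and the class name `BufferedFIFO` and the choice to raise `IndexError` on an empty FIFO are assumptions made here.

```python
class BufferedFIFO:
    """Read-only FIFO that offers head() on top of a read-next-only source."""

    def __init__(self, items):
        self._items = list(items)    # stand-in for a sequential file (assumption)
        self.rewind()

    def _buffer_read(self):
        # Buffered read: pre-read one item if the source still has one.
        try:
            self._buffer = next(self._source)
            self._empty = False
        except StopIteration:
            self._buffer = None
            self._empty = True

    def rewind(self):
        self._source = iter(self._items)   # implementation-dependent rewind
        self._buffer_read()

    def isempty(self):
        return self._empty

    def head(self):
        if self._empty:
            raise IndexError("head() on an empty FIFO")
        return self._buffer

    def remove(self):
        tmp = self.head()        # remember the value before it is overwritten
        self._buffer_read()
        return tmp


f = BufferedFIFO([7, 8])
print(f.head())      # 7 (peek does not remove)
print(f.remove())    # 7
print(f.isempty())   # False: the source is exhausted, but 8 sits in the buffer
print(f.remove())    # 8
print(f.isempty())   # True
```

After the first remove, the underlying iterator has nothing left to read, yet the FIFO still reports that it is not empty. This is the one-read lag between the FIFO and its implementation.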

4 How long does it take?

4.1 Split

The split subroutine reads every value from the input FIFO, and outputs every value to one of the two output FIFOs. As a result, the amount of time required by each invocation of “split” is proportional to the number of items to sort.

4.2 Merge

The merge subroutine reads every value from the two input FIFOs, and outputs each value to the output FIFO. As a result, the amount of time required by each invocation of “merge” is proportional to the number of items to sort.

4.3 Mergesort

How many iterations do we need? That’s the real question. Each iteration of the loop in “mergesort” calls split and merge each once. In other words, the execution time of each iteration of the loop in mergesort is proportional to the number of items to sort.

The question boils down to how quickly the algorithm reduces the number of runs in the worst case.

4.3.1 Merging two runs

A run can be considered as a tuple $(a_1, a_2, a_3, \dots, a_n)$. By definition, elements in a run are sorted. In our example, $\forall i, j \in [1 \dots n] : (i \leq j) \Rightarrow (a_i \leq a_j)$, assuming the tuple is sorted in a non-decreasing way.

Now, let us consider two runs, $A = (a_1, \dots, a_n)$ and $B = (b_1, \dots, b_m)$. We claim that after we merge these two runs, the result is a single run with $n + m$ values.

Let us use proof by contradiction. We first negate the proposition that we want to prove, yielding “$(a_1, \dots, a_n)$ and $(b_1, \dots, b_m)$ are two runs, and after the merge process, the result is not a single run.”

Let us represent the merge result symbolically as $C = (c_1, \dots, c_{m+n})$. Let us define the first break point of the run as the index $i \in [1 \dots m+n-1]$ such that $c_i > c_{i+1} \land \neg (\exists j \in [1 \dots i-1] : c_j > c_{j+1})$. Without loss of generality, let us assume that $c_{i+1}$ corresponds to $b_q$. The merge result then looks like

$(c_1, c_2, \dots, c_p, c_{p+1}, \dots, c_i, c_{i+1}, \dots, c_{m+n})$

in which $c_i > c_{i+1}$ and $c_p > c_{i+1}$, where $p \leq i$ is the smallest index with $c_p > c_{i+1}$. Because there is no break before $i$, every $c_j$ with $j \in [p \dots i]$ satisfies $c_j \geq c_p > c_{i+1}$.

If one of the $c_j, j \in [p \dots i]$ comes from run $B$, we have a contradiction because run $B$ is sorted: this $c_j$ precedes $b_q$ in $B$, yet it is greater than $b_q$.

If all of the $c_j, j \in [p \dots i]$ come from run $A$, we still have a contradiction. Let $c_p$ correspond to $a_r$. When we choose the value of $c_p$, the two candidates must have been $a_r$ and $b_q$. We know that neither $a_r$ nor $b_q$ breaks the run as $c_p$: $a_r$ does not because $p \leq i$ and $i$ is defined to be the first break of the run in $C$, and $b_q$ does not because $p$ is the smallest index with $c_p > c_{i+1} = b_q$, so the value before $c_p$ (if any) is at most $b_q$. According to the logic of the merge algorithm, we do not choose the larger one, which means that $a_r \leq b_q$. But this contradicts the fact that $a_r = c_p > c_{i+1} = b_q$.

Because we can only arrive at contradictions, the original proposition is true. This means that when we merge two runs, $A$ and $B$, the resulting tuple (list) $C$ must be a single run.

4.3.2 Number of iterations

The split algorithm ensures that if the original FIFO has $n$ runs, then each output FIFO has about $n / 2$ runs. When $n$ is odd, one output FIFO has $\lfloor n / 2 \rfloor + 1$ runs.

The merge algorithm, by the proof in the previous section, combines the runs from both input FIFOs into at most $\lfloor n / 2 \rfloor + 1$ runs when $n$ is odd. If $n$ is even, the number of runs in the result is at most $n / 2$.

This means that if we start with 17 runs, there will be at most 9 runs after the first split-merge, at most 5 runs after the second iteration, at most 3 runs after the third, at most 2 runs after the fourth, and the FIFO is sorted after the fifth.
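The arithmetic for 17 runs can be reproduced with a few lines of Python. The sketch only models the worst-case run count after each split-merge pass (halving, rounded up); it does not sort anything, and the function name `runs_after_each_pass` is made up for this example.

```python
def runs_after_each_pass(runs):
    """Worst-case number of runs left after each split-merge pass."""
    counts = []
    while runs > 1:
        runs = (runs + 1) // 2     # n/2 for even n, (n/2) + 1 for odd n
        counts.append(runs)
    return counts


print(runs_after_each_pass(17))   # [9, 5, 3, 2, 1]
```

The list has five entries, one per split-merge iteration, matching the count in the paragraph above.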

We can think about this backwards. If the result of a split-merge has $n$ runs, then the input of that iteration had at least $2 (n - 1) + 1$ runs. As a result, $k$ split-merge iterations can sort $2^{k - 1} + 1$ runs.
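The backwards reasoning can be checked numerically with a short Python sketch. It starts from the two runs that a single iteration can sort and applies the step $2 (n - 1) + 1$ repeatedly; the function name `runs_for_iterations` is an assumption made for this illustration.

```python
def runs_for_iterations(k):
    """Run count reached by stepping backwards k - 1 times from two runs."""
    runs = 2                          # one iteration sorts two runs
    for _ in range(k - 1):
        runs = 2 * (runs - 1) + 1     # one step backwards
    return runs


for k in range(1, 8):
    # The stepwise value agrees with the closed form 2^(k-1) + 1.
    assert runs_for_iterations(k) == 2 ** (k - 1) + 1
    print(k, runs_for_iterations(k))
```

For $k = 5$ this gives 17 runs, which agrees with the 17-run example.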

Want to see the proof of this? Let us define $f (1) = 2$, and $f (x) = 2 (f (x - 1) - 1) + 1$ for $x > 1$. Here, the function $f (x)$ computes the least number of runs given that the algorithm completes in $x$ iterations. I claim that $\forall x \geq 1 : f (x) = 2^{x - 1} + 1$. We can prove this by induction. $f (1) = 2^{1 - 1} + 1 = 2$ is our base case. The induction step states that for all $x \geq 1$, if we assume $f (x) = 2^{x - 1} + 1$, then

$f (x + 1) = 2 (f (x) - 1) + 1 = 2 (2^{x - 1} + 1 - 1) + 1 = 2 \cdot 2^{x - 1} + 1 = 2^{(x + 1) - 1} + 1$

In the worst case that each run is a single value, $k$ split-merge iterations sort $2^{k - 1} + 1$ values. Because the execution time of each iteration is proportional to the number of values, the execution time of merge sort is proportional to $k (2^{k - 1} + 1)$ for an input size of $2^{k - 1} + 1$.

Instead of expressing everything in $k$, which is the number of iterations needed to sort the input, let us use $N = 2^{k - 1} + 1$. We can then express the execution time as proportional to $k \cdot N$. Note that $\log_{2} (N) = \log_{2} (2^{k - 1} + 1) \approx \log_{2} (2^{k - 1}) = k - 1$. That means $\log_{2} (N) + 1 \approx k$. As a result, the execution time is proportional to

$(\log_{2} (N) + 1) N \approx N \log_{2} (N)$

This means merge sort has a complexity of $O (n \log (n))$. A comparison-based sorting algorithm cannot have a better worst-case complexity! Another module discusses the concept of the big-oh notation.
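The approximation used in the derivation, replacing $k \cdot N$ by $N \log_{2} (N)$, can be examined numerically. The following Python sketch is only an illustration; the function name `cost_ratio` is made up, and the ratio it prints is $k \cdot N$ divided by $N \log_{2} (N)$ for $N = 2^{k - 1} + 1$.

```python
import math


def cost_ratio(k):
    """Ratio of k * N to N * log2(N) for N = 2^(k-1) + 1."""
    n = 2 ** (k - 1) + 1
    return (k * n) / (n * math.log2(n))


for k in (5, 11, 21, 41):
    print(k, 2 ** (k - 1) + 1, round(cost_ratio(k), 4))
```

The ratio stays above 1 and moves toward 1 as $k$ grows, so the two expressions grow at the same rate, which is what the big-oh statement needs.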