Module 0295: Binary number comparison

Tak Auyeung, Ph.D.

September 14, 2021

Contents

 1 About this module
 2 Comparison, in general
 3 Unsigned comparison
 4 “Signed” interpretation
  4.1 Food for thought

1 About this module

2 Comparison, in general

Comparison is an operation that returns a value that is either true or false. Although we are familiar with 6 comparison operators, the less-than operator alone is sufficient to “span” the other operators. The equivalences are listed as follows:

\begin {align*} x > y & \Leftrightarrow y < x \\ x \le y & \Leftrightarrow \neg (y < x) \\ x \ge y & \Leftrightarrow \neg (x < y) \\ x = y & \Leftrightarrow \neg (x < y) \wedge \neg (y < x) \\ x \ne y & \Leftrightarrow (x < y) \vee (y < x) \end {align*}

However, even though “less than” alone is enough to span all the other comparison operators, knowing whether one binary number represents a value less than that represented by another binary number is not a straightforward matter.
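As a quick illustration, the following C sketch (the helper names are made up for this example) builds all six comparisons from the less-than operator alone:

#include <stdbool.h>
#include <stdio.h>

/* All six comparisons expressed using only the less-than operator. */
static bool lt(int x, int y) { return x < y; }
static bool gt(int x, int y) { return lt(y, x); }
static bool le(int x, int y) { return !lt(y, x); }
static bool ge(int x, int y) { return !lt(x, y); }
static bool eq(int x, int y) { return !lt(x, y) && !lt(y, x); }
static bool ne(int x, int y) { return lt(x, y) || lt(y, x); }

int main(void) {
    /* A quick demonstration with two sample values. */
    printf("%d %d %d %d %d %d\n",
           lt(3, 5), gt(3, 5), le(3, 5), ge(3, 5), eq(3, 5), ne(3, 5));
    return 0;
}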

3 Unsigned comparison

Unsigned comparison is a little easier. Let us first establish some ground rules:

- A binary number whose most significant bit is 0 is less than a binary number of the same width whose most significant bit is 1.
- A binary number whose most significant bit is 1 is not less than one of the same width whose most significant bit is 0.

These two rules are conclusive on their own, even when we are comparing multi-bit numbers. Let us consider the case of comparing \(x\) to \(y\), where we are only interested in whether the unsigned value represented by \(x\) is less than the unsigned value represented by \(y\).

For notation purposes, let \(x_i\) represent the \(i\)th bit of \(x\). Likewise for \(y\).

We know that bit \(x_i\) contributes \(x_i2^i\) to the value represented by \(x\) as an unsigned binary number. We also know that \(2^n > \sum _{i=0}^{n-1}2^i\). Or, do we?

How do we prove \(2^n > \sum _{i=0}^{n-1}2^i\)? The fact that it holds for every concrete example we can come up with is not sufficient. Let us see whether we can construct a rigorous proof. Because powers of 2 appear on both sides, proof by induction may be a good idea.

The base case is when \(n=1\): we can spell out the comparison as \(2^1 = 2 > 1 = 2^0\). Next, we prove the induction step. The induction assumption is that \(2^k > \sum _{i=0}^{k-1} 2^i\) for some \(k \ge 1\). Our job is to show that \(2^{k+1} > \sum _{i=0}^{k} 2^i\).

This is how the proof goes:

\begin {align*} 2^{k+1} & > \sum _{i=0}^{k} 2^i \Leftrightarrow \\ 2^k + 2^k & > \sum _{i=0}^{k} 2^i \Leftrightarrow \\ 2^k & > \sum _{i=0}^{k-1} 2^i \end {align*}

But the last line is exactly the induction assumption, and every step above is an equivalence, so the induction step holds. The proof is now complete!
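As a complementary sanity check, not a substitute for the proof, a short C loop can confirm the inequality for small values of \(n\):

#include <stdio.h>

int main(void) {
    /* Check 2^n > 2^0 + 2^1 + ... + 2^(n-1) for small n. */
    for (int n = 1; n <= 30; n++) {
        unsigned long power = 1UL << n;
        unsigned long sum = 0;
        for (int i = 0; i < n; i++)
            sum += 1UL << i;
        printf("n=%2d: %lu > %lu? %s\n", n, power, sum,
               power > sum ? "yes" : "no");
    }
    return 0;
}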

Now we want to prove that in an \(m\)-bit subtraction, \(u_m(x)<u_m(y) \Leftrightarrow t_m = 1\), assuming \(t_0 = 0\). In this subtraction, \(x\) is the \(m\)-bit minuend and \(y\) is the \(m\)-bit subtrahend, while \(u_m(n)=\sum _{i=0}^{m-1}n_i2^i\) is the unsigned value interpreted from the \(m\)-bit pattern \(n\).

Let us use proof by induction again. The base case is when \(m=1\). Recall from the subtraction module that \(t_{i+1}=b(x_i,y_i)+b(q_i,t_i)\), where the single-bit borrow function is \(b(x,y)=!xy\) and \(q_i=x_i \oplus y_i\).

The base case is proven by noting that \(t_1=b(x_0,y_0)+b(q_0,t_0)=!x_0y_0 + !(x_0 \oplus y_0)t_0\). Since \(t_0=0\), \(t_1 = !x_0y_0\). This means that \(t_1=1 \Leftrightarrow (x_0=u_1(x)=0 \wedge y_0=u_1(y)=1) \Leftrightarrow u_1(x)<u_1(y)\).

The induction step assumes the theorem is true for bit widths up to \(m-1\). In other words, we assume that for any \(m-1\) bit binary subtraction, \(t_{m-1}=1 \Leftrightarrow u_{m-1}(x)<u_{m-1}(y)\).

Given this assumption, let us consider \(t_m\) in an \(m\)-bit subtraction. We know that \(t_m = b(x_{m-1},y_{m-1}) + b(q_{m-1},t_{m-1}) = !x_{m-1}y_{m-1} + !(x_{m-1} \oplus y_{m-1})t_{m-1}\). There are two cases to consider.

- If \(x_{m-1} \ne y_{m-1}\), then \(q_{m-1}=1\) and \(t_m = !x_{m-1}y_{m-1}\), so \(t_m=1 \Leftrightarrow (x_{m-1}=0 \wedge y_{m-1}=1)\). Because \(2^{m-1}\) exceeds the sum of all lower powers of 2, the most significant bits are conclusive: \(x_{m-1}=0 \wedge y_{m-1}=1 \Leftrightarrow u_m(x)<u_m(y)\).
- If \(x_{m-1} = y_{m-1}\), then \(q_{m-1}=0\) and \(t_m = t_{m-1}\). With equal most significant bits, \(u_m(x)<u_m(y) \Leftrightarrow u_{m-1}(x)<u_{m-1}(y)\), and by the induction assumption the latter holds exactly when \(t_{m-1}=1\).

This concludes the proof!
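To see the theorem in action, the following C sketch (a 4-bit width and function names chosen only for this illustration) computes the borrow chain with the recurrence above and checks the final borrow against unsigned less-than for every pair of 4-bit patterns:

#include <stdio.h>

#define M 4  /* bit width of the subtraction; 4 chosen only for the example */

/* The single-bit borrow function b(x, y) = !x y. */
static unsigned b(unsigned x, unsigned y) { return (x ^ 1u) & y; }

/* Final borrow t_M of the M-bit subtraction x - y,
   using t_{i+1} = b(x_i, y_i) + b(q_i, t_i) with t_0 = 0. */
static unsigned final_borrow(unsigned x, unsigned y) {
    unsigned t = 0;
    for (int i = 0; i < M; i++) {
        unsigned xi = (x >> i) & 1u;
        unsigned yi = (y >> i) & 1u;
        unsigned qi = xi ^ yi;
        t = b(xi, yi) | b(qi, t);
    }
    return t;
}

int main(void) {
    /* Exhaustively confirm that t_M = 1 exactly when x < y as unsigned values. */
    for (unsigned x = 0; x < (1u << M); x++)
        for (unsigned y = 0; y < (1u << M); y++)
            if (final_borrow(x, y) != (unsigned)(x < y))
                printf("mismatch at x=%u y=%u\n", x, y);
    printf("check complete\n");
    return 0;
}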

4 “Signed” interpretation

In a signed interpretation, the value represented by a bit pattern is no longer simply the sum of the powers of 2 selected by the bits, as it is with an “unsigned” interpretation. This introduces a bit of a problem.

First of all, the borrow bit \(t_i\) is no longer of any use for determining whether the minuend is less than the subtrahend.

Recall that two’s complement is merely a method to compute the arithmetic negation of the binary representation of a signed value. Given \(m\) bits, the negation of \(x\) is computed as \(2^m-x\).
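For example, with \(m=4\), negating 5 (bit pattern 0101) gives \(2^4-5=11\), whose bit pattern 1011 represents \(-5\).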

The way values are assigned to an \(m\)-bit binary representation can be summarized as follows:

\begin {align*} v(x) & = (x_{m-1}) ? (-(2^m - \sum _{i=0}^{m-1}x_i2^i)) : (\sum _{i=0}^{m-1}x_i2^i) \\ & = (x_{m-1}) ? ((\sum _{i=0}^{m-1}x_i2^i)-2^m) : (\sum _{i=0}^{m-1}x_i2^i) \\ & = (x_{m-1}) ? ((\sum _{i=0}^{m-2}x_i2^i)-(2^m-x_{m-1}2^{m-1})) : (\sum _{i=0}^{m-2}x_i2^i) \\ & = (x_{m-1}) ? ((\sum _{i=0}^{m-2}x_i2^i)-(x_{m-1}2^{m-1})) : (\sum _{i=0}^{m-2}x_i2^i-x_{m-1}2^{m-1}) \\ & = (\sum _{i=0}^{m-2}x_i2^i)-x_{m-1}2^{m-1} \end {align*}

This is why bit \(m-1\) (the most significant bit) is also called the sign bit: its value determines the sign of the value represented by the bit pattern.
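To make the closed form concrete, the following C sketch (4-bit width and names chosen only for this illustration) evaluates \(v(x)=\sum _{i=0}^{m-2}x_i2^i-x_{m-1}2^{m-1}\) for every 4-bit pattern:

#include <stdio.h>

#define M 4  /* bit width; 4 chosen only for the example */

/* v(x) = (sum of x_i 2^i for i = 0 .. M-2) - x_{M-1} 2^(M-1) */
static int signed_value(unsigned x) {
    int v = 0;
    for (int i = 0; i < M - 1; i++)
        v += (int)(((x >> i) & 1u) << i);
    v -= (int)(((x >> (M - 1)) & 1u) << (M - 1));
    return v;
}

int main(void) {
    /* Print the signed interpretation of every 4-bit pattern. */
    for (unsigned x = 0; x < (1u << M); x++)
        printf("%u%u%u%u -> %d\n",
               (x >> 3) & 1u, (x >> 2) & 1u, (x >> 1) & 1u, x & 1u,
               signed_value(x));
    return 0;
}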

Comparison of signed values is relatively straightforward because of the following derivation:

\begin {align*} x & < y \Leftrightarrow \\ x-y & < y - y \Leftrightarrow \\ x-y & < 0 \end {align*}

Defining \(d=x-y\) as the difference, we should be able to use the sign bit of the difference, \(d_{m-1}\) to indicate whether \(x\) is less than \(y\). However, the boundary cases are tricky.

There are two ways to cross the boundary. A negative value minus a positive value can become “positive” because the result crosses the \(\pm \) boundary. A positive value minus a negative value can cross the same boundary, but in the other direction. Let us examine some 4-bit examples. With 4 bits, the representable signed range is \(-8\) to \(7\). For example, \(-5-4=-9\) is out of range; the bit-level subtraction \(1011-0100\) yields \(0111\), which reads as \(+7\). In the other direction, \(5-(-4)=9\) is also out of range; the bit-level subtraction \(0101-1100\) yields \(1001\), which reads as \(-7\).

When the result of a subtraction exceeds the range of the signed representation, the sign bit of the difference no longer reflects the true sign. This means we can no longer rely on the sign of the difference alone to indicate whether the minuend is smaller than the subtrahend.

The overflow flag/condition is defined as the sign of the result being wrong. In the case of 4-bit binary numbers, \(5-(-4)\) results in overflow being true. In the subtraction \(x-y=d\), the two ways to overflow are

- \(x < 0\) and \(y \ge 0\), and yet \(d \ge 0\) (a negative minuend minus a non-negative subtrahend yielding an apparently non-negative difference), or
- \(x \ge 0\) and \(y < 0\), and yet \(d < 0\) (a non-negative minuend minus a negative subtrahend yielding an apparently negative difference).

Of course, whether the minuend \(x\), the subtrahend \(y\), or the difference \(d\) is less than 0 is determined purely from their sign bits.

The overflow flag is usually denoted by “O” (upper case o) or “V”. The equation defining it for subtraction needs to capture the two cases in which the value of the difference exceeds what can be represented:

\(O=x_{m-1}!y_{m-1}!d_{m-1}+!x_{m-1}y_{m-1}d_{m-1}\)

Again, \(m\) is the number of bits of each number. Using this notation, we can also define the sign flag/condition that reflects the sign of the result: \(S=d_{m-1}\). It is merely the most significant bit of the result.
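Returning to the 4-bit example \(5-(-4)\): the bit-level subtraction \(0101-1100\) yields \(d=1001\), so \(S=d_3=1\); and with \(x_3=0\), \(y_3=1\), and \(d_3=1\), the second term of the overflow equation is 1, so \(O=1\).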

Using the S and O flags, we have enough information to determine whether the minuend is less than the subtrahend when both represent signed values in a subtraction.

Let us consider the four combinations of the S and O flags:

- \(S=0, O=0\): the difference is valid and non-negative, so the minuend is not less than the subtrahend.
- \(S=1, O=0\): the difference is valid and negative, so the minuend is less than the subtrahend.
- \(S=0, O=1\): the difference appears non-negative, but the sign is wrong; the true difference is negative, so the minuend is less than the subtrahend.
- \(S=1, O=1\): the difference appears negative, but the sign is wrong; the true difference is positive, so the minuend is not less than the subtrahend.

As an exercise, it is helpful to think of examples for all four cases. The analysis does indicate that whenever \(S \ne O\), the minuend is less than the subtrahend. Whenever \(S = O\), the minuend is not less than the subtrahend.

For our convenience, we define a new flag \(L=S!O + !SO\) (that is, \(L = S \oplus O\)) so that this one flag indicates, in a subtraction of signed values, whether the minuend is less than the subtrahend.
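The following C sketch (again with a 4-bit width and helper names chosen only for this illustration) computes the S, O, and L flags of every 4-bit subtraction and checks that \(L=1\) exactly when the minuend’s signed value is less than the subtrahend’s:

#include <stdio.h>

#define M 4
#define MASK ((1u << M) - 1u)

/* Signed value of an M-bit pattern: the low bits minus the weighted sign bit. */
static int value(unsigned x) {
    return (int)(x & (MASK >> 1)) - (int)(x & (1u << (M - 1)));
}

int main(void) {
    /* For every pair of M-bit patterns, compute the difference pattern,
       the S and O flags, and check that L = S xor O matches v(x) < v(y). */
    for (unsigned x = 0; x <= MASK; x++) {
        for (unsigned y = 0; y <= MASK; y++) {
            unsigned d = (x - y) & MASK;        /* M-bit difference pattern */
            unsigned xs = (x >> (M - 1)) & 1u;  /* sign bits */
            unsigned ys = (y >> (M - 1)) & 1u;
            unsigned ds = (d >> (M - 1)) & 1u;
            unsigned S = ds;
            unsigned O = (xs & !ys & !ds) | (!xs & ys & ds);
            unsigned L = S ^ O;
            if (L != (unsigned)(value(x) < value(y)))
                printf("mismatch at x=%u y=%u\n", x, y);
        }
    }
    printf("check complete\n");
    return 0;
}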

4.1 Food for thought

Now that we can see how the subtraction of two signed binary numbers can overflow the result, what about addition? Can the addition of two signed binary numbers overflow the sum? If so, how should the overflow flag O be defined?