This is a quick post about the Chinese Remainder Theorem. Specifically, how to use it to solve a system of simple modular equations.
This came up today in an Advent of Code problem, and enough people were asking questions about the theorem to warrant a little blog post, in my opinion.
In brief, the CRT lets us solve a system of equations:

$$x \equiv a_1 \pmod{n_1}$$
$$x \equiv a_2 \pmod{n_2}$$
$$\vdots$$
$$x \equiv a_k \pmod{n_k}$$

by efficiently finding the $x$ that satisfies all of these constraints, provided that the $n_i$ are pairwise coprime, i.e. share no common divisors.
The Modulo Operation
The behavior of mod should be pretty well understood, at least for positive numbers. mod x m means dividing x by m, and looking at what’s left, so we have:

mod 220 2 == 0
mod 221 2 == 1
mod 67 16 == 3

etc.
Another way of defining this is to say that mod x m is the difference between $x$ and the largest multiple of $m$ no greater than $x$, i.e.

$$x = qm + r$$

where $q = \lfloor x / m \rfloor$, and $0 \leq r < m$.
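To connect this with the examples above, for mod 67 16 the decomposition is:

$$67 = 4 \cdot 16 + 3$$

so the remainder is $3$.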
The behavior of mod for negative numbers varies depending on the programming language. In C, for example, (-x) % m is -(x % m) (Haskell’s rem behaves the same way, although its mod does not). But according to the mathematical definition we’ve just given, this wouldn’t be the case.

For example $\mathrm{mod}(-3, 5) = 2$, because the closest multiple of $5$ below $-3$ would be $-5$, and there’s a difference of $2$ in order to reach $-3$ from there. Similarly, we would have $\mathrm{mod}(-7, 5) = 3$, because the closest multiple of $5$ here would be $-10$.
We can define a function to emulate the mathematical behavior, for languages whose built-in operator satisfies (-x) % m = -(x % m) (we’ll write it in Haskell for consistency with the rest of the post, even though Haskell’s own mod is already mathematical):

-- reduce x modulo m, always landing in the range [0, m)
fullMod :: Integer -> Integer -> Integer
fullMod x m = mod (mod x m + m) m
If x is positive, then we have mod x m + m, and we take that modulo m. But, adding a multiple of m doesn’t change the remainder. This is because $\mathrm{mod}(m, m) = 0$, and so $\mathrm{mod}(a + m, m) = \mathrm{mod}(a, m)$.
If x is negative, then we end up with -(mod x m) + m, i.e. m - mod x m after the first step. This is the correct behavior, since if we have something like $\mathrm{mod}(-3, 5)$, we want to end up with $2$, but instead we get $-3$ first. By adding this to $5$, we end up with the $2$ we wanted. We don’t need to do another reduction at this point, so the extra mod _ m changes nothing. (The one exception is when x is an exact multiple of m: then we get $0 + m = m$ after the first step, and the outer mod brings this back down to $0$.)
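Just to sanity check this, here are a few values it produces (written as GHCi-style equations):

fullMod 7 5     == 2
fullMod (-3) 5  == 2
fullMod (-10) 5 == 0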
Modular Arithmetic
If we look at all of the integers, modulo some m, we end up with the set of numbers $\{0, 1, \ldots, m - 1\}$. Since these are still integers, we can carry out addition and multiplication as usual, but remembering to take the modulus with m afterwards. This gives us a ring, denoted $\mathbb{Z}/m\mathbb{Z}$, or $\mathbb{Z}_m$, which is quite a bit shorter.

For example, in $\mathbb{Z}_{12}$, we have $7 + 8 = 3$, $4 \cdot 5 = 8$, and other possibilities.
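As a quick sketch of what this looks like in code, we can build addition and multiplication in $\mathbb{Z}_m$ on top of our fullMod helper (the names addMod and mulMod are just ones I’m making up for this post):

-- addition in Z_m: add, then reduce back into [0, m)
addMod :: Integer -> Integer -> Integer -> Integer
addMod m a b = fullMod (a + b) m

-- multiplication in Z_m: multiply, then reduce back into [0, m)
mulMod :: Integer -> Integer -> Integer -> Integer
mulMod m a b = fullMod (a * b) m

With these, addMod 12 7 8 == 3 and mulMod 12 4 5 == 8, matching the examples above.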
Note:

To be super pedantic, the elements of $\mathbb{Z}_m$ are not $0, 1, \ldots, m - 1$, but rather $\bar{0}, \bar{1}, \ldots, \overline{m - 1}$, where $\bar{a}$ denotes the set of integers sharing a given remainder $a$ modulo $m$. In practice, these work out the same, and we won’t need this level of detail.
For each integer $a$, we have a corresponding remainder in $\mathbb{Z}_m$. Many integers will have the same remainder. To express that two integers have the same remainder, we say:

$$a \equiv b \pmod{m}$$

Usually, $b$ will be reduced completely, and so this means $b = \mathrm{mod}(a, m)$. The subtle difference here is that mod outputs an integer in the range $[0, m)$, but in the equivalence equation, $a$ and $b$ don’t necessarily have to be in this range.
As further examples, we have:

$$13 \equiv 1 \pmod{12}$$
$$-3 \equiv 2 \pmod{5}$$
$$22 \equiv 222 \pmod{10}$$
One useful property is that if:

$$a \equiv b \pmod{m}$$

then:

$$a - b \equiv 0 \pmod{m}$$

i.e. $a - b$ is a multiple of $m$.
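For instance, $14 \equiv 2 \pmod{12}$, and indeed $14 - 2 = 12$ is a multiple of $12$.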
Bézout’s Algorithm
We just need one more tool before we can tackle the CRT: Bézout’s algorithm. The identity behind it states that if we have two integers $a$ and $b$, and their greatest common divisor is $d$, then we can find two integers $x$ and $y$ such that:

$$ax + by = d$$

This is basically a souped-up version of Euclid’s algorithm, finding not only the greatest common divisor, but also the necessary integer factors to get the equation above.
Euclid’s Algorithm
Remember that the gcd of two numbers should return the largest number dividing both of them.

Let’s go over the idea behind Euclid’s algorithm first.

Note that $\gcd(a, 0) = a$. Anything divides $0$, so we just need the largest divisor of $a$. That’s just $a$ itself.

Now, let’s say we can decompose $a$ as:

$$a = qb + r$$

This uses Euclidean division to find a quotient $q$ and a remainder $r$, with $0 \leq r < b$.
We can show that any divisor of $a$ and $b$ is a divisor of $b$ and $r$, and vice versa.

First, let’s show $d \mid a, b \implies d \mid b, r$.

Note that if $d \mid a$, then $a = d a'$ for some factor $a'$ (and similarly $b = d b'$).

With this in hand, it’s clear that if $d \mid a, b$, then we can write $r$ as

$$r = a - qb = d a' - q d b' = d (a' - q b')$$

This means that $d \mid r$.

Now for $d \mid b, r \implies d \mid a, b$.

We can write $a = qb + r$, and get

$$a = q d b' + d r' = d (q b' + r')$$

and thus $d \mid a$.

Since the common divisors of $a$ and $b$ are nothing more than the common divisors of $b$ and $r$, $\gcd(a, b)$ must be equal to $\gcd(b, r)$.
This gives us a recursive algorithm to calculate the $\gcd$:
gcd :: Integer -> Integer -> Integer
gcd a b | a < b = gcd b a -- make sure a >= b
gcd a 0 = a               -- base case: gcd a 0 = a
gcd a b =
  let r = mod a b
  in gcd b r              -- gcd a b = gcd b r
The first clause is the swapping rule, the second the base case, with $\gcd(a, 0) = a$, and the third implements the recursive rule we just went over.
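For example (after import Prelude hiding (gcd), so our version doesn’t clash with the built-in one):

gcd 12 18 == 6
gcd 67 16 == 1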
Extending the Algorithm
Bézout’s algorithm extends this to find not only the $\gcd$, but also two factors $x$ and $y$ such that

$$ax + by = \gcd(a, b)$$

We have a similar base case:

$$\gcd(a, 0) = a$$

With $b = 0$, it’s obvious that:

$$a \cdot 1 + 0 \cdot 0 = a$$

Now, for the general case, we decompose $a$ into $qb + r$, like before.

Let’s say we’ve found $x'$ and $y'$ such that:

$$b x' + r y' = \gcd(b, r) = \gcd(a, b)$$

But, we can write $r$ as:

$$r = a - qb$$

Which gives us:

$$b x' + (a - qb) y' = a y' + b (x' - q y')$$

So $(x, y)$ becomes $(y', x' - q y')$.
We can write out a similar algorithm pretty easily:
bezout :: Integer -> Integer -> (Integer, Integer)
bezout a b | a < b = let (x, y) = bezout b a in (y, x) -- swapping inputs swaps the coefficients
bezout a 0 = (1, 0)                                    -- since a * 1 + 0 * 0 = a
bezout a b =
  let q = div a b
      r = mod a b
      (x, y) = bezout b r
  in (y, x - q * y)
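As a quick check:

bezout 12 5 == (-2, 5)

and indeed $12 \cdot (-2) + 5 \cdot 5 = 1$, which is $\gcd(12, 5)$.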
The Chinese Remainder Theorem
We now have all the tools we need to tackle the CRT itself!
To set the stage again, our starting point is a system of equations:

$$x \equiv a_1 \pmod{n_1}$$
$$x \equiv a_2 \pmod{n_2}$$
$$\vdots$$
$$x \equiv a_k \pmod{n_k}$$

and our goal here is to find some integer $x$ satisfying each of these equations.

The CRT applies in the case where $n_1, \ldots, n_k$ are all pairwise coprime.

This means that:

$$\gcd(n_i, n_j) = 1$$

for any pair $i \neq j$.
Two Equations
Let’s first look at the case where we have just two equations:

$$x \equiv a_1 \pmod{n_1}$$
$$x \equiv a_2 \pmod{n_2}$$

At first this might seem tricky, but remember that we assumed that $n_1$ and $n_2$ are coprime. This means that $\gcd(n_1, n_2) = 1$. Let’s try using Bézout’s algorithm!

This gives us $m_1$ and $m_2$ such that:

$$m_1 n_1 + m_2 n_2 = 1$$

Notice that if we look at this equation modulo $n_1$, we get:

$$m_2 n_2 \equiv 1 \pmod{n_1}$$

This is because modulo $n_1$, the extra multiple $m_1 n_1$ simply disappears.

But multiplying anything by $m_2 n_2$ gives us back that same number, at least modulo $n_1$.

This means that:

$$a_1 m_2 n_2 \equiv a_1 \pmod{n_1}$$
But hey, that’s one of the things we wanted to be true!
We can apply a similar argument to show that:

$$a_2 m_1 n_1 \equiv a_2 \pmod{n_2}$$

So, we have a solution for each part, but how can we combine them to get a solution for both parts at once?

We just add them together!

$$x = a_1 m_2 n_2 + a_2 m_1 n_1$$

This works, because modulo $n_1$, the right part disappears, and we get $a_1 m_2 n_2$, which is $a_1$, as we saw before.

Similarly, modulo $n_2$, the left part disappears, and we get $a_2 m_1 n_1$, which also works out to $a_2$, as we saw before.

This gives us a solution for the case of two equations.
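To make this concrete, take $x \equiv 2 \pmod{3}$ and $x \equiv 3 \pmod{5}$ (numbers chosen just for illustration). Bézout gives us $2 \cdot 3 + (-1) \cdot 5 = 1$, i.e. $m_1 = 2$ and $m_2 = -1$, so:

$$x = a_1 m_2 n_2 + a_2 m_1 n_1 = 2 \cdot (-1) \cdot 5 + 3 \cdot 2 \cdot 3 = 8$$

And sure enough, $8 \equiv 2 \pmod{3}$ and $8 \equiv 3 \pmod{5}$.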
Generalizing
Let’s say we have $k$ equations, instead of just $2$:

$$x \equiv a_1 \pmod{n_1}$$
$$x \equiv a_2 \pmod{n_2}$$
$$\vdots$$
$$x \equiv a_k \pmod{n_k}$$

What we’re going to be doing is solving the first two equations, and then using that to produce a new system of equations.

Let $a_{12}$ be a solution to the first two equations, i.e.

$$a_{12} \equiv a_1 \pmod{n_1} \qquad a_{12} \equiv a_2 \pmod{n_2}$$
What we can prove is:

$$x \equiv a_{12} \pmod{n_1 n_2}$$

if and only if

$$x \equiv a_1 \pmod{n_1} \qquad x \equiv a_2 \pmod{n_2}$$
The first direction is simple. If $x \equiv a_{12} \pmod{n_1 n_2}$, then $x - a_{12}$ is a multiple of $n_1 n_2$, i.e. it is $c \cdot n_1 n_2$ for some $c$. It’s then clear that $x - a_{12}$ is also a multiple of $n_1$, as well as of $n_2$.

This means that:

$$x \equiv a_{12} \pmod{n_1} \qquad x \equiv a_{12} \pmod{n_2}$$

But, remember that $a_{12}$ is a solution to the first two equations, giving us:

$$x \equiv a_1 \pmod{n_1} \qquad x \equiv a_2 \pmod{n_2}$$
For the other direction, we assume that:

$$x \equiv a_1 \pmod{n_1} \qquad x \equiv a_2 \pmod{n_2}$$

Using the fact that $a_{12}$ is a solution, once more we get:

$$x \equiv a_{12} \pmod{n_1} \qquad x \equiv a_{12} \pmod{n_2}$$

This means that $x - a_{12}$ is a multiple of both $n_1$ and $n_2$, i.e.

$$x - a_{12} = k_1 n_1 = k_2 n_2$$

But, since $\gcd(n_1, n_2) = 1$, we can show that $x - a_{12}$ is a multiple of $n_1 n_2$ as well:

We can write $1$ as

$$m_1 n_1 + m_2 n_2 = 1$$

using Bézout, but then we get:

$$x - a_{12} = (x - a_{12}) m_1 n_1 + (x - a_{12}) m_2 n_2 = k_2 n_2 m_1 n_1 + k_1 n_1 m_2 n_2 = (k_2 m_1 + k_1 m_2) n_1 n_2$$

showing that $x - a_{12}$ is indeed a multiple of $n_1 n_2$.

Thus we have:

$$x - a_{12} \equiv 0 \pmod{n_1 n_2}$$

but this means that:

$$x \equiv a_{12} \pmod{n_1 n_2}$$

as desired.
Since $x \equiv a_{12} \pmod{n_1 n_2}$ is equivalent to solving both of the first two equations, we can simply replace both of those equations, to get:

$$x \equiv a_{12} \pmod{n_1 n_2}$$
$$x \equiv a_3 \pmod{n_3}$$
$$\vdots$$
$$x \equiv a_k \pmod{n_k}$$

(Note that the moduli here are still pairwise coprime: the only prime factors of $n_1 n_2$ come from $n_1$ and $n_2$, so it shares no common divisor with any of the remaining $n_i$.)

We now have one less equation, so this works as a recursive rule.

As a base case, if we end up with a single equation:

$$x \equiv a \pmod{n}$$

we simply choose $a$ as our solution.
Concretely
Alrighty, that was a bit of a mouthful, but hopefully it’ll be a bit more understandable if we write this out in code.
We have:
solve :: [(Integer, Integer)] -> Integer
as our signature. This function takes a list of pairs $(a_i, n_i)$, representing the equations, and returns the smallest non-negative integer satisfying all of them.
For zero equations, anything is a solution, so we just return 0:

solve [] = 0
For a single equation, the solution is obvious:
solve [(a, n)] = fullMod a n
Note that we use the fullMod function we defined earlier, so that we get the smallest non-negative solution. a itself would be a solution, but we normalize it here to get the “nicest” version of it.
Now for two or more equations:
solve ((a1, n1) : (a2, n2) : rest) =
  let (m1, m2) = bezout n1 n2
      -- a12 solves both of the first two equations
      a12 = a1 * m2 * n2 + a2 * m1 * n1
  in solve ((a12, n1 * n2) : rest)
So, we solve the first two equations, using Bézout, to get:

$$a_{12} = a_1 m_2 n_2 + a_2 m_1 n_1$$

And then we produce an equivalent equation $x \equiv a_{12} \pmod{n_1 n_2}$, and continue solving, having replaced those two equations with this single one.
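As a final check, we can run this on the classic puzzle asking for a number with remainder 2 mod 3, remainder 3 mod 5, and remainder 2 mod 7:

solve [(2, 3), (3, 5), (2, 7)] == 23

and 23 does indeed satisfy all three equations.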
Conclusion
Hopefully this was a clear enough explanation. The Wikipedia article is actually pretty good for this subject, so I’d recommend it as a reference.
If you like this kind of thing I’d recommend picking up an Algebra book, like “Contemporary Abstract Algebra”, or “Algebra Chapter 0”.