Main reference: http://chrisballance.com/wp-content/uploads/2015/10/CRC-Primer.html
First, let us talk about the need for CRC
There are many ways to make sure of the integrity of data that is transferred to us from a sender over a medium, e.e. a network.
We could hash, and send the hash(e.g. SHA1, SHA256, Blake, etc.) along with it; we could send the parity along with it(You count the number of 1s in the data, and check
if it is even or odd. odd parity goes as one, while even parity goes as 0); we could also use CRC. We use CRC sometimes because it gives us better promises (no guarantees)
than simple approaches like Parity, and isn't as complex as hashing.
We could count the number of bytes in the message (or data) and perform the addition modulo 256 to make it all fit into a byte. The issue is that there are 256 possible
checksums. That limitation means some errors will go unnoticed so long as the errors still compute to an unaltered checksum. This will happen on average one out of
256 times as probability tells us. We could do something to mitigate this effect; we could increase the size of the checksum so that it contains 2 bytes, i.e.
2^ 16 (i.e. 65536) possible values instead of 2^8. The problem with this is that it does not help us much. Each coming byte being appended in computing the checksum will likely
affect just one byte(emphasis on likely). Addition is so simple an operation (or function) that it doesn't spread its effects quite randomly enough. When I add two bytes, there's a high chance that
the result will remain in the first byte, and if it doesn't it'll be in the second. It's not randomized enough.
You know what's cool? The division is quite random; far more random than addition. The division will touch every single byte accumulated as you compute the checksum. The division allows each additional byte to
have an effect on the entire checksum register. Width and Chaos are therefore essential for any good checksum.
So what is CRC? It means Cyclic Redundancy Check
The basic idea of CRC algorithms is simply to treat the message as an enormous binary number, to divide it by another fixed binary number, and to make the remainder from this division the checksum. Upon receipt of the message, the receiver can perform the same division and compare the remainder with the "checksum" (transmitted remainder). ~ Chris Ballance
Remember the way division works? Check out 1559/9.:
1559 = 0000011000010111 base 2
9 = 1001 base 2
0000000010101101 = 173 = quotient
__________________
1001 | 0000011000010111 = 0617 = 1559 = DIVIDEND
DIVISOR 0000.,,....,.,,,
----.,,....,.,,,
0000,,....,.,,,
0000,,....,.,,,
----,,....,.,,,
0001,....,.,,,
0000,....,.,,,
----,....,.,,,
0011....,.,,,
0000....,.,,,
----....,.,,,
0110...,.,,,
0000...,.,,,
----...,.,,,
1100..,.,,,
1001..,.,,,
====..,.,,,
0110.,.,,,
0000.,.,,,
----.,.,,,
1100,.,,,
1001,.,,,
====,.,,,
0111.,,,
0000.,,,
----.,,,
1110,,,
1001,,,
====,,,
1011,,
1001,,
====,,
0101,
0000,
----
1011
1001
====
0010 = 02 = 2 = REMAINDER
Consider how the division above works. As the divisor consumes each bit, it changes the remainder which is passed down and down and shifted to the right a lot of times. Add an extra byte to the dividend and the remainder will get kicked around about 8 more times, making th checksum register susceptible to greater change i.e. having higher entropy.
CRC uses polynomial arithmetic, meaning that the message is treated as a polynomial and each bit is a coefficient of the polynomial variable (say x
).
The degree of the polynomial ranges from 0
to n-1
if the length of the message is n
bits.
10111
becomes 1*x^4 + 0*x^3 + 1*x^2 + 1*x^1 + 1*x^0
or x^4 + x^2 + x^1 + x^0
So, imagine we want to multiply 1101
by 1011
, we'll have:
(x^3 + x^2 + x^0)(x^3 + x^1 + x^0)
= (x^6 + x^4 + x^3
+ x^5 + x^3 + x^2
+ x^3 + x^1 + x^0) = x^6 + x^5 + x^4 + 3*x^3 + x^2 + x^1 + x^0
Because we know that they are all coefficients of 2, out operations become simpler,
and we need to perform our additions modulo 2. Interesting things happen
Our polynomial changes: as `3` becomes `2 + 1`, leading `3x^3` to become `2x^3 + 1x^3`,
and the,`2x^3` turning to `1x^4`, leading to it joining with the existing
`x^4`, making it `2x^4`, which also trickles down (or up?) to the existing `x^5`,
making it `2x^5`, and itself joing with the `x^6` to become `x^7`
In the end, we have:
x^7 + x^3 + x^2 + x^1 + x^0
But this is not how CRC works. CRC uses pulynomial arithmetic mod 2
. We don't presume that we know the value of x
as we did in the above. So we cannot carry anything. Nothing flows up or down.
All operations are limited to the to interacting bits. This is interesting, isnt it, because addition in this case would definitely look like normal binary addition with no carries.
And remember how binary addition in half-adders are done. The sum
id the XOR
of the bits, while the carry
is the AND
of the bits. Here, the carry
is useless.
Remember the XOR
table:
"The reader should note the similarity between polynomial arithmetic and multiple-precision arithmetic, where the radix b is substituted for x. The chief difference is that the coefficient u_k of x^k in polynomial arithmetic bears little or no relation to its neighboring coefficients x^{k-1} [and x^{k+1}], so the idea of "carrying" from one place to another is absent. In fact polynomial arithmetic modulo b is essentially identical to multiple precision arithmetic with radix b, except that all carries are suppressed." ~Donald Knuth
Somethin interesting happens when we do binary arithmetic with no carries: bot +
and -
operations are the same. They are XOR
0+0=0
0+1=1
1+0=1
1+1=0
0-0=0
0-1=1
1-0=1
1-1=0
1010 = 1010 + 0011
1010 = 1010 - 0011
What then is multiplication in this case? Multiplication is a number of shif operation (the number being the length of the second bit string). Those bit shifts are added together
the same way we added the with XOR
arithmetic. The bit strings added are either the entire first number or a bit string of the same length but all zeros depending
on the value of the consecutive bits in the second number, which could either be 0
or 1
.
Consider:
1101
x 1011
----
1101
1101.
0000..
1101...
-------
1111111
-------
What then is division? Division must define when a number goes into
another. Generally, we say that a polynomial b
goes into another polynomial a
if the highest power of
b
is <= the highest power of a
and the coefficient of that elemt with the higest power in b
is <= that of a
. This is not the way a mathematician will put it, but please manage. Forgive.
e.g.
1100001010
_______________
10011 | 11010110110000
10011,,.,,....
-----,,.,,....
10011,.,,....
10011,.,,....
-----,.,,....
00001.,,....
00000.,,....
-----.,,....
00010,,....
00000,,....
-----,,....
00101,....
00000,....
-----,....
01011....
00000....
-----....
10110...
10011...
-----...
01010..
00000..
-----..
10100.
10011.
-----.
01110
00000
-----
1110 = Remainder. ~ source: Tanenbaum81
First, we need to choose a polynomial. Let the width be w. For instance, if we choose 110111, the width w
is 5 because it is the value of the highest power of two in the binary string. Append w
number of zeroes to the message m
whose crc
checksum we want to calculate. In this case, if m
were 11001010101
, it'd become 1100101010100000
, with 5 zeroes. The next thing to do is to divide our changed m
by the polynomial using the polynomial arithmetic we described above.
The quotient is useless. The remainder will either have width w
or less. Think about it. It couldn't be of the same length as the poly we used (which would be w+1
). It would either be w
or less. The remainder is the crc checksumm
. We append it to the original message. The key is to ensure that the register remains
sized as w
even if it is less than w
in length. If it is less than w
in length, then zeroes will fill in the first few positions to make up for it.
The reason is that when we want to evaluate the integrity of the data, we would only need to divide by the polynomial we chose (which is usually chosen by a standard) and check that it yields zero as the remainder. Now, I'll explain why it must be so if you don't see it yet. (I didn't see it at first because it wasn't clearly stated that the register size must retain its width when appending the remainder to the original message before transmission).
Consider:
m = 1101011011
poly = 10011
m' = 11010110110000 (we appended `0000`, which is of width 4, the degree of the poly)
After dividing, our remainder is: 1110
We append it to m, and so our transmitted message is: 11010110111110
Consider that 11010110111110 is equal to 11010110110000 + 1110
By the remainder theorem if a(b + r/a) = z, meaning that when we divide z by a, we get b, which is a whole number,
and r/a which is a fraction because r is less than a. In such a case, z = ab + r, and to get z back, we only need to add ab to r.
In our case, adding the message `m` to the remainder of its division by `poly` (which is essentially what we did when we appended the remainder
of width `w` to `m` before transmitting) means that what is left after dividing `m` by `poly`, being `r` or `crc`, has been
added back to the fatter `m`. Now, we were not supposed to add it, we were supposed to remove it so that that
remainder which kept us from dividing neatly would be removed, but it does not matter because, in our polynomial arithmetic,
addition is isomorphic to subtraction. When we then divide the entire transmitted polynomial by our divisor (the poly we chose),
what we should get as the remainder is zero.
From the long division we did above, we can get an algorithm. First, we choose a polynomial poly
, and then we fill our crc
register with w
zeroes.
We do not need to use (and indeed we shouldn't) a register of size w+1
which is the width of the poly
. We append w
zeroes to our polynomial and our shifting and addition begin.
We lay out the m'
from its highest bit to its lowest bit, and we move from lest to right.
Now, as we move, we shift the crc
register to the left by one and drop down a bit from m'
to take the place of the bit that would've been replaced by zero after a left shift. Knowing that for every shift left, a bit falls off - the leftmost one- we check the value of that bit that falls off if it is 1 then we xor
our register against poly
if it isn't, we pass which is the same as xor
-ing the register (i.e. the crc
) against zero, as xor
-ing against a zeroed mask is a no-op. The final value in the register after consuming our last bit (which must be a zero, remember?) is the crc
checksum. All we need to do is append it to the message m
and transmit it. The receiver needs only to take the transmitted message and subject it to our procedure for polynomial arithmetic (the shiftings and xor
-ings) without doing anything to the message received. If the remainder left in the register is zero, then they may accept it. Otherwise, they have the right to say the message is compromised.
But this algorithm is slow