Gauss's Proof of the Constructibility of the Heptadecagon
Table of Contents
- Motivation
- Background and Theory
- Converting the Problem to Number Theory
- Constructibility Proofs
- Conclusion
Motivation
Using a ruler and compass, we can construct a regular heptadecagon ($17$-gon), but we cannot construct a regular heptagon ($7$-gon). Why? This fact is counter-intuitive as a $17$-gon seems much more complicated than a $7$-gon. How would one even prove such a claim?
All of this began with the ancient Greeks around 300 BC. They loved their ruler-and-compass constructions and were interested in regular polygons. The question became, which polygons are possible to construct and which are not? This problem remained unsolved until 1796 when 19-year-old Carl Friedrich Gauss came along to crack the problem wide open.
In this post, I want to showcase Gauss’s beautiful solution to this problem. The rigorous proof requires deep results from Galois theory. My goal in this post is to give a high-level explanation of the result and give the reader a general intuition as to why it should be true.
Background and Theory
Main Result
Gauss proved the following. A regular $n$-gon is constructible if and only if
\[n = 2^{k} p_1 p_2 \ldots p_m\]where $k$ is any nonnegative integer and each $p_{\ell}$ are distinct Fermat primes, which is a prime of the form $p = 2^{2^i} + 1$ for some integer $i$.
The only current known Fermat primes are $3$, $5$, $17$, $257$, and $65537$. Thus, a $17$-gon can be constructed, but a $7$-gon cannot. Here is a list of all of the constructible regular $n$-gons under $300$.
In this post, I am going to give a proof sketch of this result. It will not be 100% rigorous, but I hope it will be enough to convince you of why this is true. For a complete proof, I recommend reading Gauss’s most famous book Disquisitiones Arithmeticae.
What the Greeks Knew
In Euclid’s infamous textbook The Elements, he provided a few regular polygon constructions. First, he could construct an equilateral triangle, square, and regular pentagon. The equilateral triangle and square are quite simple. The regular pentagon is more complicated, but not too bad.
Unfortunately, the resources I have found online for these are severely lacking. Maybe that means I have to do a series on this. If you know of a comprehensive resource on Greek ruler-and-compass constructions please let me know. The best I could find are the following: equilateral triangle, square, and regular pentagon.
Once you’re able to construct these, there are regular polygons you can construct for free using two methods. In the first method, inscribe a regular $n$-gon into a circle and draw the perpendicular bisector. Then, we can construct a regular $2n$-gon by extending the perpendicular bisector to the circle. Below is an example which constructs a regular hexagon using an equilateral triangle.
In the second method, given a regular $p$-gon and $q$-gon which are coprime, we can construct a regular $pq$-gon. Below is an example of constructing a regular $15$-gon from an equilateral triangle and a regular pentagon.
Place the equilateral triangle and regular pentagon at a common vertex (top of the circle). We make two observations. First, every point made by both the equilateral triangle and regular pentagon must lie on the regular $15$-gon. Second, since $3$ and $5$ are coprime, there must be some vertex in the triangle and some vertex in the pentagon which are adjacent (an exercise to the reader to prove why). These are circled in red. Thus, this will give us the length of one side of the regular $15$-gon, which allows us to generate the rest of it.
Euclid knew all of the above. Thus, the Greeks could theoretically construct regular $n$-gons of the form
\[n = 2^{k} \cdot 3^{b_1} \cdot 5^{b_2}\]where $k$ is any nonnegative integer and \(b_1, b_2 \in \{0, 1\}\) are binary values. However, this is where their knowledge stopped. No one could find a construction for any other regular $n$-gon that did not fall into the above pattern, nor knew whether it was even possible.
A Connection to Algebra
Everything the Greeks did used strict Euclidean geometry. However, in order to thoroughly solve this problem, we need algebra. A major breakthrough came from René Descartes who formed a bridge between Greek constructions and algebra. In his book La Géométrie, he showed the following.
Such numbers are called constructible numbers. I will not give a proof of this. However, the intuition is that rulers can draw lines and compasses can draw circles. A ruler and compass construction plays with the intersections of lines and circles.
Recall the general formulas for lines and circles.
\[y = mx + b \qquad\qquad (x - x_0)^2 + (y - y_0)^2 = r^2\]Since any length is constructed via a sequence of circle/line intersections, they are the result of equations involving only the operations $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$. Refer here for a more detailed proof or study Galois theory.
An important corollary is the following.
This is pretty intuitive. If we can construct the length $\cos \alpha$ or $\sin \alpha$, then transport that length to a unit circle, raise the perpendicular line, and draw the corresponding triangle with angle $\alpha$. More detail is given here.
Converting the Problem to Number Theory
The Complex Plane
Using Descartes’ theorem, we will now convert the geometric problem into algebraic number theory. To do this, we imagine a regular $n$-gon inscribed inside the unit circle. Below are examples using a regular pentagon and a regular heptadecagon ($17$-gon).
Now, consider this regular $n$-gon in the complex plane. Recall that any complex number can be written in Cartesian coordinates and Polar coordinates.
\[x + i y = r e^{i \theta}\]where $(x, y)$ gives the horizontal and vertical displacement of the complex number relative to the origin and $(r, \theta)$ gives the magnitude and angle of the complex number relative to the origin. Using polar notation, we see that the multiplication of two complex numbers is elegantly expressed as multiplying the magnitudes and adding the angles.
\[(r_1 e^{i \theta_1}) \cdot (r_2 e^{i \theta_2}) = (r_1 \cdot r_2) e^{i (\theta_1 + \theta_2)}\]Now consider the angle $\zeta$ (labeled in red).
\[\zeta = \cos(2 \pi/n) + i \sin(2 \pi/n) = e^{i \frac{2 \pi}{n}}\]Since the magnitude is $1$, multiplying $\zeta$ by itself results in rotation around the unit circle in increments of $2 \pi/n$. Therefore, each of the vertices of our regular $n$-gon can be represented by
\[\zeta^k = \cos(2 \pi k/n) + i \sin(2 \pi k/n) = e^{i \frac{2 \pi k}{n}} \qquad \text{for all } k \in \mathbb{Z}\]Therefore, we have successfully converted the geometric polynomial into a set of algebraic variables. Now our goal is to find an exact formula for the values of the $\zeta$’s (in particular an expression for the quantity $\cos (\frac{2 \pi}{n})$). If we find that this expression only the operations $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$, then the associated regular $n$-gon must be constructible.
The Cyclotomic Equation and Its Roots
Consider the polynomial
\[z^n - 1 = 0\]The fundamental theorem of algebra tells us that this polynomial must have exactly $n$ complex roots. In fact, $\zeta^1, \zeta^2, \ldots, \zeta^{n-1}, \zeta^n$ are exactly these roots!
\[(\zeta^k)^n = (\zeta^n)^k = 1^k = 1 \qquad \text{for all } k \in \mathbb{Z}\]Satisfying the above property means $\zeta^1, \zeta^2, \ldots, \zeta^{n-1}, \zeta^n$ are by definition the nth roots of unity.
The root $\zeta^0 = \zeta^n = 1$ is called the trivial root and is not really of interest to us. Using high school algebra, we can factor out this root to get the polynomial.
\[1 + z + z^2 + \ldots + z^{n-1} = 0\]This is called the cyclotomic equation, which is the polynomial whose roots are $\zeta^1, \zeta^2, \ldots, \zeta^{n-1}$.
What makes the cyclotomic equation special is that its roots are deeply connected. To see why, define the following notation.
\[\langle a \rangle \equiv \{ a^i \ \mid \ i = 1, 2, \ldots, n \}\]In simple terms, this notation says to multiply $a$ by itself a bunch of times and see what you get. In technical terms, this is the cyclic subgroup generated by $a$. If $n$ is prime, then $\zeta^1, \zeta^2, \ldots, \zeta^{n-1}$ each satisfy the following property.
\[\langle \zeta^k \rangle = \langle \zeta^1 \rangle \simeq \mathbb{Z} / n \mathbb{Z} \qquad \text{for all } k \in \mathbb{Z}\]In simple terms, successive multiplication of any root will cycle around the unit circle and hit every vertex exactly once. In technical terms, $\zeta^k$ is a generator of the group $\mathbb{Z} / n \mathbb{Z}$. Below shows an example for $\zeta^3$ in the regular $7$-gon.
Here is an anti-example of $\zeta^2$ in a regular hexagon which does not satsify the above property (since $2$ and $6$ share a common factor, thus we get stuck in a subcycle).
A root which satisfies this property is called a primitive root. If $n$ is prime, then all roots of the cyclotomic equation are primitive roots. The intuition is that since we assumed $n$ is prime, no integer can evenly divide it, and therefore you could never get a subcycle. For more details, I recommend studying group theory, in particular cyclic groups and cyclic subgroups.
Side note, if $n$ is not prime, then we can decompose it into its primes, solve for the cyclotomic equation for each of these primes, and reconstruct the roots of $n$. Therefore, we only consider the case where $n$ is prime.
Constructibility Proofs
Helpful Properties
Before we can move on to constructibility proofs, there are two properties of interest. The first is the following.
\[\zeta + \zeta^2 + \ldots + \zeta^{n-1} = -1\]There are a few ways to derive this. The easiest is to use symmetry. From the graph, we can see that the center of mass of the points must be at the origin, Therefore, $\zeta + \zeta^2 + \ldots + \zeta^{n-1} + \zeta^n = 0$ and $\zeta^n = 1$.
The second property involves the complex conjugate of $\zeta^k$. Graphically, these correspond to the points of the polynomial on the mirrored side of the $x$-axis.
\[(\zeta^k)^* = \left ( e^{i \frac{2 \pi k}{n}} \right )^* = e^{- i \frac{2 \pi k}{n}} = e^{i \frac{2 \pi (-k)}{n}} = \zeta^{-k}\]Now, consider the following
And in general
\[\zeta^k + \zeta^{-k} = 2 \cos(2 \pi k/n) \qquad\text{for each } k \in \mathbb{Z}\]Therefore, we have now isolated the quantity $\cos (\frac{2 \pi}{n})$, which is ultimately what we need to find an expression for.
Constructibility Proof for a Regular Pentagon
Ultimately, we want to prove that a regular $17$-gon is constructible. As a warm-up, we are going to first prove this result for a regular pentagon. This will help show the main idea of these types of proofs, and make the proof of the regular $17$-gon less daunting.
Recall that our goal is to derive the analytical expression for the angle of a regular pentagon (in particular the formula for $\cos(2 \pi / 5)$). If we can show that $\cos(2 \pi / 5)$ can be expressed only using the operations $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$, then we have proven that a regular pentagon is constructible.
We consider the two pairs of conjugates and define the following
Now, from the helpful properties above, we derive the following. Recall that $\zeta^{-k} = \zeta^{5-k}$ for the roots of a regular pentagon.
Also
Now comes the brilliant trick. Consider the following polynomial.
\[(z - \gamma_1)(z - \gamma_2) = 0\]We know the roots must be $\gamma_1$ and $\gamma_2$. However, we can find the roots in a second way
Therefore, using high school algebra, we know that the roots must be $\displaystyle \frac{-1 \pm \sqrt{5}}{2}$. You can convince yourself that that $\displaystyle \gamma_1 = \frac{-1 + \sqrt{5}}{2}$ and $\displaystyle \gamma_2 = \frac{- 1 - \sqrt{5}}{2}$. Therefore
Therefore, since $\cos(2 \pi / 5)$ can be written as an algebraic expression involving only the operations $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$, a regular pentagon must be constructible (which we already knew).
Constructibilty Proof for a Regular Heptadecagon
Now, let’s do the same for a regular $17$-gon. It’s going to be the same general idea as a regular pentagon but with a few more complications.
Consider the following variable definitions.
These may seem like they came out of nowhere, but I assure you there are deep, deep mathematics behind them. Unfortunately, it’s too much for me to cover in this already long post. I am planning on making a follow-up post. For further reading, I recommend studying Galois theory and then reading this.
Now, we are going to do a similar procedure to what we did for the regular pentagon in order to solve for $\alpha_1 = 2 \cos(2 \pi / 17)$. I will assert the following and leave it as an exercise for the reader to verify. Proving $\gamma_1 \cdot \gamma_2$ without using brute force is quite tricky. There is a good article here.
As before, we can consider the polynomial
and solve for its roots
Continuing, we do the same thing with $\beta_1$ and $\beta_2$. To save space, I am going to assert the necessary steps and the reader can fill in the gaps if they are inclined.
We actually need the value of $\beta_3$, so we do the same procedure.
Finally, we do this procedure one last time with $\alpha_1$ and $\alpha_4$. The reader can verify that
Again, to save space I will leave it to the reader to simplify the final result. We know that $\alpha_1 = 2 \cos(2 \pi / 17)$. Therefore, we get
Thus, since $\cos(2 \pi / 17)$ can be written as an expression involving only $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$, a regular heptadecagon must be constructible. This is what Gauss proved at 19 years old. For the actual construction of the regular heptadecagon, Numberphile has a great video.
Why Is A Heptagon Not Constructible?
Some may be wondering why this same procedure can’t be used on a regular heptagon. Well it can, but the proof is going to fail. Let’s see why.
As before, we are going to get
But now we have a problem. There is no way we can further combine these equations. The best we can do is the following
I will let the reader verify the following
Therefore, we have the polynomial
\[z^3 - z^2 - 2z + 1 = 0\]The factors of this are really gross, but the key point is that since it’s a trinomial the solution will involve cube roots. A value involving a cube root cannot be constructed using a ruler and compass. Therefore, $\cos(2 \pi / 7)$ is not constructible.
Conclusion
At the beginning of this post, I asserted that a regular $n$-gon is constructible if and only if
\[n = 2^{k} p_1 p_2 \ldots p_m\]where $k$ is any nonnegative integer and each $p_{\ell}$ are distinct Fermat primes, which is a prime of the form $p = 2^{2^i} + 1$ for some nonnegative integer $i$.
With all of our new knowledge and understanding of this problem, let’s explain why this is true.
First, the $2^k$ part comes from starting with a regular polygon and bisecting each edge as many times as you like. We showed an example this by converting an equilateral triangle into a regular hexagon.
Second, the $p_1 p_2 \ldots p_m$ part is explained by the trick which lets us combine any two coprime regular polygon. In particular, given a regular $p$-gon and regular $q$-gon (where $p$ and $q$ are coprime), we can construct a regular $pq$-gon. We showed an example of this method to construct a regular $15$-gon using an equilateral triangle and a regular pentagon.
Therefore, all that remains to be explained is why the primes need to be Fermat primes.
Descartes provided a prided a bridge between ruler-and-compass constructions and algebra. He says that if a length can be expressed as a formula involving only $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$, then it is constructible (and vice versa). Thus, if we can construct $\cos(2 \pi / n)$, then we can construct the corresponding regular $n$-gon.
Then, using number theory, we came up with a procedure for finding the exact formula of $\cos(2 \pi / n)$ (assuming that $n$ is prime). In order to ensure that the solution involved only the operations $\ +, \ -, \ \times, \ \div, \text{ and } \sqrt{\ }$, we needed each step to involve finding the roots of a binomial. The only way to ensure that is for $n$ to be a Fermat prime. If it was not a Fermat prime, then eventually the procedure would produce a trinomial or larger, which would result in the solution involving cube root, or fifth root, etc. Therefore, it would not be constructible.