The Steiner Inellipse

For simplicity, a polnomial defined on this page is seperable, i.e., its roots are distinct. Everything still holds true for polynomials whose roots have multiplicity greater than 1, but sometimes this is annoying to check.

It's easy to see that the root of the derivative of a quadratic is the midpoint of the roots of the quadratic: Let p(x)=(xα)(xβ). Then p(x)=(xα)+(xβ). So setting p(x):=0, we get x=α+β2.

But what happens when we have a cubic? Do its two roots follow a similar even distribution? Here I will show a theorem, due to Marden, showing that the roots of a cubic have a nice geometric property.

Theorem. Let p(x)C[x] be a cubic polynomial with roots z1, z2, and z3. Then the roots of the derivative p(x) are exactly the focii of an ellipse which is tangent to the midpoints of the line segments z1z2, z2z3, and z3z1. This ellipse is known as the Steiner Inellipse.

an example of the steiner inellipse of a polynomial

The proof I'll show here is an adaptation of Dan Kalman's proof, (which is an amalgamation of proofs by Marden [from 1929] and Bocher [from 1892], although the first proof seems to be attributed to Siebeck [from 1864]). To begin we'll state a few facts that we'll need. If you're not convinced of any of these facts, I prove them out of the way; see their adjacent links.

Facts:

  1. Let M:CC be a linear transformation and let p(z)=(zz1)(zz2)(zz3). Define pM(z)=(zM(z1))(zM(z2))(zM(z3)). Then if z is a root of p(z), M(z) is a root of pM(z). In other words, we can transform the initial triangle Δz1z2z3 via any linear transformation and its roots will follow appropriately.
  2. For a quadratic polynomial p(x)=x2+bx+c with roots z1 and z2, the coefficients are given by b=(z1+z2) and c=z1z2.
  3. Let f1 and f2 be focii of an ellipse, and let p be a point outside of the ellipse. The point p induces (exactly) two tangent lines of the ellipse which intersect p; call the intersection of these lines with the ellipse t1,t2. Then f1pt1=f2pt2.
  4. ellipse angles

We'll break the proof down into two lemmas...

Lemma 1. Let T be the triangle in C with vertices z1,z2,z3, which are the roots of the monic polynomial p(x). Then if the ellipse with the roots of p(x) as the focii passes through the midpoint of one side of T, it is actually tangent to that side of T.

Proof. By Fact 1, we can apply some linear transformation such that the vertices of T are -1, 1, and w, with Im(w)>0. We want to show that the horizontal axis is tangent to the ellipse. So we have p(z)=(z1)(z+1)(zw) and thus p(z)=(z1)(z+1)+(z+1)(zw)+(z1)(zw)=3(z22w3z13). So (by Fact 2) if α1 and α2 are the roots of p, they satisfy α1+α2=2w3 and α1α2=13. In particular, thinking of αj=rjeiθj, we see that the roots lie in the upper half-plane and that θ1+θ2=π. In other words, as vectors, α1 and α2 are supplementary. So the angles made by the focii and the point 0 on the ellipse with the x-axis are equal. Thus, the horizontal axis is a tangent line of the ellipse. This is what we wanted to show.

construction of triangle in lemma 1

Lemma 2. Let T be the triangle in C with vertices z1,z2,z3, which are the roots of the monic polynomial p(x). If the ellipse with roots of p(x) as the focii is tangent to one side of T, then it is tangent to all sides of T.

Proof. Again by Fact 1, we can let the vertices of T be 0, 1, and w for Im(w)>0. Then p(z)=z(z1)(zw)=z3(1+w)z2+wz. Taking the derivative, we get p(z)=3z22(1+w)z+w. By similar tactics as in Lemma 1, we see that α1+α2=2(1+w) and α1α2=w. Note that both α1 and α2 are in the upper half-plane. Let αj=rjeiθj and w=rweiθw. Without loss, say θ1θ2 So we get from the equations above that θ1+θ2=θw. In other words 10α1=w0α2. By Fact 3 (with 0 as the point p), we get that the line 0w is a tangent line to the ellipse. We can reapply this argument to the third edge (perhaps on a different linear transformation) to get the desired result.

construction of triangle in lemma 2

We now have the material to prove the theorem.

Proof of Marden's Theorem. Let E be the ellipse of a polynomial p tangent to one side of the induced triangle T as constructed in Lemma 1. By Lemma 2, E is actually tangent to the other two sides. The points of tangency with the other two sides must be at the midpoints, because if not, we could repeat the construction with the other two sides, giving us three non-identical ellipses with the same focii tangent to the same three lines. But this can't happen.



< Part 2 back to main page >