The resultant

Like Euler products for number-theoretic functions, the resultant is one of those amazingly simple gadgets that you’d never imagine existed.  Given two polynomials f and g in a single variable, there is a number called the resultant, denoted res(f,g), such that:

  1. res(f, g) = 0 iff f and g share a common root
  2. res(f, g) is a polynomial in the coefficients of f and g.

Think about this for a second.  Given the coefficients of an arbitrary polynomial, we have in general no algebraic expression for its roots, but nonetheless we have a way of determining if two polynomials share a root by simply adding and multiplying together some of their coefficients!
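To see this in action, here is a small sketch using sympy (my choice of tool, nothing intrinsic to the construction); its resultant routine works entirely from the coefficients, never touching the roots:

```python
# Two polynomials share a root exactly when their resultant vanishes,
# and the resultant is computed from the coefficients alone.
from sympy import symbols, resultant, expand

x = symbols('x')

f = expand((x - 1)*(x - 2))    # roots 1 and 2
g = expand((x - 2)*(x - 5))    # roots 2 and 5; shares the root 2 with f
h = expand((x - 3)*(x - 5))    # roots 3 and 5; shares no root with f

print(resultant(f, g, x))      # 0, since f and g have a common root
print(resultant(f, h, x))      # nonzero, since f and h do not
```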

First, let’s see why such a thing ought to exist.  Say that f(x) = p (x-a_1)\cdots(x-a_n) and g(x) = q (x-b_1)\cdots(x-b_m), and define

res(f, g) = p^m q^n \displaystyle \prod_{i=1}^n \prod_{j=1}^m (a_i - b_j).

This clearly satisfies condition (1) above: the product equals zero iff a_i = b_j for some i and j.  But it also satisfies condition (2).  Why?  Well, if we regard res(f, g) as a polynomial in the a_i’s, with coefficients that are themselves polynomials in the b_j’s, then it’s a symmetric polynomial in the a_i’s: it’s invariant under permuting the a_i’s.  Likewise, each of those coefficients, viewed as a polynomial in the b_j’s, is symmetric in the b_j’s.
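Before going on, here is a quick numerical sanity check of the product formula (a sketch; the two polynomials are arbitrary examples I picked, and numpy.roots only gives approximate roots, so the comparison holds up to floating-point error):

```python
import numpy as np
from sympy import Poly, resultant, symbols

x = symbols('x')
f_coeffs = [2, -3, 1, 5]   # f(x) = 2x^3 - 3x^2 + x + 5   (arbitrary example)
g_coeffs = [1, 4, -2]      # g(x) = x^2 + 4x - 2           (arbitrary example)

a = np.roots(f_coeffs)     # approximate roots of f
b = np.roots(g_coeffs)     # approximate roots of g
p, q = f_coeffs[0], g_coeffs[0]
n, m = len(a), len(b)

# p^m q^n times the product of (a_i - b_j) over all pairs of roots
from_roots = p**m * q**n * np.prod([ai - bj for ai in a for bj in b])

# The same quantity, computed purely from the coefficients
from_coeffs = resultant(Poly(f_coeffs, x).as_expr(), Poly(g_coeffs, x).as_expr(), x)

print(from_roots)          # complex dtype, but the imaginary part should be ~0
print(from_coeffs)         # agrees with from_roots up to rounding
```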

Why does this symmetry matter?  Well, the fundamental theorem of symmetric polynomials says that every symmetric polynomial in n variables can be written as a polynomial in the elementary symmetric polynomials

1,

x_1 + \cdots + x_n,

x_1 x_2 + x_1 x_3 + \cdots + x_{n-1} x_n,

and so forth.

But you’ll recognize that the coefficients of a (monic, univariate) polynomial are precisely the elementary symmetric polynomials in its roots, up to sign!  That is, given a monic polynomial in one variable, any symmetric polynomial of its roots can be written as a polynomial in the coefficients.  (Throwing in the p^m and q^n at the front handles the case when f and g aren’t monic.)
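To see this relationship concretely, here is a tiny symbolic sketch (the root names r1, r2, r3 are arbitrary placeholders I introduced):

```python
# Expanding a monic cubic in terms of its roots: the coefficients are
# the elementary symmetric polynomials in the roots, up to sign.
from sympy import symbols, expand, Poly

x, r1, r2, r3 = symbols('x r1 r2 r3')
f = expand((x - r1)*(x - r2)*(x - r3))
print(Poly(f, x).all_coeffs())
# Expect (up to how sympy orders the terms):
# [1, -(r1 + r2 + r3), r1*r2 + r1*r3 + r2*r3, -r1*r2*r3]
```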

I vaguely remember learning about elementary symmetric polynomials in my undergrad algebra sequence, but at the time I had no real idea what they were for.  They didn’t look that complicated, so I figured they probably didn’t matter too much.  As it turns out, though, the whole subject of invariant theory is really interesting, and symmetric polynomials are just the first nontrivial example.

As an added bonus, note that we can also determine whether a polynomial has a double root by calculating res(f, f’), where f’ is the derivative of f.  (You can define the derivative of a polynomial without using any calculus: just take the power rule et al. as definitions instead of theorems.)  Now res(f, f’) is zero iff f and f’ share a root.  Suppose a is a root of f; then f(x) = (x - a)^n g(x) for some n \geq 1 and some polynomial g with g(a) \neq 0.  Taking the derivative, we have f'(x) = n (x - a)^{n-1} g(x) + (x-a)^n g'(x), so f'(a) = n \cdot 0^{n-1} g(a) + 0^n g'(a).  If n > 1 this is 0, while if n = 1 we get f'(a) = g(a) + 0 \cdot g'(a) = g(a) \neq 0.
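Here is the same observation in code (a sketch; the particular polynomials are arbitrary examples):

```python
# Detecting a repeated root without ever finding the roots:
# res(f, f') = 0 exactly when f has a multiple root.
from sympy import symbols, resultant, diff, expand

x = symbols('x')
f_double = expand((x - 1)**2 * (x - 2))      # repeated root at x = 1
f_simple = expand((x - 1)*(x - 2)*(x - 3))   # three distinct roots

print(resultant(f_double, diff(f_double, x), x))   # 0
print(resultant(f_simple, diff(f_simple, x), x))   # nonzero
```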

Up to a sign and a factor of the leading coefficient, the resultant res(f, f’) is known as the discriminant, as you’ll remember from high-school algebra, where this fancy name was seemingly needlessly assigned to the quantity b^2 - 4 a c appearing in the quadratic formula.  The construction here works for univariate polynomials of arbitrary degree, but in fact it can be generalized further, to multivariate polynomials as well.
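For the quadratic case you can check this symbolically (a sketch; the exact sign you get depends on the convention your computer algebra system uses):

```python
# For f = a*x^2 + b*x + c, res(f, f') is b^2 - 4ac up to a sign
# and a factor of the leading coefficient a.
from sympy import symbols, resultant, diff, factor

x, a, b, c = symbols('x a b c')
f = a*x**2 + b*x + c
print(factor(resultant(f, diff(f, x), x)))   # ± a*(b**2 - 4*a*c)
```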
