I was reading Popov and Vinberg’s book on invariant theory when they asserted a basic lemma (without proof) that I didn’t recognize. It was this (p. 155):
Lemma 2.4: Suppose is an extension of the field
, and
is a group of
-automorphisms of
. Suppose
is a (not necessarily finite-dimensional) vector space over
, and
is a subspace of
that is stable under the natural action of
(induced by the action on the first factor). Then
is generated by invariant vectors.
I asked for a proof on Math.SE. Alex Youcis pointed me to these notes of J. Milne (see section c). They’re phrased in slightly different language, but they do the job. The heart of the matter is the following lemma:
Lemma 16.6 (this is Milne’s numbering, though I’m not being completely faithful to his statement or notation): Suppose is a field extension such that
satisfies
. (Note this is a stronger set of assumptions about
than the above.) Let
be a
-vector space. If
is a nontrivial
-subspace that is stable under the action of
, then it contains a nontrivial
-invariant vector. (NB: the
-invariant vectors are precisely those that lie in the image of the natural
-linear map
given by
.)
The proof of this lemma is as follows. Choose a basis for
, where
is some indexing set. Then any element of
is a finite sum of the form
, with
for each
in the (finite) sum. Since
is nontrivial, we can choose a
such that the number of terms in this representation
is minimal among nonzero elements of
, and since
is a
-vector space, we can scale this
by an element of
so that one of the coefficients in this sum is
; thus we can without loss of generality assume
(where we have labeled
as the basis vector corresponding to the coefficient
) with the number of nonzero terms minimal. Since
is
-stable, for any
we then have
, and therefore, their difference
is in
as well. Note this sum has fewer nonzero terms than the sum representing
, and we conclude by
‘s minimality that this new sum is zero, and thus that
, since the
‘s are linearly independent over
and thus the
‘s are linearly independent over
.
We can conclude that for each
and for each
. It is immediate that
is
-invariant, proving the existence of a nontrivial
-invariant vector. Since
, the
‘s all lie in
, and this, together with similar logic as above (linear independence of
‘s over
), also lets us recognize that
-invariant vectors are exactly those of the form
with all
‘s in
, i.e. those that are images of some
under the natural
-linear map
, as per the parenthetical statement in the lemma. QED.
I recognized the proof of Lemma 16.6 as very similar to the proof I learned many years ago of Artin’s lemma, one of the building blocks in Artin’s develoment of the fundamental theorem of Galois theory. Artin’s lemma is the statement that if is a field,
is a finite group of automorphisms of
, and
, then
. I spent some time yesterday morning thinking through this relationship, and I realized that the proof of Artin’s lemma can be reformulated in an elegant way to use Lemma 16.6. I wanted to record that here.
Proof of Artin’s lemma using Lemma 16.6:
Consider any finite collection of elements of elements of
that are linearly independent over
. Our goal is to show that
. We will do this by constructing a certain
-linear map
and showing it is injective. The map is defined by
-linear extension of the following map on the standard basis
for
:
.
The right side here is an element of with entries indexed by elements of
.
We are going to apply Lemma 16.6 with . In this situation,
is
. We will take
as
. (To invoke the lemma we will have to show it is
-stable.) The assumption that the
‘s are linearly independent over
is then going to imply that there is no
-invariant vector in
, from which Lemma 16.6 will imply that
itself is zero, i.e. that
is injective, as desired. Here are the details:
First, we claim is
-stable. We see this as follows. Let
(where each lies in
) be an arbitrary element of
, i.e. assume that
.
Looking coordinate-by-coordinate, this equation states that for each
. Then, if
is arbitrary, we have
as well. As traverses
,
does as well, so these equations (for each
) can be arranged into the vector equation
.
The left side is precisely . Thus for any
,
as well, i.e.
is
-stable.
Thus we can invoke Lemma 16.6: if is nontrivial, then it contains a nontrivial
-invariant element
, which in view of the parenthetical note at the end of the lemma, has the form
with each
and some
nonzero. But then
is the statement that
.
Extracting the coordinate where from this vector equation, we get
.
But since each , this is a nontrivial linear relation between the
‘s over
, which is not possible since we presumed them to be linearly independent. We conclude
is trivial, i.e.
is injective, and it follows that
. This completes the proof of Artin’s lemma.