Kac-Moody Lie Algebras Chapter III

Kac-Moody Lie Algebras
Chapter III: Representation theory

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last update: 10 September 2012

Abstract.
This is a typed version of I.G. Macdonald's lecture notes on Kac-Moody Lie algebras from 1983.

To begin with, let $A = (a_{i j})$ be any $n \times n$ matrix over the field $k$ . Eventually $A$ will have to be a symmetrizable Cartan matrix, but we shall bring in that assumption only when it becomes necessary.

Recall that

𝔤 = 𝔤 (A) = 𝔥 + \sum_{α \in R} 𝔤_{α} (direct sum)

and that each root space $𝔤_{α}$ is a finite-dimensional (1.7).

Let $M$ be a $𝔤$ –module, i.e. a $k$ –vector space on which $𝔤$ acts, so that we are given a Lie algebra homomorphism $π : 𝔤 \to 𝔤 𝔩 (M),$ which extends to $π : U (𝔤) \to End (M),$ i.e $M$ is a $U (𝔤)$ –module. Notation $(M, π)$ when I want to be pedantic. More often then not I shall suppress $π$ and write $x . v$ or $x v$ for $π (x) v$ $(x \in 𝔤, v \in M)$ .

Weights

For any $𝔤$ –module $M$ and any $λ \in 𝔥^{*}$ we define

M_{λ} ≔ {v \in M : h . v = λ (h) v for all h \in 𝔥} .

If $M_{λ} \neq 0$ we say that $λ$ is a weight of $M$ , that $M_{λ}$ is the weight space and that the elements of $M_{λ}$ are the weight vectors for the weight $λ$ . We have

\begin{matrix} M_{λ} ≅ {Hom}_{U (𝔥)} (E_{λ}, M) & (✶) \end{matrix}

where $E_{λ}$ is the $1$ –dimensional $𝔥$ –module defined by $λ,$ that is to say $E_{λ} = k e_{λ}$ where $h . e_{λ} = λ (h) e_{λ}$ for all $h \in 𝔥 .$ The isomorphism $(✶)$ associates to each $v \in M_{λ}$ the homomorphism $E_{λ} \to M$ which takes $e_{λ}$ to $v .$ From $(✶)$ it follows that (for a fixed $λ \in 𝔥^{*}$ ) $M \mapsto M_{λ}$ is a left exact functor (from $𝔤$ –modules to $𝔥$ –modules).

Example: $(M, π) = (𝔤, ad) .$ The weight spaces are $𝔥$ and the $𝔤_{α},$ and the set of weights is $R \cup {0} .$

(3.1)

For $α \in R \cup {0}$ and $λ \in 𝔥^{*}$ we have
$𝔤_{α} . M_{λ} \subset M_{λ + α}$
The sum $M^{'} = \sum_{λ \in 𝔥^{*}} M_{λ}$ is direct, and $M^{'}$ is a $𝔤$ –submodule of $M .$
If $φ : M \to N$ is a $𝔤$ –module homomorphism, then $φ (M_{λ}) \subset N_{λ}$ for all $λ .$

Proof.

Let $x \in 𝔤_{α},$ $v \in M_{λ},$ $h \in 𝔥 .$ Then we calculate
$\begin{matrix} h . (x . v) & = & x . h . v + [h, x] . v \\ = & λ (h) x . v + α (h) x . v \\ = & (λ + α) (h) x . v \end{matrix}$
so that $x . v \in M_{λ + α} .$
If the sum $\sum M_{λ}$ is not direct, there will be nontrivial relations of the form
$\begin{matrix} \sum_{i = 1}^{m} v_{λ_{i}} = 0 & (1) \end{matrix}$
where $v_{λ_{i}} \in M_{λ_{i}}, v_{λ_{i}} \neq 0$ and $λ_{1}, \dots, λ_{m} \in 𝔥^{*}$ are all distinct. Choose such a relation with $m$ $(\geq 2)$ as small as possible. By operating on (1) with an element $h \in 𝔥,$ we obtain
$\begin{matrix} \sum_{i = 1}^{m} λ_{i} (h) v_{λ_{i}} = 0 & (2) \end{matrix}$
Choose $h \in 𝔥$ such that $λ_{1} (h) \neq λ_{2} (h),$ multiply (1) by $λ_{1} (h)$ and subtract from (2). This produces a nontrivial relation of length $< m :$ contradiction.

Also it is clear from (i) that $𝔤_{α} . M^{'} \subset M^{'}$ for each $α \in R \cup {0},$ whence $𝔤 . M^{'} \subset$ M with a prime?
Obvious.

$□$

If $M$ is any $𝔤$ –module, let $P (M) \subset 𝔥^{*}$ denote the set of weights of $M .$ (It might be empty.) Also, for each $λ \in 𝔥^{*},$ let

D (λ) = λ - Q^{+}

and for any subset $F$ of $𝔥^{*}$ let

D (F) = ⋃_{λ \in F} D (λ) .

We shall use this notation only for finite subsets $F$ of $𝔥^{*} .$

Let $𝒪$ denote the category of $𝔤$ –modules $M$ which satisfy the following two conditions:

$M$ is $𝔥$ –diagonalizable with finite dimensional weight spaces, i.e.
$M = \sum_{μ \in P (M)} M_{μ}$
(direct sum, by (3.1)), with each $M_{μ}$ finite-dimensional;
$P (M) \subset D (F)$ for some finite $F \subset 𝔥^{*} .$

The morphisms in $𝒪$ are $𝔤$ –module homomorphisms.

(3.2) Let $(E) 0 ⟶ M^{'} ⟶ M ⟶ M^{''} ⟶ 0$ be a short exact sequences of $𝔤$ –modules, with $M \in 𝒪$ . Then

$M^{'}, M^{''} \in 𝒪;$
For each $λ \in 𝔥^{*}$ the sequence $(E_{λ}) 0 ⟶ M_{λ}^{'} \overset{f}{⟶} M_{λ} \overset{g}{⟶} M_{λ}^{''} ⟶ 0$ is exact;
$P (M) = P (M^{'}) \cup P (M^{''}) .$

Proof.

Since $M$ is $𝔥$ –diagonalizable we have $M^{'} = \underset{λ}{\oplus} M_{λ}^{'}$ by (1.5), and $f (M_{λ}^{'}) \subset M_{λ}$ by (3.1), so that $M_{λ}^{'}$ is finite-dimensional and $P (M^{'}) \subset P (M) \subset D (F) .$ Hence $M^{'} \in 𝒪 .$

Next, we have $g (M_{λ}) \subset M_{λ}^{''}$ by (3.1), for all $λ \in 𝔥^{'},$ hence

M^{''} = g (M) = \sum g (M_{λ}) \subset \sum M_{λ}^{''} \subset M^{''};

consequently we have equality throughout, whence $g (M_{λ}) = M_{λ}^{''}$ for each $λ \in 𝔥^{*}$ and the sequence $(E_{λ})$ is therefore exact. Finally, $M_{λ}^{''}$ is finite-dimensional and $P (M^{''}) \subset P (M),$ so that $M^{''} \in 𝒪,$ and (iii) is now obvious.

$□$

Recall the partial order $λ \geq μ$ on $𝔥^{*} :$ $λ \geq μ$ iff $λ - μ - \sum_{1}^{n} u_{i} α_{i}$ with each $u_{i} \geq 0 .$

(3.3) Each module $M \in 𝒪$ has at least one maximal weight.

Proof.

Suppose $M$ has no maximal weight. Then $P (M)$ contains an infinite strictly increasing sequence $μ_{1} < μ_{2} < \dots .$ For each $λ \in F,$ the $μ_{i} \in D (λ)$ form a subsequence. Since $F$ is finite, at least one of these subsequences is infinite, say $v_{1} < v_{2} < \dots$ in $D (λ) .$ Each $v_{i} \in D (λ) = λ - Q^{+},$ hence $ht (λ - v_{i})$ is a nonnegative integer. It follows that the sequence ${(ht (λ - v_{i}))}_{i \geq 1}$ is an infinite strictly decreasing sequence of integers $\geq 0,$ which is absurd.

$□$

If $M$ has a unique maximal weight $λ,$ then $λ$ is called the highest weight of $M .$

Highest weight $𝔤$ –modules

We shall say that a $𝔤$ –module $M$ is a highest weight (h.w.) module if

$M$ has a highest weight, say $λ;$
$M$ is generated (as $U (𝔤)$ –module) by some $v_{λ} \in M_{λ} .$

(3.4) Let $M$ be a h.w. $𝔤$ –module, with highest weight $λ .$ Then

$M \in 𝒪;$
$dim M_{λ} = 1;$
$P (M) \subset D (λ);$
$M$ has a unique maximal submodule, hence a unique simple quotient;
If $M^{'}$ is a nonzero homomorphic image of $M,$ then $M^{'}$ is h.w. with h.w.

Proof.

We have $λ + α_{i} \notin P (M),$ hence by (3.1) $e_{i} . v_{λ} = 0$ $(1 \leq i \leq n) .$ It follows that $𝔫_{+} . v_{λ} = 0,$ i.e. $U (𝔫_{+}) . v_{λ} = k v_{λ} .$ Since $U (𝔤) = U (𝔫_{-}) U (𝔥) U (𝔫_{+}),$ we have $M = U (𝔤) v_{λ} = U (𝔫_{-}) v_{λ} .$

Let $y_{1}, y_{2}, \dots$ be a $k$ –basis of $𝔫_{-}$ consisting of root vectors. By Poincaré-Birkhoff-Witt, the monomials $y_{1}^{r_{1}} y_{2}^{r_{2}} \dots$ form a $k$ –basis of $U (𝔫_{-}),$ hence the vectors $y_{1}^{r_{1}} y_{2}^{r_{2}} \dots v_{λ}$ span $M$ (as a $k$ –vector space). But each such vector is a weight vector, for if $y_{i} \in 𝔤_{- β_{i}}$ then $y_{1}^{r_{1}} y_{2}^{r_{2}} \dots v_{λ} \in M_{λ - r_{1} β_{1} - r_{2} β_{2} - \dots} .$ It follows that $M$ is the sum of its weight spaces and that each weight space $M_{μ}$ is finite-dimensional, for there are only finitely many solutions of the equation $μ = λ - \sum r_{i} β_{i}$ in non-negative integers $r_{i} .$ Moreover each such $μ \in D (λ),$ and in particular $M_{λ}$ is 1–dimensional, generated by $v_{λ} .$ So we have proved (i) – (iii).
Let $M^{'}$ be a proper submodule of $M .$ Then $M^{'} \in 𝒪$ (3.2), hence $M^{'} = \sum M_{μ}^{'}$ where $M_{μ}^{'} = M^{'} \cap M_{μ} .$ But $M_{λ}^{'} = 0,$ otherwise by (ii) $M_{λ}^{'}$ would contain and hence $M^{'} = M .$ It follows that
$M^{'} \subset M_{+} = \sum_{μ \neq λ} M_{μ}$
and hence the sum of all proper submodules of $M$ is contained in $M_{+},$ hence is a proper submodule. This proves (iv), and (v) is clear.

$□$

We shall now show how to construct all h.w. $𝔤$ –modules.

Verma modules

Let $λ \in 𝔥^{*}$ and let $E_{λ}$ as before denote the 1–dimensional $𝔥$ –module corresponding to $λ :$ $E_{λ} = k u_{λ}$ where $h . u_{λ} = λ (h) u_{λ}$ for all $h \in 𝔥 .$

Let $𝔟 = 𝔥 + 𝔫_{+}$ be the subalgebra of $𝔤$ generated by $𝔥$ and $e_{1}, \dots, e_{n} .$ The subalgebra $𝔟$ is a semidirect product $𝔫_{+} ⋊ 𝔥,$ because $𝔫_{+}$ is an ideal in $𝔟$ and $𝔟 / 𝔫_{+} = 𝔥 .$ We may regard $E_{λ}$ as a $𝔟$ –module by making $𝔫_{+}$ act trivially, i.e. $𝔫_{+} . u_{λ} = 0 .$ The Verma module $V (λ)$ is defined to be the induced $𝔤$ –module

V (λ) = {ind}_{𝔟}^{𝔤} (E_{λ}) = U (𝔤) \otimes_{U (𝔟)} E_{λ} .

Let $v_{λ} = 1 \otimes u_{λ} \in V (λ) .$ Clearly $v_{λ}$ generates $V (λ),$ and since $U (𝔤) = U (𝔫_{-}) \otimes U (something)$ we have $V (λ) = U (𝔫_{-}) . v_{λ},$ showing that $V (λ)$ is a h.w. $𝔤$ –module with highest weight $λ,$ and that it is free of rank 1 as a $U (𝔫_{-})$ –module.

Alternative description of $V (λ) :$ let $J (λ)$ denote the left ideal in $U (𝔤)$ generated by something and all $h - λ (h),$ $h \in 𝔥 .$ Then

V (λ) ≅ U (𝔤) / J (λ) .

For if $π$ is the representation of $U (𝔟)$ on $E_{λ},$ then $π : U (𝔟) \to k$ is such that $π (e_{i}) = 0 (1 \leq i \leq n)$ and $π (h) = λ (h),$ all $h \in 𝔥;$ hence $K = Ker (π)$ is the left ideal of codimension 1 in $U (𝔟)$ generated by $𝔫_{+}$ and all $h - λ (h);$ tensoring the exact sequence (of left $U (𝔟)$ –modules)

0 ⟶ K ⟶ U (𝔟) ⟶ E_{λ} ⟶ 0

with $U (𝔤)$ (over $U (𝔟)$ ) gives

U (𝔤) \otimes_{U (𝔟)} K ⟶ U (𝔤) ⟶ V (λ) ⟶ 0

and the image of $U (𝔤) \otimes_{U (𝔟)} K$ in $U (𝔤)$ is $J (λ) .$

The Verma modules are the "universal" h.w. $𝔤$ –modules:

(3.5)

$V (λ)$ is a h.w. $𝔤$ –module with highest weight $λ .$
Every h.w. $𝔤$ –module with highest weight $λ$ is a homomorphic image of $V (λ) .$

Proof.

Already observed above.
Let $M$ be a h.w. with generator $x \in M_{λ} .$ Then the ideal $J (λ)$ kills $x,$ hence $M$ is a homomorphic image of $U (𝔤) / J (λ) = V (λ) .$

$□$

By (3.4)(iv) it follows that $V (λ)$ has a unique simple quotient $L (λ) :$ by (3.2) and (3.4)(i), we have $L (λ) \in 𝒪 .$ Moreover, the $L (λ)$ are precisely the simple objects in the category $𝒪 :$

(3.6) If $M \in 𝒪$ is simple, then $M ≅ L (λ)$ for a unique $λ \in 𝔥^{*} .$

Proof.

By (3.3), $M$ has at least one maximal weight, say $λ .$ Let $x \in M_{λ},$ $x \neq 0 .$ Then $𝔫_{+} . x = 0 .$ (because $λ + α_{i} \notin P (M), 1 \leq i \leq n),$ hence $x$ is killed by $J (λ),$ and therefore the submodule $U (𝔤) . x = M^{'}$ generated by $x$ is a quotient of $V (λ) .$ Since $M^{'} \neq 0$ and $M$ is simple, we have $M^{'} = M;$ hence $M$ is a simple quotient of $V (λ),$ hence $M ≅ L (λ) .$

Suppose also that $M ≅ L (μ) .$ Then we have a $𝔤$ –isomorphism $something (λ) \tilde{\to} L (μ),$ under which weight spaces correspond (3.1). Hence $λ$ is a weight of $L (μ),$ whence $λ \leq μ;$ similarly $μ \leq λ$ and therefore $λ = μ .$

$□$

(3.7) Example. When $λ = 0,$ $E_{λ} = k$ with trivial $𝔥$ –action $(h . 1 = 0),$ and $V (0) = U (𝔫_{-}) .$ The maximal submodule of $V (0)$ is the augmentation ideal of $U (𝔫_{-}),$ hence $L (0)$ is the trivial 1–dimensional $𝔤$ –module.

(3.8) Ket $M$ be a h.w. module. Then ${End}_{𝔤} (M) = k .$

Proof.

Let $v_{λ} = M_{λ}$ be a h.w. vector which generates $M .$ If $φ : M \to M$ is a $𝔤$ –module homomorphism something have $φ (v_{λ}) \in M_{λ},$ hence $= a v_{λ}$ for some $a \in k$ (because $dim M_{λ} = 1$ (3.4)). The Kernel of $φ - a . 1$ is a submodule of $M$ which contains $v_{λ},$ hence is the whole of $M,$ i.e. $φ - a . 1 = 0 .$

$□$

Characters

Let $ε$ be the set of all functions $f : 𝔥^{*} \to ℤ$ such that $Supp (f) \subset D (F)$ for some finite subset $F$ of $𝔥^{*} :$ i.e. $f (μ) = 0$ unless $λ - μ \in Q^{+}$ for some $λ \in F .$ Clearly $ε$ is closed under addition and subtraction of functions; define multiplication by convolution:

\begin{matrix} (f g) (v) = \sum_{λ + μ = v} f (λ) g (μ) (finite sum) & (1) \end{matrix}

If $Supp (f) \subset D (F),$ $Supp (g) \subset D (G),$ then $Supp (f g) \subset D (F + G) .$ Thus $ε$ is a commutative ring.

A family ${(f j)}_{j \in J}$ of functions in $ε$ is summable if $\exists$ finite $F \subset 𝔥^{*}$ such that

$Supp (f j) \subset D (F)$ for all $j \in J;$
for each $λ \in 𝔥^{*}$ we have $f j (λ) = 0$ for almost all $j \in J .$

In that case the function $f$ defined by

f (λ) = \sum_{j \in J} f_{j} (λ)

is well defined and belongs to $ε,$ and we write $f = \sum f_{j} .$

For each $λ \in 𝔥^{*}$ let $e^{λ}$ denote the characteristic function of $λ .$ Then $e^{λ} e^{μ} = e^{λ + μ}$ from the rule (1) defining multiplication. For any $f \in ε,$ the family ${(f (λ) e^{λ})}_{λ \in 𝔥^{*}}$ is summable, and we have

\begin{matrix} f = \sum_{λ} f (λ) e^{λ} \in \sum_{μ \in F} e^{μ} ℤ [[e^{- α_{1}}, \dots, e^{- α_{n}}]] & (2) \end{matrix}

Now let $M \in 𝒪$ and define the formal character of $M$ to be the function $ch (M)$ defined by

ch (M) (λ) = dim M_{λ};

thus $Supp (ch (M)) = P (M) \subset D (F)$ for some finite $F \subset 𝔥^{*},$ so that $ch (M) \in ε$ and by (2) we have

ch (M) = \sum_{λ \in P (M)} dim M_{λ} . e^{λ}

Thus $ch (M)$ is nothing but the generating function for the multiplicities of the weights $λ \in P (M) .$

(3.9) ch is an additive function on the category $𝒪,$ i.e. if $0 ⟶ M^{'} ⟶ M ⟶ M^{''} ⟶ 0$ is an exact sequence in $ℳ,$ then

ch (M) = ch (M^{'}) + ch (M^{''}) .

Proof.

This follows from (3.2)(ii) by counting dimensions

$□$

More generally, if $0 ⟶ M_{0} ⟶ M_{1} ⟶ \dots ⟶ M_{r} ⟶ 0$ is an exact sequence in $𝒪,$ we have

\sum_{i = 0}^{r} {(- 1)}^{i} ch (M_{i}) = 0

by breaking up the exact sequence into short exact sequences and applying (3.9).

We shall first compute the character of a Verma module $V (λ) :$

(3.10) Let $λ \in 𝔥^{*} .$ Then

ch (V (λ)) = e^{λ} / \prod_{α \in R^{+}} {(1 - e^{- α})}^{m_{α}} = e^{λ} / π say

where as usual $m_{α} = dim 𝔤_{α} = dim 𝔤_{- α} .$ (The product on the right is a unit in $ε .)$

Proof.

We saw earlier that $V (λ)$ is a free $U (𝔫_{-})$ –module of rank 1. As before, let $y_{1}, y_{2}, \dots$ be a $k$ –basis of $𝔫_{-}$ consisting of root vectors, say $y_{i} \in 𝔤_{- β_{i}} .$ Then $V (λ)$ has a $k$ –basis consisting of weight vectors $y_{1}^{r_{1}} y_{2}^{r_{2}} \dots v_{λ},$ where each $r_{i} \geq 0$ (and $\sum r_{i} < \infty),$ the vector just written being of weight $λ - r_{1} β_{1} - r_{2} β_{2} - \dots$

$□$

Invariant bilinear form

As before, $A = (a_{i j})$ is any $n \times n$ matrix $/ k,$ and $𝔤 = 𝔤 (A) .$ Suppose that there exists a symmetric ( $k$ –valued) bilinear form $⟨ x, y ⟩$ on $𝔤$ such that

$⟨ [x, y], z ⟩ = ⟨ x, [y, z] ⟩$ for all $x, y, z \in 𝔤$ (invariance)
the restriction of $⟨, ⟩$ to $𝔥$ is nondegenerate.

Then for any $h \in 𝔥$ we have

\begin{matrix} ⟨ h, h_{j} ⟩ & = & ⟨ h, [e_{j}, f_{j}] ⟩ & = & ⟨ [h, e_{j}], f_{j} ⟩ \\ = & α_{j} (h) ⟨ e_{j}, f_{j} ⟩ \end{matrix}

Let $ε_{j} = ⟨ e_{j}, f_{j} ⟩ \in k .$ Condition (ii) ensures that $ε_{j} \neq 0;$ taking $h = h_{i}$ in the calculation above we have

⟨ h_{i}, h_{j} ⟩ = a_{i j} ε_{j}

and therefore the matrix $A E = (a_{i j} ε_{j})$ is symmetric, i.e. $A$ is symmetrizable ( $E$ is a nonsingular diagonal matrix).

This proves the first part of

(3.11) Let $⟨ x, y ⟩$ be an invariant symmetric bilinear form on $𝔤,$ whose restriction to $𝔥$ is nondegenerate. Then

the matrix $A$ is symmetrizable
$⟨ x, y ⟩$ is nondegenerate on $𝔤$
$⟨ x, y ⟩$ restricted to $𝔤_{α} \times 𝔤_{β}$ (where $α, β \in R \cup {0}$ ) is
1. zero if $α + β \neq 0$
2. nondegernerate if $α + β = 0$
If $α \in R$ and $x \in 𝔤_{x},$ $y \in 𝔤_{- α},$ then
$[x, y] = ⟨ x, y ⟩ h_{α}^{\lor}$
where $h_{α}^{\lor} \in 𝔥$ is defined by $⟨ h, h_{α}^{\lor} ⟩ = α (h)$ for all $h \in 𝔥 .$

Proof.

Let $𝔞 = {x \in 𝔤 : ⟨ x, 𝔤 ⟩ = 0} .$ Since the form is invariant, $𝔞$ is an ideal in $𝔤 :$ for if $x \in 𝔞$ and $y, z \in 𝔤$ we have
$⟨ [x, y], z ⟩ = ⟨ x, [y, z] ⟩ = 0$
whence $[x, y] \in 𝔞 .$

Now all ideals in $𝔞$ are graded (1.7), hence $𝔞 = \sum 𝔞_{α},$ where $𝔞_{α} = 𝔞 \cap 𝔤_{α}$ (and $𝔞_{0} = 𝔞 \cap 𝔥) .$ But $𝔞_{0} = 0,$ because if $h \in 𝔞_{0}$ then certainly $⟨ h, 𝔥 ⟩ = something$ whence $h = 0 .$ But $𝔤$ has no nontrivial ideals with trivial $𝔥$ –component, hence $𝔞 = 0 .$
Let $x \in 𝔤_{α},$ $y \in 𝔤_{β},$ $h \in 𝔥 .$ Then $⟨ [x, h], y ⟩ = ⟨ x, [h, y] ⟩$ and thus
$- α (h) ⟨ x, y ⟩ = β (h) ⟨ x, y ⟩$
If $α + β \neq 0,$ choose $h$ such that $α (h) + β (h) \neq 0 .$ It follows that $⟨ x, y ⟩ = 0,$ which proves (a).

Next, suppose $x \in 𝔤_{α}$ is such that $⟨ x, 𝔤_{- α} ⟩ = 0 .$ Then $⟨ x, 𝔤 ⟩ = 0$ by (a), whence $x = 0$ by (ii).
We have
$⟨ h, [x, y] ⟩ = ⟨ [h, x], y ⟩ = α (h) ⟨ x, y ⟩ = ⟨ h, h_{α}^{\lor} ⟩ ⟨ x, y ⟩$
whence the result, by nondegeneracy.

$□$

Proposition (3.11) has a converse:

(3.12) Suppose that the matrix $A$ is symmetrizable. Then there exists a nondegenerate symmetric invariant bilinear form $⟨ x, y ⟩$ on $𝔤$ (which therefore has the properties listed in (3.11)).

Proof.

By assumption, there exist non-zero scalars $ε_{j}$ such that $a_{i j} ε_{j} = a_{j i} ε_{i} .$ We shall first construct the form on $𝔥$ (of (2.23)) and then extend it to $𝔤 .$

Choose a vector space complement $𝔥^{''}$ of $𝔥^{'}$ in $𝔥$ (where as usual $𝔥^{'} = \sum_{1}^{n} k h_{i}$ ) and define $⟨ x, y ⟩$ on $𝔥 \times 𝔥$ by

\begin{matrix} ⟨ x, h_{i} ⟩ & = & ⟨ h_{i}, x ⟩ & = & ε_{i} α_{i} (x) & (x \in 𝔥) \\ ⟨ y, z ⟩ & = & 0 & (y, z \in 𝔥^{*}) \end{matrix}

To see that this form is nondegenerate, suppose that $h \in 𝔥$ is such that $⟨ h, 𝔥 ⟩ = 0 .$ Then in particular we have $ε_{i} α_{i} (h) = ⟨ h, h_{i} ⟩ = 0,$ whence $h \in \cap_{1}^{n} Ker α_{i} = 𝔠 \subset 𝔥^{'};$ thus $\sum λ_{i} h_{i}$ say, and then

\sum_{1}^{n} λ_{i} ε_{i} α_{i} (x) = ⟨ h, x ⟩ = 0

for all $x \in 𝔥,$ so that $\sum λ_{i} ε_{i} α_{i} = 0$ in $𝔥^{*},$ hence $λ_{1} = \dots = λ_{n} = 0$ and so $h =$ something.

Recall the principal $ℤ$ –grading of $𝔤 :$

𝔤_{r} = \sum_{ht α = r} 𝔤_{α}; 𝔤 = \sum_{r \in ℤ} 𝔤_{r}; 𝔤_{0} = 𝔥

Let $G_{n} = \sum_{∣ r ∣ \leq n} 𝔤_{r}$ for $n \geq 0 .$

The extension of $⟨ x, y ⟩$ to $G_{1}$ is unique, for by (3.11)(iii) we must have $⟨ 𝔤_{α}, 𝔤_{β} ⟩ = 0$ if $α + β \neq 0,$ and also $⟨ e_{j}, f_{j} ⟩ = ε_{j} .$ It is straightforward to verify that

\begin{matrix} ⟨ [x, y], z ⟩ = ⟨ x, [y, z] ⟩ & (1) \end{matrix}

whenever all 5 terms lie in $G_{1} .$

We shall now extend $⟨, ⟩$ to a symmetric bilinear form on $G_{n}$ $(n \geq 2)$ by induction on $n,$ such that (1) holds whenever all 5 terms are in $G_{n},$ and such that

\begin{matrix} ⟨ 𝔤_{i}, 𝔤_{j} ⟩ = 0 if i + j \neq 0 & (2) \end{matrix}

whenever $∣ i ∣ \leq n$ and $∣ j ∣ \leq n .$

So assume $n \geq 2$ and $⟨, ⟩$ defined on $G_{n - 1},$ satisfying (1) and (2). To extend the form to $G_{n}$ we have, in view of (2), only to define $⟨ x, y ⟩$ on $𝔤_{n} \times 𝔤_{- n} .$ Write

\begin{matrix} x & = & \sum_{i} [s_{i}, t_{i}] & \in & 𝔤_{n} & (3) \\ y & = & \sum_{j} [u_{j}, v_{j}] & \in & 𝔤_{- n} & (4) \end{matrix}

where $s_{i}, t_{i}$ (resp. $u_{j}, v_{j})$ are homogeneous of positive (resp. negative) degree, hence lie in $G_{n - 1} .$ Define now

\begin{matrix} ⟨ y, x ⟩ = ⟨ x, y ⟩ = \sum_{j} ⟨ [x, u_{j}], v_{j} ⟩ . & (5) \end{matrix}

The whole point is to show that this is well defined, i.e. that it does not depend on the expression (4) for $y .$ For this purpose we make the following calculation: dropping the suffixes,

\begin{matrix} ⟨ [[s, t], u], v ⟩ & = & ⟨ [s, [t, u]], v ⟩ - ⟨ [t, [s, u]], v ⟩ & (Jacobi) \\ = & - ⟨ [t, u], [s, v] ⟩ + ⟨ [s, u], [t, v] ⟩ & (invariance) \\ = & - ⟨ [s, v], [t, u] ⟩ - ⟨ [s, u], [v, t] ⟩ & (symmetry) \\ = & - ⟨ s, [v, [t, u]] ⟩ - ⟨ s, [u [v, t]] ⟩ & (invariance) \\ = & ⟨ s, [t, [u, v]] ⟩ & (Jacobi) \end{matrix}

i.e. we have

\begin{matrix} ⟨ [[s, t], u], v ⟩ & = & ⟨ s, [t, [u, v]] ⟩ & (6) \end{matrix}

From (3), (4) and (6) it follows that

\begin{matrix} \sum_{j} ⟨ [x, u_{j}], v_{j} ⟩ & = & \sum_{i, j} ⟨ [[s_{i}, t_{i}], u_{j}], v_{j} ⟩ \\ = & \sum_{i, j} ⟨ s_{i}, [t_{i}, [u_{j}, v_{j}]] ⟩ \\ = & \sum_{i} ⟨ s_{i}, [t_{i}, y] ⟩ \end{matrix}

and therefore $⟨ x, y ⟩$ (as defined by (5)) is well-defined, and satisfies the invariance condition (1) by our definition (5).

$□$

Notation. In $𝔥$ we have $⟨ h_{i}, x ⟩ = ε_{i} α_{i} (x),$ in particular

⟨ h_{i}, h_{j} ⟩ = a_{i j} ε_{j} = a_{j i} ε_{i}

and an isomorphism $θ : 𝔥 \to 𝔥^{*}$ definted by

θ (x) (y) = ⟨ x, y ⟩

so that

θ (h_{i}) (x) = ⟨ h_{i}, x ⟩ = ε_{i} α_{i} (x)

for all $x \in 𝔥,$ whence

\begin{matrix} θ (h_{i}) & = & ε_{i} α_{i} & = & α_{i}^{\lor} \\ θ^{- 1} (α_{i}) & = & ε_{i}^{- 1} h_{i} & = & h_{i}^{\lor} \end{matrix}

We use $θ$ to transport the scalar product from $𝔥$ to $𝔥 * :$ thus

⟨ α_{i}, α_{j} ⟩ = ⟨ ε_{i}^{- 1} h_{i}, ε_{j}^{- 1} h_{j} ⟩ = ε_{i}^{- 1} a_{i j} = ε_{j}^{- 1} a_{j i}

Casimir operator

In the classical situation, where $𝔤$ is finite-dimensional, the Casimir operator plays an important role in representation theory. The invariant bilinear form may be regarded as an element $B \in {(𝔤 \otimes 𝔤)}^{*} = 𝔤^{*} \otimes 𝔤^{*};$ since it is non-degenerate it induces an isomorphism of $𝔤^{*}$ with $𝔤,$ hence determines an element of $𝔤 \otimes 𝔤 .$ The image of this in $U (𝔤)$ (which is a quotient of the tensor algebra $T (𝔤))$ is the Casimir element $ω$ . Since $B$ is invariant it follows that $ω$ is in the centre of $U (𝔤),$ hence acts as a scalar on any simple $𝔤$ –module. Explicitly, if $x_{1}, \dots, x_{n}$ is any $k$ –basis of $𝔤,$ let $y_{1}, \dots, y_{n}$ be the dual basis (so that $⟨ x_{i}, y_{j} ⟩ = δ_{i j});$ then $ω = \sum_{1}^{n} y_{i} x_{i} .$

In the present situation, where $A$ is any symmetrizable matrix, we proceed as follows. Let $α \in R^{+} \cup {0};$ by (3.11), the bilinear form $⟨ x, y ⟩$ is nondegenerate on $𝔤_{α} \times 𝔤_{- α};$ choose a basis $x_{1}, \dots, x_{m}$ of $𝔤_{α}$ $(m = m_{α});$ let $y_{1}, \dots, y_{m}$ be the dual basis of $𝔤_{- α},$ and define

u_{α} = \sum_{i = 1}^{m} y_{i} x_{i} \in U (𝔤) .

Then $u_{α}$ is independent of the choice of dual bases, for if $x_{1}^{'}, \dots, x_{m}^{'};$ $y_{1}^{'}, \dots, y_{m}^{'}$ is another pair of dual bases, we have

\begin{matrix} y_{i} & = & \sum_{j} ⟨ x_{j}^{'}, y_{i} ⟩ y_{j}^{'} \\ x_{j}^{'} & = & \sum_{i} ⟨ x_{j}^{'}, y_{i} ⟩ x_{i} \end{matrix}

and therefore

\sum_{i} y_{i} x_{i} = \sum_{i, j} ⟨ x_{j}^{'}, y_{i} ⟩ y_{j}^{'} x_{i} = \sum_{j} y_{j}^{'} x_{j}^{'} .

If $α \in Q^{+}$ is not a root (or zero) we define $u_{α} = 0$ (the sum is empty).

Example: We have $⟨ e_{i}, f_{i} ⟩ = ε_{i},$ hence $u_{α_{i}} = ε_{i}^{- 1} f_{i} e_{i} . (1)$

Let $x \in 𝔤_{β}$ $(β \in R \cup {0}),$ then we have

\begin{matrix} [u_{α}, x] & = & \sum_{i = 1}^{m} (y_{i} x_{i} x - x y_{i} x_{i}) \\ = & \sum_{i} [y_{i}, x] x_{i} - \sum_{i} y_{i} [x, x_{i}] \\ = & v_{α, x} - v_{α, x}^{'} say \end{matrix}

where, for the same reason as before, $v_{α, x}$ and $v_{α, x}^{'}$ are independent of the choice of dual bases. Since $x \in 𝔤_{β}$ and $x_{i} \in 𝔤_{α}$ we have $[x, x_{i}] \in 𝔤_{α + β} .$ Let $(x_{j}^{'})$ be a basis of $𝔤_{α + β},$ $(y_{j}^{'})$ the dual basis of $𝔤_{- (α + β)} .$ Then

\begin{matrix} [x, x_{i}] & = & \sum_{j} ⟨ y_{j}^{'}, [x, x_{i}] ⟩ x_{j}^{'} \\ = & \sum_{j} ⟨ [y_{j}^{'}, x], x_{i} ⟩ x_{j}^{'} (invariance) \end{matrix}

and therefore

\begin{matrix} v_{α, x}^{'} & = & \sum_{i} y_{i} [x, x_{i}] \\ = & \sum_{i, j} ⟨ [y_{j}^{'}, x], x_{i} ⟩ y_{i} x_{j}^{'} \\ = & \sum_{j} [y_{j}^{'}, x] x_{j}^{'} = v_{α + β, x} \end{matrix}

i.e. we have the formula

\begin{matrix} \begin{matrix} v_{α, x}^{'} = v_{α + β, x} \end{matrix} (x \in 𝔤_{β}) & (2) \end{matrix}

Likewise

\begin{matrix} \begin{matrix} v_{α, x} = v_{α - β, x}^{'} \end{matrix} & (3) \end{matrix}

In particular, $v_{α, x}^{'} = 0$ unless both $α$ and $α + β$ are positive roots (or 0); and likewise $v_{α, x} = 0$ unless both $α, α - β \in R^{+} \cup {0} .$

Now let

u = \sum_{α \in R^{+}} u_{α}

(in some completion of $U (𝔤)$ ...)

(3.13) We have

\begin{matrix} [u, e_{i}] & = & - h_{i}^{\lor} e_{i} \\ [u, f_{i}] & = & f_{i} h_{i}^{\lor} \\ [u, h] & = & 0 (h \in 𝔥) \end{matrix}

where $h_{i}^{\lor} =$ image of $α_{i}$ under the isomorphism $𝔥^{*} \to 𝔥$ induced by the bilinear form. (i.e. $⟨ h_{i}^{\lor}, h ⟩ = α_{i} (h),$ so that $h_{i}^{\lor} = ε_{i}^{- 1} h_{i})$

Proof.

We compute:

\begin{matrix} [u, e_{i}] & = & \sum_{α \in R^{+}} [u_{α}, e_{i}] \\ = & \sum_{α \in R^{+}} v_{α, e_{i}} - \sum_{α \in R^{+}} v_{α, e_{i}}^{'} \\ = & \sum_{α \in R^{+}} v_{α, e_{i}} - \sum_{α \in R^{+}} v_{α + α_{i}, e_{i}} by (2) \end{matrix}

But $v_{α, e_{i}} = 0$ unless $α - α_{i} \in R^{+} \cup {0},$ and therefore

[u, e_{i}] = v_{α_{i}, e_{i}} = ε_{i}^{- 1} [f_{i}, e_{i}] e_{i} = - ε_{i}^{- 1} h_{i} e_{i} = - h_{i}^{\lor} e_{i} .

Similarly we have

\begin{matrix} [u, f_{i}] & = & \sum_{α} v_{α, f_{i}} - \sum_{α \in R^{+}} v_{α, f_{i}}^{'} \\ = & \sum_{α} v_{α + α_{i}, f_{i}}^{'} - \sum_{α} v_{α, f_{i}}^{'} \\ = & - v_{α_{i}, f_{i}}^{'} = - ε_{i}^{- 1} f_{i} [f_{i}, e_{i}] = ε_{i}^{- 1} f_{i} h_{i} = f_{i} h something? \end{matrix}

Finally $[u, h] = \sum_{α} (v_{α, h} - v_{α, h}^{'}) = 0$ by (2).

$□$

Choose an element $ρ \in 𝔥^{*}$ such that

ρ (h_{i}) = \frac{1}{2} a_{i i} (1 \leq i \leq n)

(thus $ρ (h_{i}) = 1$ if $A$ is a Cartan matrix). Then we have

⟨ ρ, α_{i} ⟩ = ρ (h_{i}^{\lor}) = ε_{i}^{- 1} ρ (h_{i}) = \frac{1}{2} ε_{i}^{- 1} a_{i i} = \frac{1}{2} ⟨ α_{i}, α_{i} ⟩

i.e.,

\begin{matrix} ⟨ 2 ρ, α_{i} ⟩ = ⟨ α_{i}, α_{i} ⟩ . \end{matrix}

Now let $M \in 𝒪$ and define a $k$ –linear map

Ω = Ω_{M} : M ⟶ M

as follows: if $v_{λ} \in M_{λ}$ $(λ \in P (M))$ then

Ω (v_{λ}) = {∣ λ + ρ ∣}^{2} v_{λ} + 2 u . v_{λ}

where ${∣ λ + ρ ∣}^{2} = ⟨ λ + ρ, λ + ρ ⟩$ and

u . v_{λ} = \sum_{α \in R^{+}} u_{α} . v_{λ}

is a finite sum, because $𝔤_{α} . v_{λ} = 0$ for almost all $α \in R^{+} .$

(3.14) $Ω_{M}$ is a $𝔤$ –module homomorphism.

Proof.

Since $𝔤 = 𝔤 (A)$ is generated by the $e_{i},$ the $f_{i}$ and $𝔥,$ it is enough to verify that $Ω$ commutes with the action of each of these elements. So we calculate:

\begin{matrix} Ω (e_{i} . v_{λ}) - e_{i} Ω (v_{λ}) & = & ({∣ λ + α_{i} + ρ ∣}^{2} - {∣ λ + ρ ∣}^{2}) e_{i} . v_{λ} + 2 [u, e_{i}] . v_{λ} \\ = & ⟨ α_{i}, 2 λ + 2 ρ + α_{i} ⟩ e_{i} . v_{λ} - 2 h_{i}^{\lor} . e_{i} . v_{λ} by (3.1. something \\ = & (⟨ α_{i}, 2 λ + 2 ρ + α_{i} ⟩ - 2 ⟨ α_{i}, λ + α_{i} ⟩) e_{i} . v_{λ} \\ = & ⟨ α_{i}, 2 ρ - α_{i} ⟩ e_{i} . v_{λ} = 0 . \end{matrix}

Likewise,

\begin{matrix} Ω (f_{i}, v_{λ}) - f_{i} . Ω (v_{λ}) & = & ({∣ λ - α_{i} + ρ ∣}^{2} - {∣ λ + ρ ∣}^{2}) f_{i} . v_{λ} + 2 [u, f_{i}] . v_{λ} \\ = & - ⟨ α_{i}, 2 λ + 2 ρ - α_{i} ⟩ f_{i} . v_{λ} + 2 f_{i} h_{i}^{\lor} . v_{λ} by (3.13) \\ = & (- ⟨ α_{i}, 2 λ + 2 ρ - α_{i} ⟩ + 2 ⟨ α_{i}, λ ⟩) f_{i} . v_{λ} \\ = & - ⟨ α_{i}, 2 ρ - α_{i} ⟩ f_{i} . v_{λ} = 0 . \end{matrix}

Finally,

\begin{matrix} Ω (h . v_{λ}) - h . Ω (v_{λ}) & = & ({∣ λ + ρ ∣}^{2} - {∣ λ + ρ ∣}^{2}) h . v_{λ} + 2 [u, h] . v_{λ} \\ = & 0 by (3.13) again. \end{matrix}

$□$

Remark: $Ω$ is functorial, i.e. if $f : M \to N$ is a $𝔤$ –module homomorphism (with $M, N \in 𝒪)$ then the diagram

\begin{matrix} M & \overset{f}{⟶} & N \\ Ω_{M} ↓ & ↓ Ω_{N} \\ M & \overset{f}{⟶} & N \end{matrix}

commutes. For $f$ commutes with the action of $u,$ and preserves weight spaces.

(3.15) Example. Let $M$ be a h.w. module with h. wt. $λ .$ If $v_{λ} \in M_{λ}$ is a generator of $M$ (3.4), we have $𝔤_{α} . v_{λ} = 0$ for all $α \in R^{+},$ hence $u . v_{λ} = 0$ and therefore $Ω_{M} . v_{λ} = {∣ λ + ρ ∣}^{2} v_{λ} .$ Hence by (3.14) (since $v_{λ}$ generates $M)$

Ω_{M} = {∣ λ + ρ ∣}^{2} . 1_{M} .

(3.16) Let $M \in 𝒪$ be such that $Ω_{M} = a . 1_{M}$ for some scalar $a .$ Let $F$ be a finite subset of $𝔥^{*}$ such that $P (M) \subset D (F),$ and let

S = {λ \in D (F) : {∣ λ + ρ ∣}^{2} = a} .

Then there exist integers $d_{λ}, λ \in S$ such that

ch (M) = \prod^{- 1} \sum_{λ \in S} d_{λ} e^{λ}

where $\prod = \prod_{α \in R^{+}} {(1 - e^{- α})}^{m_{α}} .$

Proof.

If $μ \in D (F)$ we have $λ - μ \in Q^{+}$ for some $λ \in F,$ hence $ht (λ - μ)$ is an integer $\geq 0 .$ Define the depth of $μ$ (relative to $F)$ to be

δ (μ) = max {ht (λ - μ) : λ \in F, μ \in D (λ)}

so that $λ (μ) \in ℕ;$ also define

δ (M) = min {δ (μ) : μ \in P (M)} .

Since $F$ is finite there are only finitely many $μ \in D (F)$ of given depth; in particular, $M$ has only finitely many weights $μ$ of least depth $δ (M),$ and they are all maximal weights. Call them $μ_{1}, \dots, μ_{r} .$

We shall kill the weight spaces $M_{μ_{i}}$ $(1 \leq i \leq r) .$ Let $d_{i} = dim M_{μ_{i}},$ and let

V = \oplus_{i = 1}^{r} V {(μ_{i})}^{d_{i}} .

Choose a $k$ –basis of each $M_{μ_{i}}$ and let $φ : V \to M$ be the $𝔤$ –homomorphism which maps the generators of the summands of $V$ to the chosen basis elements of the $M_{μ_{i}} .$ Let $M^{'}, M^{''}$ be the kernel and cokernel of $φ,$ so that we have an exact sequence

0 ⟶ M^{'} ⟶ V \overset{φ}{⟶} M ⟶ M^{''} ⟶ 0 .

Then $M^{'} \in 𝒪$ because it is a submodule of $V,$ and $M^{''} \in 𝒪$ because it is a quotient of $M .$ Now $Ω$ acts as scalar multiplication by ${∣ μ_{i} + ρ ∣}^{2}$ on $V {(μ_{i})}^{d_{i}}$ (3.15), and hence also on the image $φ (V {(μ_{i})}^{d_{i}}),$ which is a non zero submodule of $M .$ Since by hypothesis $Ω$ acts as scalar multiplication by $a$ on $M,$ it follows that ${∣ μ_{i} + ρ ∣}^{2} = a,$ i.e. $μ_{i} \in S$ $(1 \leq i \leq r) .$ Hence $Ω$ acts as $a . 1$ on $V,$ and hence on $M^{'};$ also on $M^{''} .$ By construction we have $δ (M^{'}) > δ (M)$ and $δ (M^{''}) > δ (M),$ and by additivity of ch (3.9)

\begin{matrix} ch (M) & = & ch (V) + ch (M^{''}) - ch (M^{'}) \\ = & \sum_{i = 1}^{r} d_{i} ch V (μ_{i}) + ch (M^{''}) - ch (M^{'}) . \end{matrix}

Now repeat the same procedure on $M^{'}$ and $M^{''} .$ After we have done it $m$ times we shall have say

ch (M) = \sum_{μ \in S_{m}} d_{μ} ch V (μ) + f_{m}

where $S_{m}$ is some finite subset of $S,$ and $δ (v) > m$ for all $v \in Supp (f_{m}) .$ Now let $m \to \infty$ and we have

\begin{matrix} ch (M) & = & \sum_{μ \in S} d_{μ} ch V (μ) \\ = & \prod^{- 1} \sum_{μ \in S} d_{μ} e^{μ} by (3.10). \end{matrix}

$□$

Remark. Suppose in particular that $M$ is a h.w. module, with highest weight $λ .$ Then

\begin{matrix} ch (M) & = & \prod^{- 1} \sum_{\binom{μ \in D (λ)}{{∣ μ + ρ ∣}^{2} = {∣ λ + ρ ∣}^{2}}} d_{μ} e^{μ} \end{matrix}

with $d_{μ} \in ℤ,$ and in particular $d_{λ} = 1 .$

The Weyl-Kac character formula

From now on, $A$ is a Cartan matrix.

Let $(M, π)$ be a h.w. module, with highest weight $λ \in 𝔥^{*} .$ Then each $π (e_{i})$ is a locally nilpotent endomorphism of $M .$ For if $μ \in P (M),$ say $μ = λ - \sum_{i = 1}^{n} m_{i} α_{i},$ and if $x \in M_{μ},$ then $π {(e_{i})}^{m} x \in M_{μ + m α_{i}} = 0$ if $m > m_{i} .$

If also each $π (f_{i})$ is locally nilpotent on $M,$ we shall say that $M$ is a quasi-simple $𝔤$ –module. (Later we shall see that quasi-simple $\Rightarrow$ simple).

(3.17) Let $(M, π)$ be a h.w. module with highest weight $λ,$ and generator $x \in M_{λ} .$ If $\exists k \geq 1$ such that $π {(f_{i})}^{k} x = 0$ for $1 \leq i \leq n,$ then $M$ is quasi-simple.

Proof.

Recall the formula (1.16)

x^{N} y = \sum_{r = 0}^{N} (\binom{N}{r}) {(ad x)}^{r} y x^{N - r}

$(x, y \in$ associative ring $R) .$ Let $v \in M,$ so that $v = π (u) x$ for some $u \in U (𝔤),$ and apply (1.16) with $x = π (f_{i}),$ $y = π (u) :$

\begin{matrix} π {(f_{i})}^{N} v & = & π {(f_{i})}^{N} π (u) x \\ = & \sum_{r = 0}^{N} (\binom{N}{r}) π ({ad f_{i}}^{r} u) π {(f_{i})}^{N - r} x . \end{matrix}

Now $ad f_{i}$ is locally nilpotent on $𝔤$ (1.19), hence also on $U (𝔤),$ so that ${(ad f_{i})}^{m} u = 0$ for some $m \geq 1 .$ Hence if $N$ is large enough $(N = k + m - 1$ would do) either $r \geq m$ or $N - r \geq k$ for each $r \in [0, N],$ and so $π {(f_{i})}^{N}$ something

$□$

(3.18) Let $(M, π)$ be a quasi-simple $𝔤$ –module with highest weight $λ .$ Then:

$ch (M)$ is $W$ –invariant (as a function on $𝔥^{*})$
If $μ \in P (M)$ and $1 \leq i \leq n,$ then the set of integers $r$ such that $μ + r α_{i} \in P (M)$ is a finite interval $[- p, q]$ in $ℤ,$ where $p, q \geq 0$ and $p - q = μ (h_{i}) .$
If $μ \in P (M),$ then $μ (h_{i}) \in ℤ$ $(1 \leq i \leq n) .$

Proof.

We shall make use of the following formula:

e^{ad x} y = e^{x} y e^{- x}

for elements $x, y$ of an associative $ℚ$ –algebra, with $x$ nilpotent (so that $e^{x}$ is defined). The proof is very simple: we have $ad x = λ_{x} - ρ_{x},$ and $λ_{x}, ρ_{x}$ commute, hence

\begin{matrix} e^{ad x} y & = & e^{λ_{x} - ρ_{x}} y & = & e^{λ_{x}} e^{- ρ_{x}} y \\ = & λ_{e^{x}} ρ_{e^{- x}} y & = & e^{x} y e^{- x} . \end{matrix}

Let $x \in 𝔤 .$ Since $ad e_{i}$ and $π (e_{i})$ are locally nilpotent, we have
$\begin{matrix} π (e^{ad e_{i}} x) & = & e^{ad π (e_{i})} . π (x) \\ = & e^{π (e_{i})} π (x) e^{- π (e_{i})} . \end{matrix}$
by the formula above. Similarly with $e_{i}$ replaced by $f_{i} .$ Hence if (as in Ch. II) we write
${\tilde{w}}_{i} = e^{ad e_{i}} e^{- ad f_{i}} e^{ad e_{i}}$
then we have
$\begin{matrix} π ({\tilde{w}}_{i} x) = θ_{i} π (x) θ_{i}^{- 1} & (1) \end{matrix}$
where
$θ_{i} = e^{π (e_{i})} e^{- π (f_{i})} e^{π (e_{i})} \in GL (M)$
Now recall (2.3) that ${\tilde{w}}_{i} h = w_{i} h$ for $h \in 𝔥 .$ It follows from (1) that
$\begin{matrix} π (w_{i} h) = θ_{i} π (h) θ_{i}^{- 1} . & (2) \end{matrix}$
Now let $μ \in P (M),$ $v \in M_{μ},$ $v \neq 0 .$ Then $π (h) v = μ (h) v$ $(h \in 𝔥)$ and therefore
$\begin{matrix} π (h) (θ_{i}^{- 1} v) & = & θ_{i}^{- 1} π (w_{i} h) v by (2) \\ = & θ_{i}^{- 1} (μ (w_{i} h) v) \\ = & (w_{i} μ) (h) θ_{i}^{- 1} v \end{matrix}$
(since $θ_{i}^{- 1}$ is $k$ –linear). This calculation shows that $θ_{i}^{- 1} v \in M_{w_{i} μ},$ and hence that $w_{i} μ \in P (M) .$ Consequently $P (M)$ is $W$ –stable; also $θ_{i}^{- 1}$ takes $M_{μ}$ into $M_{w_{i} μ},$ so that $dim M_{μ} \leq dim M_{w_{i} μ};$ replacing $μ$ by $w_{i} μ$ we get the opposite inequality, hence $ch (M)$ is $W$ –invariant.
Same proof as (2.31) (root strings).
Follows from (b).

$□$

A linear form $λ \in 𝔥^{*}$ is integral if $λ (h_{i}) \in ℤ$ $(1 \leq i \leq n);$ dominant integral if $λ (h_{i}) \in ℕ$ for $1 \leq i \leq n .$

Let $P$ (resp. $P^{+})$ denote the set of all integral (resp. dominant integral) $λ \in 𝔥^{*} .$ Notice that each $α_{j} \in P,$ because $α_{j} (h_{i}) = a_{i j} \in ℤ :$ thus $Q \subset P .$ (Warning: $Q^{+} ⊄ P^{+}) .$ Clearly $P^{+} = P \cap C^{\lor}$ $(C^{\lor}$ the dual fundamental chamber).

(3.19) Let $M$ be a quasi-simple $𝔤$ –module with highest weight $λ .$ Then $λ \in P^{+} .$ Conversely, if $λ \in P^{+}$ then $L (λ)$ is quasi-simple.

Proof.

Recall (1.17)

e_{i} f_{i}^{N + 1} = f_{i}^{N + 1} e_{i} + (N + 1) f_{i}^{N} (h_{i} - N) .

Let $v_{λ} \in M_{λ}$ be a generator of $M .$ Since $M$ is quasi-simple, $\exists N \geq 0$ such that $f_{i}^{N} . v_{λ} \neq 0,$ $f_{i}^{N + 1} . v_{λ} = 0;$ also $e_{i} . v_{λ} = 0,$ whence

\begin{matrix} 0 = e_{i} f_{i}^{N + 1} . v_{λ} = (N + 1) f_{i}^{N} (λ (h_{i}) - N) v_{λ} & (1) \end{matrix}

and therefore $λ (h_{i}) = N \geq 0 .$ Thus $λ \in P^{+} .$ (Notice that this gives another proof of (3.18)(c), namely that $P (M) \subset P :$ for if $μ \in P (M),$ then $μ \in λ - Q^{+} \subset P .)$

For the second part, let $v_{λ}$ be the generator of $L (λ)$ and let $x_{i} = f_{i}^{λ (h_{i}) + 1} v_{λ} .$ I claim that $x_{i} = 0 .$ For now we have from (1) that $e_{i} f_{i}^{N + 1} v_{λ} = 0$ if $N = λ (h_{i}),$ i.e. $e_{i} x_{i} = 0;$ also $e_{j} . x_{i} = f_{i}^{λ (h_{i}) + 1} e_{j} . x_{i} = 0$ if $j \neq i$ (because $e_{j}, f_{i}$ then commute). Hence $x_{i}$ generates a proper submodule of $L (λ) .$ Since $L (λ)$ is simple, we must have $x_{i} = 0 .$ By (3.17), it follows that $L (λ)$ is quasi-simple.

$□$

Recall that $ρ \in 𝔥^{*}$ was chosen such that $ρ (h_{i}) = \frac{1}{2} a_{i i} (1 \leq i \leq n) .$ Since $A$ is now a Cartan matrix, this condition now becomes

ρ (h_{i}) = 1 (1 \leq i \leq n) .

Thus $ρ \in P^{+} .$

For $w \in W,$ let $ε (w) = det (w) = {(- 1)}^{l (w)} .$ (sign character of $W) .$

(3.20) $e^{ρ} \prod$ is $W$ –skew, i.e.

w (e^{ρ} \prod) = ε (w) . e^{ρ} \prod

for all $w \in W .$

Proof.

It is enough to verify this when $w = w_{i}$ is a generator of $W .$ We have

w_{i} ρ = ρ - ρ (h_{i}) α_{i} = ρ - α_{i} .

On the other hand (2.6), $w_{i}$ sends $α_{i}$ to $- α_{i}$ and permutes the positive roots $\neq α_{i}$ . Thus

\begin{matrix} w_{i} (e^{ρ} \prod_{α \in R^{+}} {(1 - e^{- α})}^{m_{α}}) & = & e^{ρ - α_{i}} (1 - e^{α_{i}}) \prod_{\binom{α \in R^{+}}{α \neq α_{i}}} {(1 - e^{- α})}^{m_{α}} (since α_{i} has multiplicity 1) \\ = & - e^{ρ} \prod . \end{matrix}

$□$

Now assume that the Cartan matrix $A$ is symmetrizable. Then the scalar product on $𝔥$ and $𝔥^{*}$ is $W$ –invariant (2.23), and we have

\begin{matrix} ⟨ ρ, α_{i} ⟩ = ε_{i}^{- 1} ρ (h_{i}) = ε_{i}^{- 1} > 0 (1 \leq i \leq n) & (1) \end{matrix}

For the same reason, if $λ \in P^{+}$ we have

\begin{matrix} ⟨ λ, α_{i} ⟩ = ε_{i}^{- 1} λ (h_{i}) \geq 0 . & (2) \end{matrix}

(3.21) Theorem (V. Kac) Let $A$ be a symmetrizable Cartan matrix and let $M$ be a quasi-simple $𝔤 (A)$ –module with highest weight $λ .$ Then

ch (M) = (\sum_{w \in W} ε (w) e^{w (λ + ρ)}) / e^{ρ} \prod_{α \in R^{+}} {(1 - e^{- α})}^{m_{α}} .

Proof.

From (3.16) we have, writing $d_{μ} = c_{μ + ρ},$

e^{ρ} \prod . ch (M) = \sum_{μ} c_{μ + ρ} e^{μ + ρ}

summed over $μ \in D (λ)$ such that ${∣ μ + ρ ∣}^{2} = {∣ λ + ρ ∣}^{2},$ with coefficients $c_{μ + ρ} \in ℤ$ and, in particular, $c_{λ + ρ} = 1 .$

Now $ch (M)$ is $W$ –invariant (3.18) and $e^{ρ} \prod$ is $W$ –skew (3.20). Hence there product is $W$ –skew, and therefore for each $w \in W$ we have

\sum_{μ} c_{μ + ρ} e^{μ + ρ} = \sum_{μ} ε (w) c_{μ + ρ} e^{w (μ + ρ)}

so that $c_{w (μ + ρ)} = ε (w) c_{μ + ρ} .$ Hence if $c_{μ + ρ} \neq 0$ we have $w (μ + ρ) \leq λ + ρ$ for all $w \in W;$ choose $w$ so that $ht (λ + ρ - w (μ + ρ))$ is minimal and put $ν = w (μ + ρ) .$ Then $ht (λ + ρ - w_{i} ν) \geq ht (λ + ρ - ν),$ i.e. $ht (ν - w_{i} ν) \geq 0$ and therefore $ν (h_{i}) \geq 0,$ or equivalently $⟨ ν, α_{i} ⟩ \geq 0 .$

Thus $ν$ satisfies

$⟨ ν, α_{i} ⟩ \geq 0 (1 \leq i \leq n);$
$ν \leq λ + ρ,$ i.e. $λ + ρ = ν + \sum_{1}^{n} m_{i} α_{i}$ with coefficients $m_{i} \geq 0;$
${∣ ν ∣}^{2} = {∣ w (μ + ρ) ∣}^{2} = {∣ μ + ρ ∣}^{2} = {∣ λ + ρ ∣}^{2} .$

These three conditions force $ν = λ + ρ;$ for we have

\begin{matrix} 0 & = & {∣ λ + ρ ∣}^{2} - {∣ ν ∣}^{2} & = & ⟨ λ + ρ + ν, λ + ρ - ν ⟩ \\ = & ⟨ λ + ρ + ν, \sum m_{i} α_{i} ⟩ \\ = & \sum m_{i} ⟨ λ + ρ + ν, α_{i} ⟩ \end{matrix}

But $⟨ λ, α_{i} ⟩ \geq 0$ by (2) because $λ \in P^{+}$ (3.19); $⟨ ρ, α_{i} ⟩ > 0$ (1); and $⟨ ν, α_{i} ⟩ \geq 0$ ((i) above). Hence $⟨ λ + ρ + ν, α_{i} > 0 ⟩ (1 \leq i \leq n),$ and therefore all coefficients $m_{i}$ are 0, hence $ν = λ + ρ$ and therefore $(since c_{λ + ρ} = 1)$

\sum_{μ} c_{μ + ρ} e^{μ + ρ} = \sum_{w \in W} ε (w) e^{w (λ + ρ)} .

$□$

Recall (3.19) that $L (λ)$ is quasi-simple if $λ \in P^{+} .$ The character formula (3.21) shows that if $M$ is a quasi-simple $𝔤$ –module with highest weight $λ$ $(\in P^{+}, by (3.19))$ then $ch (M)$ depends only on $λ .$ It follows that

ch (M) = ch L (λ)

i.e. $dim M_{μ} = dim L {(λ)}_{μ}$ for all $μ .$ But $L (λ)$ is in any case a homomorphic image of $M,$ and so we conclude that $M = L (λ) :$

(3.22) Every quasi-simple $𝔤$ –module is simple.

Another corollary of (3.21) is the "denominator formula":

(3.23) For any symmetrizable Cartan matrix we have

\sum_{w \in W} ε (w) e^{w ρ - ρ} = \prod_{α \in R^{+}} {(1 - e^{- α})}^{m_{α}}

Proof.

Take $λ = 0$ in (3.21) and observe that $L (0)$ is the trivial 1–dimensional $𝔤$ –module (3.7), so that $ch L (0) = 1 .$

$□$

We can write (3.23) in another form, as follows. Recall that for $w \in W,$

\begin{matrix} S (w) & = & {α \in R^{+} : w^{- 1} α \in R^{-}} \\ = & R^{+} \cap w R^{-} \end{matrix}

is a finite set (2.10). Define

s (w) = \sum_{α \in S (w)} α

a finite sum of positive roots. We then have the formula (for any Cartan matrix A)

(3.24) $s (w) = ρ - w ρ .$

Proof.

If $w = w_{i_{1}} \dots w_{i_{r}}$ is a reduced word for $w (r = l (w))$ then (2.9)

S (w) = {α_{i_{1}}, w_{i_{1}} α_{i_{2}}, \dots, w_{i_{1}} \dots w_{i_{r - 1}} α_{i_{r}}}

from which it follows that if $w^{'} = w_{i_{2}} \dots w_{i_{r}}$

S (w) = α_{i_{1}} \cup w_{i_{1}} S (w^{'})

i.e. if $w = w_{i} w^{'}$ with $l (w^{'}) = l (w) - 1,$ then

S (w) = α_{i} \cup w_{i} S (w^{1})

and therefore

\begin{matrix} s (w) = α_{i} + w_{i} s (w^{'}) . & (1) \end{matrix}

To prove (3.24), we proceed by induction on $l (w) .$ The result is clearly true when $l (w) = 0,$ for then $w = 1,$ $s (w) = 0 .$ Assume $l (w) > 0$ and write $w = w_{i} w^{'}$ as above, then

\begin{matrix} s (w) & = & α_{i} + w_{i} (ρ - w^{'} ρ) by (1) \leftarrow ind. hyp. \\ = & α_{i} + (ρ - ρ (h_{i}) α_{i}) - w ρ \\ = & ρ - w ρ \end{matrix}

since $ρ (h_{i}) = 1 .$

$□$

By virtue of (3.24) we can rewrite (3.23) in the form

(3.23 $^{'}$ )

\sum_{w \in W} ε (w) e^{- s (w)} = \prod_{α \in R^{+}} {(1 - e^{- α})}^{m_{α}} .

Also form (3.23) we can rewrite the character formula (3.21) in the form

(3.21 $^{'}$ ) Let $λ \in P^{+},$ then

ch L (λ) = \frac{\sum_{w \in W} ε (w) e^{w (λ + ρ)}}{\sum_{w \in W} ε (w) e^{w ρ}} .

(3.23 $^{'}$ ) is a statement about the root system $R$ and the Weyl group $W;$ it may be formally inverted to give a formula for the multiplicities $m_{α}$ (recall that all real roots have multiplicity 1; the imaginary roots may have multiplicities $m_{α} > 1) .$

Examples

Suppose $A$ is of finite type, so that $𝔤 (A)$ is finite-dimensional and $R$ is finite. In that case
$\begin{matrix} ρ = \frac{1}{2} \sum_{α \in R^{+}} α & (1) \end{matrix}$
For if $δ$ is $\frac{1}{2}$ the sum of the positive roots then by (2.6)
$\begin{matrix} w_{i} δ & = & \frac{1}{2} \sum_{α \in R^{+}} w_{i} α \\ = & \frac{1}{2} (- α_{i} + \sum_{\binom{α \in R^{+}}{α \neq α_{i}}} α) = δ - α_{i}; \end{matrix}$
but on the other hand $w_{i} δ = δ - δ (h_{i}) α_{i},$ so that $δ (h_{i}) = 1$ for all $i .$ Since $A$ is nonsingular, $𝔥$ is spanned by $h_{1}, \dots, h_{n}$ and therefore $δ = ρ .$

The Denominator formula (3.23) now reads
$\begin{matrix} \sum_{w \in W} ε (w) e^{w ρ} & = & e^{ρ} \prod_{α \in R^{+}} (1 - e^{- α}) \\ = & \prod_{α \in R^{+}} (e^{α / 2} - e^{- α / 2}) \end{matrix}$
by virtue of $(✶) .$ where is this from? It is a polynomial identity in the group ring $ℤ [\frac{1}{2} Q] .$

For a specific example, take $A$ of type $A_{n - 1} .$ Let $u_{1}, \dots, u_{n}$ be the standard basis of $ℝ^{n},$ then the roots may be taken to be $u_{i} - u_{j}$ $(i \neq j)$ and the positive roots $u_{i} - u_{j}$ $(i < j) .$ Thus
$ρ = \frac{1}{2} \sum_{i < j} (u_{i} - u_{j}) = \frac{1}{2} \sum_{i = 1}^{n} (n + 1 - 2 i) u_{i}$
Put $x_{i} = e^{u_{i}};$ the Weyl group $W$ is here the symmetric group $S_{n}$ acting by permuting the $u_{i}$ (or, equivalently, the $x_{i}) .$ We have
$\begin{matrix} e^{ρ} & = & x_{1}^{\frac{n - 1}{2}} x_{x}^{\frac{n - 3}{2}} \dots x_{n}^{\frac{1 - n}{2}} \\ = & {(x_{1} \dots x_{n})}^{\frac{1 - n}{2}} x_{1}^{n - 1} x_{2}^{n - 2} \dots \end{matrix}$
and therefore
$\sum_{w \in W} ε (w) e^{w ρ} = {(x_{1} \dots x_{n})}^{\frac{1 - n}{2}} Δ (x_{1}, \dots, x_{n})$
where $Δ (x_{1}, \dots, x_{n}) = det (x_{i}^{n - j})$ is the Vandermonde determinant. On the other side,
$\begin{matrix} e^{ρ} \prod_{α \in R^{+}} (1 - e^{- α}) & = & e^{ρ} \prod_{i < j} (1 - x_{i}^{- 1} x_{j}) \\ = & {(x_{1} \dots x_{n})}^{\frac{1 - n}{2}} \prod_{i < j} (x_{i} - x_{j}) \end{matrix}$
and therefore the "denominator formula" in this case reduces to the familiar factorization of the Vanermonde determinant:
$Δ (x_{1}, \dots, x_{n}) = \prod_{i < j} (x_{i} - x_{j}) .$
So it is an essentially trivial polynomial identity in this case.
Let $A = (\begin{matrix} 2 & - 2 \\ - 2 & 2 \end{matrix}),$ of affine type. The Weyl group $W$ is infinite dihedral, generated by reflections $w_{1}, w_{2} .$ So its elements are 1 and
w1w2w1… torterms (all r≥1) w2w1w2… torterms (all r≥1)
We have $w_{1} (α_{1}) = - α_{1},$ $w_{2} (α_{1}) = α_{1} - α_{1} (h_{2}) α_{2} = α_{1} + 2 α_{2}$ and likewise $w_{1} (α_{2}) = 2 α_{1} + α_{2},$ $w_{2} (α_{2}) = - α_{2} .$ So we get the following picture:

Now if $w = w_{1} w_{2} w_{1} \dots$ to $r$ terms then
s(w) = α1+w1α2+ w1w2α1+… torterms, by (2.8) = α1+ (2α1+α2) +(3α1+2α2) +… = 12r(r+1)α1 +12r(r-1)α2
so that if we put $x_{1} = e^{- α_{1}}, x_{2} = e^{- α_{2}}$ we have
∑w∈Wε(w) e-s(w) = 1+∑r=1∞ (-1)r ( x112r(r+1) x212r(r-1) + x112r(r-1) x212r(r+1) ) = ∑r∈ℤ (-1)r x112r(r+1) x212r(r-1)
On the other hand, the positive roots are
rα1+(r-1)α2, rα1+rα2, (r-1)α1+rα2 (r≥1)
$r α_{1} + r α_{2} = r δ$ is an imaginary root; in fact (as we shall see later) it has multiplicity 1. So we obtain the identity
∑r∈ℤ(-1)r x112r(r+1) x212r(r-1) = ∏r=1∞ ( 1-x1r x2r-1 ) ( 1-x1r-1 x2r ) ( 1-x1rx2r )
in $ℤ [[x_{1}, x_{2}]] .$ If we put $x_{1} x_{2} = t,$ $x_{1} = x$ it takes the form
∑r∈ℤ(-1)r xrt12r(r-1) =∏r=1∞ (1-xtr-1) (1-x-1tr) (1-tr)
and is due to Jacobi (and earlier, unpublished, to Gauss): it is called Jacobi's triple product identity and it can be specialized in various ways, at least two of which are worth notice:
1. Put $t = x^{3}$ and we get
  $\prod_{r = 1}^{\infty} (1 - x^{n}) = \sum_{r \in ℤ} {(- 1)}^{r} x^{\frac{1}{2} r (3 r - 1)}$
  Euler's pentagonal number theorem.
2. Divide both sides by $1 - x$ and then set $x = 1 .$ On the product side we get $\prod_{r = 1}^{\infty} {(1 - t^{r})}^{3},$ and on the sum side
  $\begin{matrix} \sum_{r \geq 1} {(- 1)}^{r} t^{\frac{1}{2} r (r - 1)} \frac{x^{r} - x^{1 - r}}{1 - x} & \to & \sum_{r \geq 1} {(- 1)}^{r + 1} (2 r - 1) t^{\frac{1}{2} r (r - 1)} \\ = & \sum_{r \geq 0} {(- 1)}^{r} (2 r + 1) t^{\frac{1}{2} r (r + 1)} \end{matrix}$
  Thus we obtain another famous identity due to Jacobi:
  $\prod_{r = 1}^{\infty} {(1 - t^{r})}^{3} = 1 - 3 t + 5 t^{3} - 7 t^{6} + 9 t^{10} - 11 t^{15} + \dots$
Finally, if $A$ is symmetrizable and of indefinite type, the denominator formula can be used to compute the multiplicities of the imaginary roots. For a simple example take $A = (\begin{matrix} 2 & - 3 \\ - 3 & 2 \end{matrix}) .$ The Weyl group $W$ is infinite dihedral:
$\begin{matrix} w_{1} (α_{1}) & = & - α_{1} & w_{2} (α_{1}) & = & α_{1} + 3 α_{2} \\ w_{1} (α_{2}) & = & α_{2} + 3 α_{1} & w_{2} (α_{2}) & = & - α_{2} \end{matrix}$
So the real roots are $α_{1}, α_{2}, α_{1} + 3 α_{2}, α_{2} + 3 α_{1}, \dots$
$\begin{matrix} w_{1} (α_{1} + 3 α_{2}) & = & - α_{1} + 3 (α_{2} + 3 α_{1}) & = & 8 α_{1} + 3 α_{2} \\ w_{1} (3 α_{1} + 8 α_{2}) & = & - 3 α_{1} + 8 (α_{2} + 3 α_{1}) & = & 21 α_{1} + 8 α_{2} \end{matrix}$
the coefficients of which we recognise as Fibonacci numbers: the real roots are
$f_{2 k} α_{1} + f_{2 k - 2} α_{2}; f_{2 k - 2} α_{1} + f_{2 k} α_{2} (k \geq 1)$
$(f_{0} = 0, f_{1} = 1, f_{r} = f_{r - 1} + f_{r - 2})$

From this we calculate easily
$s (\underset{\underset{k terms}{⏟}}{w_{1} w_{2} \dots}) = (f_{2 k + 1} - 1) α_{1} + (f_{2 k - 1} - 1) α_{2}$
so that the series $\sum_{w \in W} ε (w) e^{- s (w)}$ is
$1 - x_{1} - x_{2} + x_{1}^{4} x_{2} + x_{2} x_{2}^{4} - x_{1}^{12} x_{2}^{4} - x_{1}^{4} x_{2}^{12} + x_{1}^{33} x_{2}^{12} + x_{1}^{12} x_{2}^{33} - \dots$
$(x_{i} = e^{- α_{i}}),$ i.e. it is a 'sparse' power series in $x_{1}, x_{2} .$ By factorizing it as a product of factors ${(1 - x_{1}^{k_{1}} x_{2}^{k_{2}})}^{m_{k_{1}, k_{2}}}$ we compute the multiplicities of the roots:
$= (1 - x_{1}) (1 - x_{2}) (1 - x_{1} x_{2}) \dots$

References

I.G. Macdonald
Issac Newton Institute for the Mathematical Sciences
20 Clarkson Road
Cambridge CB3 OEH U.K.

Version: October 30, 2001

page history