Kac-Moody Lie Algebras
Introduction

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last update: 13 August 2012

Abstract.
This is a typed version of I.G. Macdonald's lecture notes on Kac-Moody Lie algebras from 1983.

Introduction

Before we get down to serious business, let me begin by telling you a little in general terms about this subject, and what I propose to do (and what I do not propose to do) in this course of lectures. Basically it is an outgrowth of the theory of finite-dimensional complex simple (or semisimple) Lie algebras developed by Killing and E. Cartan nearly 100 years ago. If $𝔤$ is a complex finite-dimensional semisimple Lie algebra, one can associate canonically with $𝔤$ (I will go into details later) a certain matrix $C = (c_{i j})$ of integers, called the Cartan matrix of $𝔤$ , which determines $𝔤$ up to isomorphism. This matrix satisfies the following conditions:

$(C)$	$c_{i i} = 2; \quad c_{i j} \leq 0 \text{ if } i \neq j; \quad c_{i j} = 0 \Rightarrow c_{j i} = 0$
$(P)$	All principal minors of $C$ are positive.

Conversely, any matrix of integers satisfying these two conditions is the Cartan matrix of some semisimple Lie algebra $𝔤$ , and one can write down generators and relations for $𝔤$ which involve only the integers $c_{i j}$ (Serre's theorem). (I will write them down explicitly a little later.)
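Conditions $(C)$ and $(P)$ are finite checks on an integer matrix, so they can be tested mechanically. The following Python sketch is an illustration only: the function names `satisfies_C` and `satisfies_P` are hypothetical, and the two sample matrices are chosen for the example (the first is the Cartan matrix of ${𝔰 𝔩}_{3}$ , the second is a standard matrix satisfying $(C)$ but not $(P)$ ).

```python
from fractions import Fraction
from itertools import combinations

def det(m):
    """Determinant by Gaussian elimination over exact fractions."""
    a = [[Fraction(v) for v in row] for row in m]
    n, sign, result = len(a), 1, Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            a[r] = [x - factor * y for x, y in zip(a[r], a[col])]
    return sign * result

def satisfies_C(c):
    """c_ii = 2; c_ij <= 0 for i != j; c_ij = 0 implies c_ji = 0."""
    n = len(c)
    diagonal_ok = all(c[i][i] == 2 for i in range(n))
    off_diagonal_ok = all(
        c[i][j] <= 0 and (c[i][j] != 0 or c[j][i] == 0)
        for i in range(n) for j in range(n) if i != j
    )
    return diagonal_ok and off_diagonal_ok

def satisfies_P(c):
    """All principal minors (same row and column index set) are positive."""
    n = len(c)
    for size in range(1, n + 1):
        for idx in combinations(range(n), size):
            minor = [[c[i][j] for j in idx] for i in idx]
            if det(minor) <= 0:
                return False
    return True

a2 = [[2, -1], [-1, 2]]          # satisfies (C) and (P)
singular = [[2, -2], [-2, 2]]    # satisfies (C); determinant is 0, so (P) fails
print(satisfies_C(a2), satisfies_P(a2))
print(satisfies_C(singular), satisfies_P(singular))
```

Exact fractions are used so that the sign of each minor is decided without rounding.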

In the late 1960s, V. Kac and R. Moody, more or less simultaneously and independently, had the idea of starting with a "generalized Cartan matrix" (satisfying $(C)$ but not $(P)$ ), writing down the same set of generators and relations as in Serre's theorem, and looking at the resulting Lie algebra (which is now infinite-dimensional). These are the so-called Kac-Moody Lie algebras, or Lie algebras defined by generalized Cartan matrices.

Of course, when one takes an attractive and elegant piece of mathematics, such as the classical theory of finite-dimensional semisimple Lie algebras over $ℂ$ , and starts tinkering with it by weakening the axioms, the result is nearly always of no interest at all. The surprising thing in this case is that one does continue to get a coherent theory (which of course includes the classical theory as a special case) in which all the main features of the classical theory have their counterparts: root system, Weyl group, representations and characters – culminating in a generalization (due to V. Kac) of Weyl's character formula.

Moreover it has become clear during the last 10 years or so that these Kac-Moody Lie algebras impinge on many other areas of mathematics:

Number Theory	(Modular forms)
Combinatorics	(Partitions, Rogers-Ramanujan identities)
Topology	(Loop spaces and Loop groups)
Linear Algebra	(Representations of Quivers)
Singularities
Completely integrable systems
Mechanics and Particle Physics

My aim in this course of lectures is to cover the basic structure and representation theory of Kac-Moody Lie algebras, but not any of the applications listed above. Also I don't propose to assume any particular knowledge of Lie algebras, so I will begin by briefly reviewing some of the basic notions. We shall work always over a field $k$ of characteristic 0 (and fairly soon $k$ will be $ℂ$ , the field of complex numbers). A Lie algebra $𝔤$ is then a vector space over $k$ endowed with a bilinear multiplication

	$𝔤 \times 𝔤 \to 𝔤, \quad (x, y) \mapsto [x, y]$	(Lie bracket)

satisfying

(1)	$[x, x] = 0$ for all $x \in 𝔤$
(2)	$[x, [y, z]] + [y, [z, x]] + [z, [x, y]] = 0$

for all $x, y, z \in 𝔤$ (Jacobi identity).

By applying (1) to $x + y$ and using the bilinearity of the bracket we have

(1 $'$ )	$[x, y] = - [y, x]$ for all $x, y \in 𝔤$

and conversely (1 $'$ ) $\Rightarrow$ (1) (take $x = y$ ).
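Spelled out, the two steps just used are as follows (a worked version of the argument, using only bilinearity and (1)):

$0 = [x + y, x + y] = [x, x] + [x, y] + [y, x] + [y, y] = [x, y] + [y, x]$

In the other direction, putting $y = x$ in (1 $'$ ) gives $2 [x, x] = 0$ , hence $[x, x] = 0$ because $k$ has characteristic 0 (characteristic $\neq 2$ is all that is needed for this step).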

Examples

$V$ any vector space over $k$ , define $[x, y] = 0$ for all $x, y \in V$ . This is an abelian Lie algebra.
$A$ any associative $k$ –algebra, define $[x, y] = x y - y x$ . Check that (2) holds. We have a Lie algebra L( $A$ ).
$A$ any $k$ –algebra – i.e. $A$ is a $k$ –vector space endowed with a bilinear multiplication $(x, y) \mapsto x y : A \times A \to A$ .
A derivation $d : A \to A$ is a $k$ –linear mapping which satisfies

$(✶)$	$d (x y) = (d x) y + x (d y) \quad (x, y \in A)$

If $d_{1}, d_{2}$ are derivations, so is $[d_{1}, d_{2}] = d_{1} d_{2} - d_{2} d_{1}$ (check this). Again the Jacobi identity is satisfied (the verification is the same as in Ex. 2), so we have a Lie algebra Der( $A$ ).
It follows from $(✶)$ that
$d^{n} (x y) = \sum_{p + q = n} \frac{n!}{p! q!} (d^{p} x) (d^{q} y) \quad \text{(Leibniz)}$
by induction on $n$ ; hence if $d$ is nilpotent ( $d^{N} = 0$ for some $N > 0$ )
$e^{d} = \sum_{n \geq 0} \frac{d^{n}}{n!}$
is well-defined (because the sum is finite) and is an automorphism of $A$ :
$e^{d} (x y) = \sum_{n \geq 0} \frac{d^{n} (x y)}{n!} = \sum_{p, q \geq 0} \frac{d^{p} x}{p!} \cdot \frac{d^{q} y}{q!} = e^{d} (x) e^{d} (y)$
so that $e^{d} : A \to A$ is a $k$ –algebra homomorphism, hence an automorphism because $e^{d} \cdot e^{- d} = 1$ .
Let $V$ be a finite-dimensional $k$ –vector space, $A =$ End( $V$ ). Then L( $A$ ) (Ex. 2) is a Lie algebra denoted by $𝔤 𝔩 (V)$ . If $V = k^{n}$ , so that $A = M_{n} (k)$ is the algebra of $n \times n$ matrices over $k$ , we write ${𝔤 𝔩}_{n} (k)$ in place of $𝔤 𝔩 (k^{n})$ . ${𝔰 𝔩}_{n} (k) = \{X \in {𝔤 𝔩}_{n} (k) : \text{trace } X = 0\}$ is a subalgebra of ${𝔤 𝔩}_{n} (k)$ , because $\text{trace } [X, Y] = \text{trace } X Y - \text{trace } Y X = 0$ .
$G$ a (real or complex) Lie group, $𝔤 =$ tangent space $T_{e} (G)$ to $G$ at the identity element $e$ ( $k = ℝ$ or $ℂ$ here). $𝔤$ inherits from the group $G$ a Lie algebra structure: roughly speaking, addition $X + Y$ in $𝔤$ corresponds to multiplication in $G$ near the identity, and the bracket $[X, Y]$ corresponds to formation of the commutator $x y x^{- 1} y^{- 1}$ (for $x, y$ near $e$ ). Here of course $𝔤$ is finite-dimensional: $dim 𝔤 = dim G$ . This is the origin of the subject: Lie algebra $𝔤 =$ linear approximation to $G$ at $e$ .
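The checks left to the reader in the examples above (the Jacobi identity for $[x, y] = x y - y x$ , and the vanishing of the trace of a commutator) can be tried numerically. This is a small sketch for $2 \times 2$ integer matrices; the helper names and the sample matrices are made up for the illustration, and a numerical check on one triple is of course not a proof.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(r, s)] for r, s in zip(a, b)]

def sub(a, b):
    return [[x - y for x, y in zip(r, s)] for r, s in zip(a, b)]

def bracket(a, b):
    """[a, b] = ab - ba, the bracket of L(M_n(k))."""
    return sub(matmul(a, b), matmul(b, a))

def jacobi_lhs(x, y, z):
    """[x,[y,z]] + [y,[z,x]] + [z,[x,y]], the left-hand side of (2)."""
    return add(add(bracket(x, bracket(y, z)), bracket(y, bracket(z, x))),
               bracket(z, bracket(x, y)))

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

# Arbitrary sample matrices, chosen only for this illustration.
x = [[1, 2], [3, 4]]
y = [[0, 1], [5, -2]]
z = [[7, 0], [1, 1]]
ZERO = [[0, 0], [0, 0]]

print(bracket(x, x) == ZERO)          # property (1)
print(jacobi_lhs(x, y, z) == ZERO)    # property (2)
print(trace(bracket(x, y)))           # the commutator lies in sl_2
```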

Basic concepts

Many notions for groups have counterparts for Lie algebras. This is hardly surprising, given the origin of the subject.

Let $𝔤$ be a Lie algebra. If $𝔞, 𝔟$ are vector subspaces (or just subsets) of $𝔤$ , let $[𝔞, 𝔟]$ denote the subspace of $𝔤$ spanned by all $[x, y]$ with $x \in 𝔞, y \in 𝔟$ . Observe that $[𝔞, 𝔟] = [𝔟, 𝔞]$ (because $[x, y] = - [y, x]$ ).

Subalgebra: a vector subspace $𝔞$ of $𝔤$ is a subalgebra if $[𝔞, 𝔞] \subset 𝔞$ (so that $𝔞$ is a Lie algebra in its own right).

Ideal: a vector subspace $𝔫$ of $𝔤$ is an ideal of $𝔤$ if $[𝔤, 𝔫] \subset 𝔫$ (the analogue of a normal subgroup).

Quotient algebra: Let $𝔫$ be an ideal in $𝔤$ , and form the vector space quotient $𝔤 / 𝔫$ , whose elements are the cosets $\overline{x} = x + 𝔫$ . Define $[\overline{x}, \overline{y}] = \overline{[x, y]}$ ; this does not depend on the choice of representatives, and makes $𝔤 / 𝔫$ into a Lie algebra (the analogue of the quotient group $G / N$ ).

Homomorphism: a homomorphism from $𝔤$ to $𝔥$ is a $k$ –linear map $f : 𝔤 \to 𝔥$ such that $f ([x, y]) = [f (x), f (y)]$ for all $x, y \in 𝔤$ .
Its kernel $𝔫 = f^{- 1} (0)$ is an ideal in $𝔤$ , its image $𝔞 = f (𝔤)$ is a subalgebra of $𝔥$ , and $f$ induces an isomorphism $𝔤 / 𝔫 \tilde{\to} 𝔞$ .

If $x, y \in 𝔤$ are such that $[x, y] = 0$ , we say that $x, y$ commute. In particular, if any two elements of $𝔤$ commute, i.e. if $[𝔤, 𝔤] = 0,$ we say that $𝔤$ is abelian (Ex. 1 above).

Centre of $𝔤 = z = \{x \in 𝔤 : [x, 𝔤] = 0\}$ . It is an ideal in $𝔤$ .

Derived algebra $𝔤' = D 𝔤 = [𝔤, 𝔤]$ consists of all linear combinations of brackets $[x, y]$ . $D 𝔤$ is an ideal in $𝔤$ (by virtue of the Jacobi identity):

$[[x, y], z] = - [[y, z], x] - [[z, x], y] \in D 𝔤,$

hence $[D 𝔤, 𝔤] \subset D 𝔤$ . Moreover $𝔤 / D 𝔤$ is abelian (and $D 𝔤$ is the smallest ideal with abelian quotient).
Derived series, upper and lower central series; nilpotent, solvable Lie algebras.

Adjoint representation

For each $x \in 𝔤$ we define $ad (x) : 𝔤 \to 𝔤$ by $ad (x) y = [x, y]$ . Then

$ad : 𝔤 \to 𝔤 𝔩 (𝔤)$

is a homomorphism of Lie algebras, because for all $x, y, z \in 𝔤$ we have

$\begin{matrix} ad [x, y] \cdot z & = & [[x, y], z] \\ & = & - [z, [x, y]] \\ & = & [x, [y, z]] - [y, [x, z]] \quad \text{(Jacobi)} \\ & = & (ad x) (ad y) z - (ad y) (ad x) z \\ & = & [ad x, ad y] \cdot z \end{matrix}$

Moreover each $ad x$ is a derivation of $𝔤$ :

$(ad x) [y, z] = [(ad x) y, z] + [y, (ad x) z]$

This is another equivalent form of the Jacobi identity.

The kernel of ad is the centre $z$ of $𝔤$ .
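As a concrete instance, one can write down the matrices of $ad$ for ${𝔰 𝔩}_{2}$ and check the homomorphism property directly. The sketch below assumes the usual basis $e, h, f$ of $2 \times 2$ trace-zero matrices; the names `E`, `H`, `F`, `coords` and `ad` are choices made for this illustration.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def bracket(a, b):
    """[a, b] = ab - ba for square matrices of the same size."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x - y for x, y in zip(r, s)] for r, s in zip(ab, ba)]

# Assumed basis of sl_2: e, h, f as 2x2 matrices of trace zero.
E = [[0, 1], [0, 0]]
H = [[1, 0], [0, -1]]
F = [[0, 0], [1, 0]]
BASIS = [E, H, F]

def coords(m):
    """Coordinates of a trace-zero 2x2 matrix in the basis (e, h, f)."""
    return [m[0][1], m[0][0], m[1][0]]

def ad(x):
    """3x3 matrix of ad(x) in the basis (e, h, f); column j is [x, basis_j]."""
    cols = [coords(bracket(x, b)) for b in BASIS]
    return [[cols[j][i] for j in range(3)] for i in range(3)]

print(ad(H))
# ad[x, y] = [ad x, ad y] on all pairs of basis elements:
print(all(ad(bracket(x, y)) == bracket(ad(x), ad(y))
          for x in BASIS for y in BASIS))
```

Since $ad$ is linear, checking the identity on pairs of basis elements is enough for this example.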

Inner automorphisms

If $x \in 𝔤$ is such that $ad x$ is nilpotent ( ${(ad x)}^{N} = 0$ for some $N > 0$ ) then we can form $e^{ad x}$ (Ex. 3 above) which is an automorphism of $𝔤$ . The subgroup Int( $𝔤$ ) of Aut( $𝔤$ ) generated by these $e^{ad x}$ is the group of inner automorphisms of $𝔤$ . It is a normal subgroup of Aut( $𝔤$ ), because if $φ \in Aut (𝔤)$ we have $φ (ad x) φ^{- 1} = ad φ (x)$ and therefore also

$φ (e^{ad x}) φ^{- 1} = e^{ad φ x}$ .
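For a small example of an inner automorphism, take $𝔤 = {𝔰 𝔩}_{2}$ and $x = e$ , for which ${(ad e)}^{3} = 0$ . The sketch below sums the finite exponential series with exact fractions and checks that the result preserves brackets; the function name `exp_ad` and the cut-off `max_terms` are assumptions of the illustration.

```python
from fractions import Fraction

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def bracket(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x - y for x, y in zip(r, s)] for r, s in zip(ab, ba)]

def exp_ad(x, y, max_terms=50):
    """Apply e^{ad x} = sum_n (ad x)^n / n! to y, assuming the sum is finite."""
    total = [[Fraction(v) for v in row] for row in y]
    term = total
    for k in range(1, max_terms):
        # term becomes (ad x)^k y / k!
        term = [[v / k for v in row] for row in bracket(x, term)]
        if all(v == 0 for row in term for v in row):
            return total
        total = [[a + b for a, b in zip(r, s)] for r, s in zip(total, term)]
    raise ValueError("ad x does not appear to be nilpotent on y")

E = [[0, 1], [0, 0]]
H = [[1, 0], [0, -1]]
F = [[0, 0], [1, 0]]

print(exp_ad(E, F))  # equals f + h - e
print(all(exp_ad(E, bracket(y, z)) == bracket(exp_ad(E, y), exp_ad(E, z))
          for y in (E, H, F) for z in (E, H, F)))
```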

Representations

Let $𝔤$ be a Lie algebra. A representation $ρ$ of $𝔤$ on a $k$ –vector space $V$ is by definition a Lie algebra homomorphism $ρ : 𝔤 \to 𝔤 𝔩 (V)$ . In other words, for each $x \in 𝔤$ we have a linear transformation $ρ (x) : V \to V$ depending linearly on $x$ :

$ρ (α x + β y) = α ρ (x) + β ρ (y) (x, y \in 𝔤; α, β \in k)$

and satisfying

$ρ ([x, y]) = ρ (x) ρ (y) - ρ (y) ρ (x)$ .

An equivalent notion is that of a $𝔤$ –module, which is a vector space $V$ on which $𝔤$ acts linearly, i.e. we are given a bilinear mapping

$(x, v) \mapsto x \cdot v : 𝔤 \times V \to V$

satisfying

$[x, y] \cdot v = x \cdot (y \cdot v) - y \cdot (x \cdot v) \quad (x, y \in 𝔤; v \in V)$

To connect the two notions, define $x \cdot v = ρ (x) v$ .

Usual notions of irreducibility, direct sums etc.

Universal enveloping algebra of a Lie algebra

If $G$ is a group, a $G$ –module (or representation of $G$ ) is the same thing as a $k G$ –module, where $k G$ is the group algebra of $G$ over $k$ . The analogue of this for Lie algebras is the universal enveloping algebra $U (𝔤)$ of a Lie algebra $𝔤$ , which may be defined as follows: form the tensor algebra of the vector space $𝔤$

$T (𝔤) = \underset{n \geq 0}{\oplus} T^{n} (𝔤)$

where $T^{0} (𝔤) = k, T^{1} (𝔤) = 𝔤, T^{n} (𝔤) = 𝔤 \otimes \dots \otimes 𝔤$ ( $n$ factors) for $n \geq 2$ .
Let $J_{𝔤}$ be the two-sided ideal of $T (𝔤)$ generated by all

$x \otimes y - y \otimes x - [x, y] (x, y \in 𝔤)$

and define

$U (𝔤) = T (𝔤) / J_{𝔤}$ .

$U (𝔤)$ is functorial in $𝔤$ : if $φ : 𝔤 \to 𝔥$ is a homomorphism of Lie algebras, it induces $T (φ) : T (𝔤) \to T (𝔥)$ , and

$T (φ) (x \otimes y - y \otimes x - [x, y]) = φ x \otimes φ y - φ y \otimes φ x - [φ x, φ y] \in J_{𝔥}$

so that T( $φ$ ) maps $J_{𝔤}$ into $J_{𝔥}$ and hence induces

$U (φ) : U (𝔤) \to U (𝔥)$ .

U is the left adjoint of the functor L (Ex. 2) from associative algebras to Lie algebras (over $k$ ): for each Lie algebra $𝔤$ and each associative algebra A, there is a canonical bijection

${Hom}_{assoc. alg} (U (𝔤), A) \tilde{\to} {Hom}_{Lie alg.} (𝔤, L(A))$

For if $φ : 𝔤 \to L(A)$ is a Lie algebra homomorphism, it is a $k$ –linear mapping $𝔤 \to A$ such that

$φ [x, y] = φ (x) φ (y) - φ (y) φ (x)$ . (1)

Extend $φ$ to a homomorphism $φ : T (𝔤) \to A$ by defining

$φ (x_{1} \otimes \dots \otimes x_{n}) = φ (x_{1}) \dots φ (x_{n}),$

then the kernel of $φ$ contains the ideal $J_{𝔤}$ , by virtue of (1). Hence $φ$ induces a homomorphism of associative algebras $φ^{#} : U (𝔤) \to A$ .

In the other direction, let $θ : U (𝔤) \to A$ be a homomorphism of associative algebras, and form the linear mapping

$θ^{b} : 𝔤 ↪ T (𝔤) \to U (𝔤) \overset{θ}{\to} A$

which is a Lie algebra homomorphism $𝔤 \to L(A)$ . Finally verify that the mappings $φ \mapsto φ^{#}, θ \mapsto θ^{b}$ are inverses of each other.

In particular, if $ρ : 𝔤 \to 𝔤 𝔩 (V) = L(End(V))$ is a representation, we have $ρ^{#} : U (𝔤) \to End(V)$ , i.e. $V$ is a $U (𝔤)$ –module.

The Poincaré-Birkhoff-Witt Theorem

Recall that $U (𝔤) = T (𝔤) / J_{𝔤}; T (𝔤) = \underset{n \geq 0}{\oplus} T^{n} (𝔤)$ is a graded algebra, but $J_{𝔤}$ is not a graded ideal, because the generators $x \otimes y - y \otimes x - [x, y]$ are not homogeneous: $x \otimes y - y \otimes x \in T^{2} (𝔤), [x, y] \in T^{1} (𝔤)$ . So $U (𝔤)$ is not a graded algebra; but it does carry a filtration, defined as follows: Let

$T_{n} = \oplus_{i = 0}^{n} T^{i} (𝔤)$

Let $π : T (𝔤) \to U (𝔤)$ be the canonical homomorphism, and let

$U_{n} = π (T_{n})$

The $U_{n}$ are vector subspaces of $U$ :

$k = U_{0} \subset U_{1} \subset \dots; U = ⋃_{n \geq 0} U_{n}$

and since $T_{m} \otimes T_{n} \subset T_{m + n}$ we have $U_{m} \cdot U_{n} \subset U_{m + n}$ , i.e. $U (𝔤)$ is a filtered associative $k$ –algebra. Now form the associated graded algebra: define

$G^{n} = U_{n} / U_{n - 1} (n \geq 0; U_{- 1} = 0)$

then the multiplication in $U (𝔤)$ induces bilinear mappings

$G^{m} \times G^{n} \to G^{m + n}$

namely (for $x \in U_{m}, y \in U_{n}$ )

$(x + U_{m - 1}) (y + U_{n - 1}) = x y + U_{m + n - 1}$

which make $G = Gr (U (𝔤)) = \underset{n \geq 0}{\oplus} G^{n}$ into a graded associative $k$ –algebra. For each $n \geq 1$ we have

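$\begin{matrix} 0 & \to & T_{n - 1} & \to & T_{n} & \to & T^{n} (𝔤) & \to & 0 \\ & & \downarrow π & & \downarrow π & & \downarrow φ & & \\ 0 & \to & U_{n - 1} & \to & U_{n} & \to & G^{n} & \to & 0 \end{matrix}$

commutative with exact rows; $φ : T^{n} (𝔤) \to G^{n}$ is surjective, hence we have a surjective algebra homomorphism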

$φ : T (𝔤) \to G$

defined by

$φ (x) = π (x) + U_{n - 1} \quad (x \in T^{n} (𝔤))$

In particular, if $x, y \in 𝔤$ we have

$π (x \otimes y - y \otimes x) = π ([x, y]) \in U_{1}$

and therefore

$φ (x \otimes y - y \otimes x) = 0$ in $G^{2}$

Hence the kernel of $φ$ contains the two-sided ideal $I$ of $T (𝔤)$ generated by all $x \otimes y - y \otimes x$ ( $x, y \in 𝔤$ ); now $T (𝔤) / I$ is by definition the symmetric algebra $S = S (𝔤)$ of the vector space $𝔤$ ; hence $φ$ induces a surjective homomorphism

$ω : S (𝔤) \to Gr (U (𝔤))$

Theorem 7.1 (P-B-W). $ω$ is an isomorphism.

For the proof, see the standard texts (Bourbaki (Ch. I), Humphreys, Jacobson).

Let $σ : T (𝔤) \to S (𝔤)$ be the canonical homomorphism.

Corollary 7.2. Let $V$ be a vector subspace of $T^{n} (𝔤)$ which is mapped isomorphically by $σ$ onto $S^{n} (𝔤)$ . Then $π (V) (≅ V)$ is a complement of $U_{n - 1}$ in $U_{n}$ .

Proof.

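The diagram formed by $σ : V \to S^{n} (𝔤)$ , $ω : S^{n} (𝔤) \to G^{n}$ and $θ : V \to G^{n}$ , where $θ$ is the restriction to $V$ of $φ : T^{n} (𝔤) \to G^{n}$ , commutes (i.e. $ω σ = θ$ on $V$ , because $ω$ is induced by $φ$ ), hence $θ$ maps $V$ isomorphically onto $G^{n} = U_{n} / U_{n - 1}$ (because $ω$ is an isomorphism, by P-B-W). Hence the result.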

$□$

Corollary 7.3. The canonical map $𝔤 ↪ T (𝔤) \overset{π}{\to} U (𝔤)$ is injective.

We may therefore identify $𝔤$ with its image in $U (𝔤)$ .

Corollary 7.4. Let ${(x_{λ})}_{λ \in L}$ be a totally ordered $k$ –basis of $𝔤$ . Then the elements

$x_{λ_{1}} \dots x_{λ_{n}} = π (x_{λ_{1}} \otimes \dots \otimes x_{λ_{n}})$

such that $λ_{1} \leq \dots \leq λ_{n}$ (for all $n \geq 0$ ) form a $k$ –basis of $U (𝔤)$ .

Proof.

Let $V_{n}$ be the subspace of $T^{n} (𝔤)$ spanned by all $x_{λ_{1}} \otimes \dots \otimes x_{λ_{n}}$ with $λ_{1} \leq \dots \leq λ_{n}$ . Clearly $σ$ maps $V_{n}$ isomorphically onto $S^{n} (𝔤)$ , hence by Corollary 7.2 $π (V_{n})$ is a complement of $U_{n - 1}$ in $U_{n}$ . By induction on $n$ it follows that $U (𝔤)$ is the direct sum of the $π (V_{n})$ for all $n \geq 0$ .

(Corollary 7.4 is also known as the P-B-W theorem).

$□$
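To see the ordered-monomial basis at work, here is a small sketch that rewrites a word in $U ({𝔰 𝔩}_{2})$ as a combination of ordered monomials, using only the relation $a b = b a + [a, b]$ . The ordering $f < h < e$ of the basis, the encoding of words as strings, and the names `ORDER`, `BRACKET` and `normal_order` are all assumptions made for the illustration; the procedure shows that ordered monomials span, while their linear independence is the content of P-B-W.

```python
from itertools import combinations_with_replacement

# Assumed ordered basis of sl_2: f < h < e.
ORDER = "fhe"

# Brackets [a, b] for a > b in that order, as {basis letter: coefficient}.
BRACKET = {
    ("h", "f"): {"f": -2},   # [h, f] = -2f
    ("e", "f"): {"h": 1},    # [e, f] = h
    ("e", "h"): {"e": -2},   # [e, h] = -2e
}

def normal_order(word, coeff=1, out=None):
    """Rewrite coeff * word as a combination of ordered monomials."""
    out = {} if out is None else out
    for i in range(len(word) - 1):
        a, b = word[i], word[i + 1]
        if ORDER.index(a) > ORDER.index(b):
            # In U(g): ab = ba + [a, b]
            normal_order(word[:i] + b + a + word[i + 2:], coeff, out)
            for c, k in BRACKET[(a, b)].items():
                normal_order(word[:i] + c + word[i + 2:], coeff * k, out)
            return out
    out[word] = out.get(word, 0) + coeff   # word is already ordered
    return out

def ordered_monomials(n):
    """All ordered monomials of degree n in f, h, e."""
    return ["".join(m) for m in combinations_with_replacement(ORDER, n)]

print(normal_order("ef"))
print(normal_order("eff"))
print(ordered_monomials(2))
```

The recursion terminates because each swap removes an inversion and each bracket term is a shorter word.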

Recap

To recapitulate from last time: to each Lie algebra $𝔤$ we associate $U (𝔤)$ , its universal enveloping algebra: $U (𝔤) = T (𝔤) / J_{𝔤}$ where $J_{𝔤}$ is the two-sided ideal of $T (𝔤)$ generated by all $x \otimes y - y \otimes x - [x, y]$ ( $x, y \in 𝔤$ ). $𝔤$ embeds in $U (𝔤)$ (by virtue of the P-B-W theorem) and we identify $𝔤$ with its image in $U (𝔤)$ . In $U (𝔤)$ we have $[x, y] = x y - y x$ ( $x, y \in 𝔤$ ). Moreover $U (𝔤)$ is universal in the following sense: if $φ : 𝔤 \to A$ is any $k$ –linear mapping of $𝔤$ into an associative $k$ –algebra $A$ such that $φ [x, y] = φ (x) φ (y) - φ (y) φ (x)$ , i.e. if $φ : 𝔤 \to L (A)$ is a Lie algebra homomorphism, then $φ$ extends uniquely to a homomorphism $φ^{#} : U (𝔤) \to A$ , as follows: first extend $φ$ to $\tilde{φ} : T (𝔤) \to A$ in the obvious way:

$\tilde{φ} (x_{1} \otimes \dots \otimes x_{n}) = φ (x_{1}) \dots φ (x_{n}) (x_{1}, \dots, x_{n} \in 𝔤)$

and then observe that $J_{𝔤} \subset Ker \tilde{φ}$ , so that $\tilde{φ}$ induces $φ^{#} : U (𝔤) \to A$ as desired. Thus $φ \mapsto φ^{#}$ is a mapping

${Hom}_{Lie alg.} (𝔤, L(A)) \to {Hom}_{assoc. alg} (U (𝔤), A)$

which one easily verifies to be bijective (i.e. $U$ is a left adjoint of the functor L, as I said last time).

Recall also (Corollary 7.4 of P-B-W th.) that if ${(x_{λ})}_{λ \in L}$ is an ordered $k$ –basis of $𝔤$ , then the monomials $x_{λ_{1}} \dots x_{λ_{n}}$ with $λ_{1} \leq \dots \leq λ_{n}$ and $n \geq 0$ (if $n = 0$ , the product is empty and is conventionally read as 1, the identity element of $U (𝔤)$ ) form a $k$ –basis of $U (𝔤)$ . This has the following consequence: if

$𝔤 = 𝔞 \oplus 𝔟$

where $𝔞, 𝔟$ are subalgebras and $𝔤$ is the direct sum of the vector spaces $𝔞, 𝔟$ , then $U (𝔞), U (𝔟)$ are subalgebras of $U (𝔤)$ and

$U (𝔤) = U (𝔞) U (𝔟) = U (𝔟) U (𝔞)$

(Take ordered bases $(y_{μ}), (z_{ν})$ of $𝔞, 𝔟$ respectively; the monomials $y_{μ_{1}} \dots y_{μ_{m}}$ with $μ_{1} \leq \dots \leq μ_{m}$ form a $k$ –basis of $U (𝔞)$ , the monomials $z_{ν_{1}} \dots z_{ν_{n}}$ with $ν_{1} \leq \dots \leq ν_{n}$ form a $k$ –basis of $U (𝔟)$ , and the monomials $y_{μ_{1}} \dots y_{μ_{m}} z_{ν_{1}} \dots z_{ν_{n}}$ with $μ_{1} \leq \dots \leq μ_{m}$ and $ν_{1} \leq \dots \leq ν_{n}$ form a $k$ –basis of $U (𝔤)$ .)

Free Lie algebras

Let $X$ be a set, $k$ a field. We want to define the free Lie algebra Lie( $X$ ) on the set $X$ . There are two ways of proceeding: one involves P-B-W, the other doesn't.

(1) Form the free non-associative algebra F( $X$ ) on $X$ . How does one do this?

Define inductively sets $X_{n}, n \geq 1$ by

$\begin{matrix} X_{1} & = & X \\ X_{2} & = & X_{1} \times X_{1} \\ X_{3} & = & (X_{2} \times X_{1}) ⊔ (X_{1} \times X_{2}) \end{matrix}$

and in general

$X_{n} = ∐_{p = 1}^{n - 1} X_{p} \times X_{n - p}$

and put $M (X) = \underset{n \geq 1}{∐} X_{n}$ (disjoint union) (the "free magma" on $X$ ).

If $a, b \in M (X)$ , say $a \in X_{p}$ and $b \in X_{q}$ , then $(a, b) \in X_{p} \times X_{q} \subset X_{p + q} \subset M (X)$ so we have a multiplication $a b = (a, b)$ in $M (X)$ . Then $F (X)$ is the $k$ –algebra with $M (X)$ as basis, i.e. it consists of all finite linear combinations $\sum λ_{i} a_{i}$ with $λ_{i} \in k$ and $a_{i} \in M (X)$ , and multiplication defined in the obvious way:

$(\sum λ_{i} a_{i}) (\sum μ_{j} b_{j}) = \sum_{i, j} λ_{i} μ_{j} a_{i} b_{j}$ .
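The inductive definition of the sets $X_{n}$ translates directly into code. In this sketch the elements of the free magma are represented as nested pairs (Python tuples) over generators given as strings; the function name `free_magma_levels` is hypothetical.

```python
from itertools import product

def free_magma_levels(generators, max_degree):
    """Return [X_1, ..., X_max_degree] with X_n = disjoint union of X_p x X_{n-p}.

    Generators are assumed to be strings, so that a nested pair can never be
    confused with a generator and the union over p is automatically disjoint.
    """
    levels = [list(generators)]
    for n in range(2, max_degree + 1):
        level = []
        for p in range(1, n):
            # X_p is levels[p - 1] because the list is 0-indexed.
            level.extend(product(levels[p - 1], levels[n - p - 1]))
        levels.append(level)
    return levels

levels = free_magma_levels(["x"], 5)
print([len(level) for level in levels])
print(levels[2])   # the two bracketings of three copies of x
```

With a single generator the sizes of the $X_{n}$ count the ways of bracketing a product of $n$ factors.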

Now let $J$ be the 2-sided ideal in $F (X)$ generated by all

$x x, \quad x (y z) + y (z x) + z (x y) \quad (x, y, z \in F (X))$

and define

$Lie (X) = F (X) / J$

where $F (X)$ is a graded algebra and $J$ is a homogeneous ideal. It is clear that Lie( $X$ ) is a Lie algebra and that if

$j : X ↪ F (X) \to Lie (X)$

is the canonical embedding, then any mapping $φ$ of the set $X$ into a Lie algebra $𝔤$ extends uniquely to a Lie algebra homomorphism

$φ^{#} : Lie (X) \to 𝔤$

(extend $φ$ in the obvious way to $\tilde{φ} : F (X) \to 𝔤$ and observe that the generators of $J$ lie in the kernel of $\tilde{φ}$ , by definition).

In other words, we have a bijection

${Hom}_{sets} (X, 𝔤) \tilde{\to} {Hom}_{Lie alg.} (Lie (X), 𝔤)$

i.e. the functor Lie (from sets to Lie algebras) is a left adjoint of the forgetful functor $Φ$ (from Lie algebras to sets).

(2) Let $A (X)$ be the free associative algebra on $X$ ( $= F (X) / I$ , where $I$ is the 2-sided ideal generated by all $(x y) z - x (y z) (x, y, z \in F (X))$ ). The embedding

$X ↪ A (X) = L (A (X))$

induces, as we have just seen, a Lie algebra homomorphism

$α : Lie (X) \to L (A (X))$

hence also

$β : U (Lie (X)) \to A (X)$

But also we have a mapping $X ↪ Lie (X) \to U (Lie (X))$ , hence (by the universal property of $A (X)$ ) a homomorphism of associative algebras

$γ : A (X) \to U (Lie (X))$

Check that these two homomorphisms $β, γ$ are inverses of each other, hence that

$U (Lie (X)) ≅ A (X)$

By P-B-W, Lie( $X$ ) embeds in $U (Lie (X))$ , hence in $A (X)$ , so that the mapping $α$ above is injective. In other words, the free Lie algebra Lie( $X$ ) may be described as the subalgebra of $L (A (X))$ generated by $X$ .

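Here $i : X ↪ A (X)$ and $j : X ↪ Lie (X)$ denote the canonical embeddings and $k : Lie (X) \to U (Lie (X))$ the canonical map. We have $β k = α$ and $γ α = k$ , also $α j = i$ , hence $γ β k = γ α = k$ , hence $γ β = 1$ (because the image of $k$ generates $U (Lie (X))$ ); $β γ i = β γ α j = β k j = α j = i$ , hence $β γ = 1$ (because the image of $i$ generates $A (X)$ ).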

Finally, if $R$ is any subset of Lie( $X$ ), the Lie algebra generated by $X$ subject to the relations $R$ is by definition $Lie (X) / 𝔞$ , where $𝔞$ is the ideal of Lie( $X$ ) generated by $R$ (i.e. the intersection of all ideals of Lie( $X$ ) which contain $R$ ).

Finite-dimensional simple Lie algebras / $ℂ$

This is the classical theory we intend to generalise. A Lie algebra $𝔤$ is said to be simple if its only ideals are 0 and $𝔤$ , and if also $𝔤$ is non-abelian (thus deliberately excluding the 1-dimensional abelian Lie algebra). Take $k = ℂ$ , and dim $𝔤 < \infty$ .

An element $x \in 𝔤$ is semisimple if ad $x : 𝔤 \to 𝔤$ is a semisimple (i.e., diagonalizable) linear transformation.

Let $𝔤$ be simple, finite-dimensional. Then $𝔤$ has nonzero subalgebras consisting of semisimple elements (toral subalgebras); they are necessarily abelian.

Let $𝔥$ be a maximal toral subalgebra (or Cartan subalgebra) of $𝔤$ . Certainly such exist, for dimensional reasons. Moreover (a non-trivial fact) any two such are conjugate in $𝔤$ (i.e. transforms of each other under the group Int( $𝔤$ ) of inner automorphisms). Fix $𝔥$ once for all. $l = dim 𝔥$ is called the rank of $𝔤$ .

Let $𝔥^{*}$ be the vector space dual of $𝔥$ . Introduce the Killing form

$⟨ x, y ⟩ = trace (ad x) (ad y) \quad (x, y \in 𝔤)$

This is symmetric, nondegenerate and invariant, i.e.

$⟨ [x, z], y ⟩ = ⟨ x, [z, y] ⟩ (x, y, z \in 𝔤)$

Moreover its restriction to $𝔥$ is nondegenerate, hence defines an isomorphism $ω : 𝔥 \tilde{\to} 𝔥^{*} (ω (x) (y) = ⟨ x, y ⟩)$ and a symmetric bilinear form $⟨ λ, μ ⟩$ on $𝔥^{*} (⟨ λ, μ ⟩ = ⟨ ω^{- 1} λ, ω^{- 1} μ ⟩)$ .

Example $𝔤 = {𝔰 𝔩}_{n} (ℂ) =$ Lie algebra of $n \times n$ matrices with trace 0. Here we may take $𝔥$ to consist of the diagonal matrices

$h = (\begin{matrix} h_{1} & & \\ & ⋱ & \\ & & h_{n} \end{matrix})$ with $\sum_{1}^{n} h_{i} = 0$

(so that $l = rank (𝔤) = n - 1$ ).

Roots

Consider the adjoint representation ${ad}_{𝔤}$ of $𝔤$ , restricted to $𝔥$ : this is a representation of $𝔥$ on $𝔤$ . Since $𝔥$ is abelian, all its irreducible representations are 1-dimensional, so that $𝔤$ splits up into a direct sum of 1-dimensional $𝔥$ –modules. Explicitly, for each $α \in 𝔥^{*}$ define

$𝔤_{α} = \{x \in 𝔤 : (ad h) x = α (h) x \text{ for all } h \in 𝔥\}$

Then it turns out that $𝔤_{0} = 𝔥$ ; the nonzero $α \in 𝔥^{*}$ such that $𝔤_{α} \neq 0$ are called the roots of $𝔤$ (relative to $𝔥$ ). They form a finite subset $R$ of $𝔥^{*}$ , called the root system of $(𝔤, 𝔥)$ , and we have

$𝔤 = 𝔥 + \sum_{α \in R} 𝔤_{α} \quad \text{(direct sum)}$

Moreover each $𝔤_{α} (α \in R)$ is 1-dimensional, and $[𝔤_{α}, 𝔤_{β}] \subset 𝔤_{α + β}$ (hence is 0 if $α + β \notin R \cup \{0\}$ ). If $α$ is a root, so is $- α$ .

In the case of ${𝔰 𝔩}_{n} (ℂ)$ , let $e_{i j} (1 \leq i, j \leq n)$ be the matrix units, and for $1 \leq i \leq n$ let $u_{i} : 𝔥 \to ℂ$ be the $i$ th projection: $u_{i} (h) = h_{i}$ . Then

$(ad h) e_{i j} = [h, e_{i j}] = h e_{i j} - e_{i j} h = (h_{i} - h_{j}) e_{i j} = (u_{i} - u_{j}) (h) \cdot e_{i j} (h \in 𝔥)$

which shows that the roots are $α = u_{i} - u_{j} (i \neq j)$ , with $𝔤_{α} = ℂ e_{i j}$ , and the root space decomposition is clear. From the above we can compute the Killing form on $𝔥$ :

$(ad h) (ad h') e_{i j} = (h_{i} - h_{j}) ({h'}_{i} - {h'}_{j}) e_{i j}$

so that

$\begin{matrix} ⟨ h, h' ⟩ & = & \sum_{i, j} (h_{i} - h_{j}) ({h'}_{i} - {h'}_{j}) \\ & = & 2 n \sum_{1}^{n} h_{i} {h'}_{i} \end{matrix}$

(remember that $\sum h_{i} = \sum {h'}_{i} = 0$ ). So it is a multiple of the obvious scalar product.
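The closed form $2 n \sum h_{i} {h'}_{i}$ can be compared with the double sum over the eigenvalues of $(ad h) (ad h')$ on the matrix units. The sketch below does this for one pair of trace-zero diagonal matrices; the function names and the sample values are made up for the illustration.

```python
def killing_form_on_cartan(h, h2):
    """Trace of (ad h)(ad h') for diagonal h, h' in sl_n.

    (ad h)(ad h') acts on e_ij by the scalar (h_i - h_j)(h'_i - h'_j) and by 0
    on the diagonal matrices, so the trace is the double sum below (the terms
    with i == j vanish).
    """
    n = len(h)
    return sum((h[i] - h[j]) * (h2[i] - h2[j])
               for i in range(n) for j in range(n))

def closed_form(h, h2):
    """2n * sum_i h_i h'_i, valid when both diagonals have trace zero."""
    return 2 * len(h) * sum(a * b for a, b in zip(h, h2))

# Sample diagonal entries with trace zero (n = 3).
h = [1, -1, 0]
h2 = [2, 1, -3]
print(killing_form_on_cartan(h, h2), closed_form(h, h2))
```

In general the double sum equals $2 n \sum h_{i} {h'}_{i} - 2 (\sum h_{i}) (\sum {h'}_{i})$ , which is why the trace-zero condition is needed.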

It is possible to choose roots $α_{1}, \dots, α_{l} (l = dim 𝔥)$ such that each root $α \in R$ is of the form $α = \sum_{1}^{l} n_{i} α_{i}$ with coefficients $n_{i} \in ℤ$ and either all $n_{i} \geq 0$ (positive roots) or all $n_{i} \leq 0$ (negative roots). The $α_{i}$ are called a set of simple roots or a basis $B$ of $R$ (they are also a basis of $𝔥^{*}$ ). Choose such a basis once for all. There is then a unique highest root, for which $\sum n_{i}$ is a maximum; and a unique lowest root, for which $\sum n_{i}$ is a minimum.

Weyl group

For each $α \in R$ , let $w_{α}$ denote the reflection in the hyperplane orthogonal to $α$ in $𝔥^{*}$ , so that

$w_{α} (λ) = λ - ⟨ λ, α^{\lor} ⟩ α (λ \in 𝔥^{*})$

where $α^{\lor} = 2 α / {\| α \|}^{2}$ is the coroot of $α$ . The reflections $w_{α_{i}}$ corresponding to the simple roots generate a finite group of isometries of $𝔥^{*}$ , called the Weyl group $W$ of $(𝔤, 𝔥)$ . Each reflection $w_{α}$ lies in $W$ ; $R$ is stable under $W$ ; and each root $α \in R$ is of the form $w α_{i}$ for some $w \in W$ and some simple root $α_{i}$ . Moreover, any other basis of $R$ is of the form $(w α_{1}, \dots, w α_{l}) = w B$ for some (unique) $w \in W$ .
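For ${𝔰 𝔩}_{n}$ these statements can be checked by brute force. The sketch below assumes the simple roots $u_{i} - u_{i + 1}$ , written as coordinate vectors of length $n$ , and uses the ordinary dot product in place of the form on $𝔥^{*}$ (the reflection formula is unchanged by rescaling the form). It enumerates the images $w B$ of the basis $B$ under the group generated by the simple reflections; the function names are hypothetical.

```python
from fractions import Fraction

def simple_roots(n):
    """Assumed simple roots u_i - u_{i+1} of sl_n as coordinate vectors."""
    return [tuple(1 if k == i else -1 if k == i + 1 else 0 for k in range(n))
            for i in range(n - 1)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def reflect(alpha, lam):
    """w_alpha(lam) = lam - <lam, alpha^vee> alpha, alpha^vee = 2 alpha/|alpha|^2."""
    c = Fraction(2 * dot(lam, alpha), dot(alpha, alpha))
    return tuple(x - c * a for x, a in zip(lam, alpha))

def weyl_orbit_of_basis(n):
    """All bases wB reachable from B by repeated simple reflections."""
    simple = simple_roots(n)
    start = tuple(simple)
    seen, frontier = {start}, [start]
    while frontier:
        basis = frontier.pop()
        for alpha in simple:
            image = tuple(reflect(alpha, beta) for beta in basis)
            if image not in seen:
                seen.add(image)
                frontier.append(image)
    return seen

orbit = weyl_orbit_of_basis(3)
roots = {beta for basis in orbit for beta in basis}
print(len(orbit), len(roots))
```

Because $w \mapsto w B$ is injective, the number of bases found is the order of $W$ , and the vectors occurring in them are the roots $w α_{i}$ .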

Cartan matrix

The numbers

$a_{i j} = ⟨ α_{i}^{\lor}, α_{j} ⟩ = \frac{2 ⟨ α_{i}, α_{j} ⟩}{⟨ α_{i}, α_{i} ⟩}$

are integers, and the matrix $A = {(a_{i j})}_{1 \leq i, j \leq l}$ is called the Cartan matrix of $𝔤$ . It is independent of the choices of $𝔥$ and of basis of $R$ . It satisfies the following conditions:

$(C)$	$a_{i i} = 2 \; (1 \leq i \leq l); \quad a_{i j} \leq 0 \text{ if } i \neq j; \quad a_{i j} = 0 \Leftrightarrow a_{j i} = 0$
$(P)$	All principal minors of $A$ are positive.

In the case of ${𝔰 𝔩}_{n} (ℂ)$ we may take $α_{i} = u_{i} - u_{i + 1} (1 \leq i \leq n - 1)$ . The Weyl group $W$ is the symmetric group $S_{n}$ (for $w_{α_{i}}$ interchanges $u_{i}$ and $u_{i + 1}$ and leaves the other $u_{j}$ fixed). Here the Cartan matrix is

$A = (\begin{matrix} 2 & - 1 & & & \\ - 1 & 2 & - 1 & & \\ & - 1 & 2 & ⋱ & \\ & & ⋱ & ⋱ & - 1 \\ & & & - 1 & 2 \end{matrix})$

(with $l = n - 1$ rows and columns).
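The matrix above can be recomputed from the simple roots. In this sketch the roots $u_{i} - u_{i + 1}$ are coordinate vectors and the ordinary dot product stands in for the form on $𝔥^{*}$ , which is harmless because the ratio defining the Cartan integers does not change when the form is rescaled. The function names are made up for the example.

```python
def simple_roots(n):
    """Assumed simple roots u_i - u_{i+1} of sl_n as coordinate vectors."""
    return [[1 if k == i else -1 if k == i + 1 else 0 for k in range(n)]
            for i in range(n - 1)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cartan_matrix(roots):
    """a_ij = 2 <alpha_i, alpha_j> / <alpha_i, alpha_i>, required to be integers."""
    matrix = []
    for ai in roots:
        row = []
        for aj in roots:
            q, r = divmod(2 * dot(ai, aj), dot(ai, ai))
            if r != 0:
                raise ValueError("Cartan integer is not an integer")
            row.append(q)
        matrix.append(row)
    return matrix

for row in cartan_matrix(simple_roots(4)):
    print(row)
```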

Generators and relations

The Cartan matrix $A$ determines $𝔤$ up to isomorphism.

Choose generators $e_{i} \in 𝔤_{α_{i}}, f_{i} \in 𝔤_{- α_{i}} (1 \leq i \leq l)$ such that $⟨ e_{i}, f_{i} ⟩ = 1,$ and elements $h_{1}, \dots, h_{l} \in 𝔥$ such that $⟨ h_{i}, h ⟩ = α_{i}^{\lor} (h)$ for all $h \in 𝔥$ , so that

$α_{j} (h_{i}) = ⟨ α_{i}^{\lor}, α_{j} ⟩ = a_{i j}$

Then the $3 l$ elements $e_{i}, f_{i}, h_{i}$ generate $𝔤$ subject to the following relations (Serre):

$\begin{matrix} [h_{i}, h_{j}] = 0 \text{ for all } i, j \\ [e_{i}, f_{j}] = δ_{i j} h_{i} \\ [h_{i}, e_{j}] = a_{i j} e_{j}; \quad [h_{i}, f_{j}] = - a_{i j} f_{j} \\ {(ad e_{i})}^{1 - a_{i j}} e_{j} = {(ad f_{i})}^{1 - a_{i j}} f_{j} = 0 \quad (i \neq j) . \end{matrix}$
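These relations can be verified by hand in ${𝔰 𝔩}_{n} (ℂ)$ , or by a short program. The sketch below assumes the generators $e_{i} = e_{i, i + 1}$ , $f_{i} = e_{i + 1, i}$ , $h_{i} = e_{i i} - e_{i + 1, i + 1}$ (matrix units, normalized so that $[e_{i}, f_{i}] = h_{i}$ , which is an assumption of the example rather than the Killing-form normalization) and checks that they satisfy every relation in the list. It does not show that the relations are a complete set, which is the substance of Serre's theorem.

```python
def unit(n, i, j):
    """Matrix unit e_ij (0-indexed) in M_n."""
    return [[1 if (r, c) == (i, j) else 0 for c in range(n)] for r in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def sub(a, b):
    return [[x - y for x, y in zip(r, s)] for r, s in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def bracket(a, b):
    return sub(matmul(a, b), matmul(b, a))

def ad_power(x, k, y):
    """(ad x)^k applied to y."""
    for _ in range(k):
        y = bracket(x, y)
    return y

def serre_relations_hold(n):
    """Check the listed relations for the assumed generators of sl_n."""
    idx = range(n - 1)
    e = [unit(n, i, i + 1) for i in idx]
    f = [unit(n, i + 1, i) for i in idx]
    h = [sub(unit(n, i, i), unit(n, i + 1, i + 1)) for i in idx]
    # Cartan matrix of sl_n: 2 on the diagonal, -1 next to it, 0 elsewhere.
    a = [[2 if i == j else -1 if abs(i - j) == 1 else 0 for j in idx] for i in idx]
    zero = [[0] * n for _ in range(n)]
    checks = []
    for i in idx:
        for j in idx:
            checks.append(bracket(h[i], h[j]) == zero)
            checks.append(bracket(e[i], f[j]) == (h[i] if i == j else zero))
            checks.append(bracket(h[i], e[j]) == scale(a[i][j], e[j]))
            checks.append(bracket(h[i], f[j]) == scale(-a[i][j], f[j]))
            if i != j:
                checks.append(ad_power(e[i], 1 - a[i][j], e[j]) == zero)
                checks.append(ad_power(f[i], 1 - a[i][j], f[j]) == zero)
    return all(checks)

print(serre_relations_hold(3), serre_relations_hold(4))
```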

The idea now is (roughly) the following: start with any matrix $A$ of integers satisfying ( $C$ ), and form the Lie algebra with the above generators and relations.

However there is one remark that should be made at this point. In the classical set-up (which I have just been describing) the Cartan matrix $A$ is nonsingular, the $h_{i} (1 \leq i \leq l)$ form a basis of the Cartan subalgebra $𝔥$ , and the simple roots $α_{j}$ form a basis of $𝔥^{*}$ . Now a generalized Cartan matrix may well be singular (and it would be foolish to exclude this possibility, because for the affine Lie algebras the Cartan matrix is singular).

References

I.G. Macdonald
Isaac Newton Institute for the Mathematical Sciences
20 Clarkson Road
Cambridge CB3 0EH U.K.

Version: October 30, 2001
