The Role of Affine Weyl groups in the representation theory of algebraic Chevalley groups and their Lie algebras

The Role of Affine Weyl groups in the representation theory of algebraic Chevalley groups and their Lie algebras
Part II
Representations and decompositions in characteristic $p \neq 0$

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last update: 2 April 2014

§ 4. Fundamental constants $c_{λ μ}$ and the character formula

As in Part I, let $Δ$ be an indecomposable reduced root-system, and $𝔤_{ℂ}$ be the simple Lie algebra over $ℂ$ (determined up to isomorphism) possessing $Δ$ as its root-system.

4.1. Let $G$ be an algebraic Chevalley group, i.e. a connected semisimple (affine) algebraic group, of type $𝔤_{ℂ}$ (or of type $Δ)$ defined over a field $K$ of characteristic $p \neq 0;$ one denotes by $G_{L}$ the group of $L -rational$ points of $G$ for $L ⫆ K .$ Thus, depending on one's approach, $G$ itself may be identified either with $G_{K}$ for the algebraic closure $\overline{K}$ of $K,$ or with $G_{L}$ for a very large algebraically closed (universal) field $L,$ or else with the functor $L \to G_{L};$ $G$ is characterised by its affine-ring $A = A_{K}$ (being $K [G]$ in the notation of [Bor1970], our canonical reference for standard facts summarized here too briefly), which is a Hopf algebra over $K,$ such that $G_{L}$ is identified with the maximal-ideal spectrum of $A_{L} = A \underset{K}{\otimes} L$ for every field $L ⫆ K,$ the group-operation on $G_{L}$ coming from the coproduct in $A_{L}$ in the standard manner. By a $G -module$ we shall always mean an algebraic $G -module$ defined over $K,$ using the words "module" and "representation" interchangeably. Thus $K$ is the underlying field of every representation space $V,$ and concretely the representation affords a homomorphism of $G_{L}$ into ${Aut}_{L} V_{L}$ for every overfield $L$ of $K,$ where $V_{L} = V \underset{K}{\otimes} L .$

Let us assume that $G$ is simply-connected, and split over $K .$ The former assumption means that every projective representation (i.e. algebraic homomorphism into $P G L (n))$ of $G$ comes from a $G -module$ (i.e. factors through $G L (n));$ this is particularly suitable for representation-theoretic purposes, and there is no loss in this assumption because of the existence and uniqueness of simply-connected (universal) coverings. The latter assumption means that $G$ contains a maximal torus $T$ which is defined and split over $K,$ i.e. is isomorphic over $K$ to ${(𝔾_{m})}^{l}$ where $𝔾_{m} = G L (1)$ is the "multiplicative group" (which, as a group-functor, assigns to $L ⫆ K$ the multiplicative group of $L).$ One knows that under these assumptions $G$ is uniquely determined (up to isomorphism) by $K$ and $Δ,$ and that $K$ may be taken to be the prime field $𝔽_{p};$ this means existence of an $𝔽_{p} -form$ of the affine-ring, which was described explicitly by Kostant purely in terms of the root-system $Δ$ (via the $ℤ -form$ $U_{ℤ}$ of the enveloping algebra $U_{ℂ}$ of $𝔤_{ℂ}$ which we shall describe below in some detail, without discussing the affine-ring further as it is not required by us here); cf. [Bor1970, § 4]. We find it convenient not to specify $K,$ but treat it as a floating (working) field. Thus the Lie algebra $𝔤_{K}$ of $G$ is a vector space over $K$ of dimension $dim G = dim 𝔤_{ℂ} = | Δ | + l .$ The elements of $𝔤_{K}$ may be identified with either the right or the left-invariant derivations of $A_{K} .$ It may be remarked that unlike the situation in characteristic 0, the object $𝔤_{K}$ is not entirely "local"; in particular, the Lie algebra of two isogenous groups (in the sense of algebraic groups) may not be isomorphic. In fact, one knows that this pathology arises if and only if $p$ divides the "connexion number" $[X : X']$ of $Δ$ where $X, X'$ are as in § 1; in this case $𝔤_{K}$ is nonsimple, and the only time that happens again is for types $F_{4}$ with $p = 2$ and $G_{2}$ with $p = 3;$ but we shall not require this fact. An explicit description of $𝔤_{K}$ in terms of $Δ,$ essentially due to Chevalley, is as follows.

Let $𝔤_{ℤ} = 𝔥_{ℤ} + \sum_{α \in Δ} 𝔤_{ℤ}^{α}$ be a Chevalley lattice in $𝔤_{ℂ} .$ This means the following: (i) $𝔥_{ℂ} = 𝔥_{ℤ} \underset{ℤ}{\otimes} ℂ$ is a Cartan subalgebra of $𝔤_{ℂ}$ on which elements of $Δ$ are considered as linear functions and $𝔥_{ℤ}$ consists of all $h \in 𝔥_{ℂ}$ such that $λ (h) \in ℤ$ for all $λ$ in the weight-lattice $X$ of $Δ$ (as defined in § 1, with $Δ \subset X \subseteq 𝔥_{ℂ}^{*} =$ vector space dual of $𝔥_{ℂ});$ thus if $h_{α} \in 𝔥_{ℂ}$ is defined by $λ (h_{α}) = ⟨ λ, α^{\lor} ⟩$ for all $λ \in 𝔥_{ℂ}^{*},$ then putting $h_{i} = h_{α_{i}}$ one has $𝔥_{ℤ} = \sum_{i = 1}^{l} ℤ h_{i};$ (ii) $𝔤_{ℂ}^{α} = 𝔤_{ℤ}^{α} \underset{ℤ}{\otimes} ℂ$ is the $α -rootspace$ of $𝔤_{ℂ}$ with respect to $𝔥_{ℂ},$ and if $e_{α}$ denotes a generator of the (additive) infinite-cyclic group $𝔤_{ℤ}^{α},$ then $[e_{α}, e_{- α}] = \pm 2 h_{α};$ and (iii) if $U_{ℤ}$ denotes the subring of the universal enveloping algebra $U_{ℂ}$ of $𝔤_{ℂ}$ generated by ${\frac{e_{α}^{n}}{n!} | α \in Δ, n \in ℤ^{+}},$ then $𝔤_{ℤ}$ is stable under the adjoint action of $U_{ℤ}$ on $𝔤_{ℂ} .$ It is customary to choose $e_{α}'s$ such that $[e_{α}, e_{- α}] = 2 h_{α},$ and we write $e_{i}$ for $e_{α_{i}}$ and $f_{i}$ for $e_{- α_{i}},$ for $1 ≦ i ≦ l .$ It may be remarked that the set ${\frac{e_{i}^{n}}{n!}, \frac{f_{i}^{n}}{n!} | 1 ≦ i ≦ l, n \in ℤ^{+}}$ generates $U_{ℤ}$ as a ring [Ver1972-2].

The Lie algebra $𝔤_{K}$ of $G$ can be identified with $𝔤_{Z} \underset{ℤ}{\otimes} K$ in such a way that $𝔥_{K} = 𝔥_{ℤ} \underset{ℤ}{\otimes} K$ the Lie algebra of the maximal torus $T,$ and identifying $X$ with the character group $X (T)$ of $T$ (i.e. the group of algebraic homomorphisms of $T$ into $𝔾_{m}$ with the group-operation given by tensoring these 1-dimensional $T -modules,$ which is written additively), the subspace $𝔤_{K} = 𝔤_{ℤ}^{α} \underset{ℤ}{\otimes} K$ of $𝔤_{K}$ becomes its $α -rootspace$ with respect to the adjoint action of $T .$ Thus, we are identifying $Δ$ with the root-system of $G$ with respect to $T,$ as is common practice, and this should lead to no confusion. In general, for any (algebraic) $T -module$ $M$ and for $λ \in X (T) = X$ we denote by $M_{(λ)}$ its $λ -weight-space$ consisting of all vectors in $M$ on which $T$ acts by $λ,$ so that one has $M = \sum_{λ \in X} M_{(λ)}$ in view of the complete reductibility of $T -modules;$ thus $𝔤_{K (α)}$ (for the adjoint action of $T$ on $𝔤_{K})$ is the same as $𝔤_{K}^{α} .$ The structure of a $T -module$ is determined by the dimensions of the various weight-spaces, i.e. by the element $char M$ of the group-ring of $X$ over $ℤ,$ called the formal character of $M,$ which assigns $λ \to dim M_{(λ)} .$ More precisely, denoting by $ℤ \cdot e^{X}$ the set of all $ℤ -linear$ combinations $\sum_{λ \in X} n_{λ} \cdot e^{λ}$ of formal symbols ${e^{λ} | λ \in X}$ where all but a finite number of $n_{λ}'s$ are zero, and making it into a ring under the usual coordinatewise addition and convolution multiplication with $e^{λ} \cdot e^{μ} = e^{λ + μ},$ we have $char M = \sum_{λ \in X} (dim M_{(λ)}) \cdot e^{λ} \in ℤ \cdot e^{X} .$ The formal character of a $G -module$ $M$ shall mean simply that of $M$ as a $T -module$ upon restriction. From the standard fact that the Weyl group $W$ of $Δ$ can be identified with the quotient $N / T$ in a canonical manner, where $N$ is the normalizer of $T$ in $G,$ it is immediate that the formal character of a $G -module$ is a $W -invariant$ element of $ℤ \cdot e^{X}$ under the natural action of $W .$

Let $B$ be the Borel subgroup of $G$ having as its Lie algebra $𝔟_{K} = 𝔥_{K} + \sum_{α \in Δ_{+}} 𝔤_{K}^{α},$ where $Δ_{+}$ is a given choice of positive roots in $Δ .$ One knows that every irreducible $G -module$ contains a unique 1-dimensional highest-weight-space, in the sense that it is a weight-space for $T$ whose elements remain fixed under the unipotent radical $[B, B]$ of $B .$ If this weightspace corresponds to the character $λ \in X,$ we say $λ$ is the highest weight of the given module. Necessarily $λ \in X^{+}$ (the dominant elements of $X$ with respect to the fixed choice of positive roots), and if $μ \in X$ is a "weight" of the given $G -module,$ in the sense that the $μ -weight-space$ is nonzero, then $λ - μ \in ℤ \cdot Δ_{+},$ which we rewrite as $μ ≦ λ$ by setting up the canonical partial-order $≦$ on $X$ compatible with the addition under which $ℤ^{+} \cdot Δ_{+}$ is the set of all elements $≧ 0 .$ Conversely, to every $λ \in X^{+}$ there corresponds an irreducible $G -module$ $M_{λ}$ with highest weight $λ,$ which is unique up to isomorphism. As stated in the Introduction, one of the long-standing unsolved problems in the study of $G -modules$ is to give a formula for $char M_{λ}$ in terms of $λ \in X^{+} .$

4.2. A famous theorem of Steinberg reduces the study of all irreducible $G -modules$ $M_{λ}$ for $λ \in X^{+},$ to that of the $p^{l}$ basic ones with $λ$ lying in the set $X^{♯} = {\sum_{i} m_{i} λ_{i} \in X | 0 ≦ m_{i} ≦ p - 1 for 1 ≦ i ≦ l};$ the elements of $X^{♯}$ will be called the "restricted dominant characters". The formulation of this theorem hinges on the fact that $G$ is defined over $𝔽_{p}$ (as stated earlier) and that all the $M_{λ}'s$ are definable over $𝔽_{p};$ we will be able to see the latter fact from the construction of $M_{λ}$ by the standard technique of "reduction $mod p ",$ which we shall need to describe presently in some detail. Now if $M$ is any $G -module$ defined over the prime field, one defines its Frobenius-twist $M^{Fr}$ to be the $G -module$ whose underlying space is $M$ itself, such that the new action of $g \in G_{L}$ on $M_{L} = M \underset{K}{\otimes} L$ is obtained as follows: take a basis of the $𝔽_{p} -form$ $M_{𝔽_{p}}$ of $M = M_{K}$ and write the matrix of (the old action of) $g$ with respect to this basis of $M_{L};$ then the new matrix obtained by replacing each entry by its $p^{th}$ power gives the new (Frobenius-twisted) action of $g$ on $M_{L}$ with respect to the same basis. It is immediate that the $μ -weight-space$ of $M$ coincides with the $p μ -weight-space$ of $M^{Fr},$ which gives $char M^{Fr} = {(char M)}^{Fr},$ where $for χ = \sum_{λ} n_{λ} e^{λ} \in ℤ e^{X}, we define χ^{Fr} = \sum_{λ} n_{λ} e^{p^{λ}} .$ Steinberg's tensor-product theorem then essentially says that for $λ \in X^{+}$ and $μ \in X^{♯}$ the module $M_{λ}^{Fr} \otimes M_{μ}$ is irreducible and hence isomorphic to $M_{p λ + μ};$ in particular, $M_{λ}^{Fr} ≅ M_{p λ} .$ It follows that if $λ \in X^{+}$ has the $p -expansion$ $λ = μ_{0} + μ_{1} p + \dots + μ_{k} p^{k}, with μ_{j} \in X^{♯} for 0 ≦ j ≦ k,$ then $M_{λ} ≅ M_{μ_{0}} \otimes M_{μ_{1}}^{Fr} \otimes \dots \otimes M_{μ_{k}}^{{Fr}^{k}},$ where ${Fr}^{j}$ denotes the $j -fold$ iteration of Fr, and consequently, $\begin{matrix} char M_{λ} = \prod_{j = 0}^{k} {(char M_{μ_{j}})}^{{Fr}^{j}} . & (4.1) \end{matrix}$

4.3. In order to define the constants $c_{λ μ}$ in the title of this section, we need to quickly introduce certain $G -modules$ $V_{λ, K}$ which are modelled after the irreducible $𝔤_{ℂ} -module$ $V_{λ, ℂ}$ with highest weight $λ \in X^{+},$ so that the formal character of $V_{λ, K}$ coincides with that of $V_{λ, ℂ} .$ We caution in advance that $V_{λ, K}$ is not uniquely determined by $λ$ alone; however, there are only a finite number of choices (up to isomorphism). The formal character of $V_{λ, K}$ is determined by $λ,$ and is given by Weyl's character formula as in characteristic 0; thus $char V_{λ, K}$ is independent of $p .$ Also (by Proposition 2 below) the composition factors of $V_{λ, K}$ are determined by $λ$ alone; these are far from being independent of $p$ though, and in fact for every $λ$ one can find a bound for $p$ beyond which $V_{λ, K}$ is irreducible. This is not a new result, and follows from the "discriminant criterion" described below.

Let $V_{λ, ℤ}$ be a $U_{ℤ} -stable$ lattice in $V_{λ, ℂ}$ ("admissible $ℤ -form"$ in the terminology of [Bor1970]). One sees that up to dilatation, i.e. multiplication by a nonzero scalar, there are only a finite number of choices for $V_{λ, ℤ};$ this follows from the fact that if an arbitrary nonzero element $v_{0}$ in the highest-weightspace of $V_{λ, ℂ}$ is required to be a primitive $(=$ nondivisible) vector in $V_{λ, ℤ}$ (as can always be ensured upon dilatation, as the highest-weight-space is 1-dimensional), then there exist two uniquely determined extremes for $V_{λ, ℤ} :$ $V_{λ, ℤ}^{min} ≦ V_{λ, ℤ} ≦ V_{λ, ℤ}^{max} .$ It is clear that the minimal lattice $V_{λ, ℤ}^{min}$ has to be simply the image $v_{0} U_{ℤ}$ of $v_{0}$ under the action of $U_{ℤ}$ on $V_{λ, ℂ} .$ Let $v_{0}^{*} \in {(V_{λ, ℂ})}^{*} =$ the vector space dual to $V_{λ, ℂ},$ be the linear function having value 1 at $v_{0}$ and vanishing at all the weight-spaces of $V_{λ, ℂ}$ except the highest one. Then $v_{0}^{*}$ is a lowest-weight-vector of the contragredient $𝔤_{ℂ} -module$ ${(V_{λ, ℂ})}^{*} = V_{λ^{*}, ℂ}$ with highest weight $λ^{*} = λ^{τ_{0}} = - λ^{σ_{0}} .$ Then, setting $V_{λ^{*}, ℤ}^{min} = v_{0}^{*} U_{ℤ},$ one has $V_{λ, ℤ}^{max} = {x \in V_{λ, ℂ} | y (x) \in ℤ for all y \in V_{λ^{*}, ℤ}^{min}} .$ An alternate way of describing this is in terms of the "contragredient bilinear form" introduced by W. J. Wong [Won1971]:

Let $θ$ denote the (unique) anti-automorphism of $U_{ℂ}$ which interchanges $e_{i}$ and $f_{i}$ for $1 ≦ i ≦ l$ (and hence leaves the $h_{i}'s$ fixed); the existence of $θ$ is classical and follows, for example, from the now well-known Harish-Chandra—Jacobson—Serre presentation of the Lie algebra $𝔤_{ℂ}$ (equivalently, of the associative algebra $U_{ℂ})$ in terms of the generators ${e_{1}, e_{2}, \dots, e_{l}, f_{1}, f_{2}, \dots, f_{l}} .$ Then $V_{λ, ℂ}$ possesses a unique symmetric bilinear form $(,)$ such that $(v_{0}, v_{0}) = 1$ and $(x \cdot e_{i}, y) = (x, y \cdot f_{i})$ for $1 ≦ i ≦ l$ and $x, y \in V_{λ, ℂ};$ more generally $(x \cdot u, y) = (x, y \cdot u^{θ})$ for $u \in U_{ℂ},$ and $(,)$ is determined by the fact that $(v_{0} \cdot u, v_{0} \cdot u') = c$ if the projection of $v_{0} u' u^{θ}$ on the highest-weight-space along the sum of all other weight-spaces is $c v_{0} .$ The restriction of $(,)$ to $V_{λ, ℤ}^{min}$ takes integer values, and one has $V_{λ, ℤ}^{max} = {x \in V_{λ, ℂ} | (x, y) \in ℤ for all y \in V_{λ, ℤ}^{min}} .$

The "reduction mod $p "$ $V_{λ, K} = V_{λ, ℤ} \underset{ℤ}{\otimes} K$ of any $U_{ℤ} -stable$ lattice $V_{λ, ℤ}$ in $V_{λ, ℂ},$ becomes a $G -module$ in a natural manner [Bor1970] (here simply-connectedness of $G$ is required, just as in characteristic 0), such that the associated infinitesimal action of $𝔤_{K}$ coincides with that of $𝔤_{ℤ} \underset{ℤ}{\otimes} K$ (which acts naturally on $V_{λ, ℤ} \underset{ℤ}{\otimes} K,$ since $𝔤_{ℤ} \subset U_{ℤ}),$ with which $𝔤_{K}$ has already been identified. The module $V_{λ, K}$ need not be irreducible as we shall see presently, and its internal structure may vary with the choice of $V_{λ, ℤ} .$ Perhaps $V_{λ, K}$ is always indecomposable; this is certainly true of the two extreme choices of $V_{λ, ℤ} .$ In fact $V_{λ, K}^{min} = V_{λ, ℤ}^{min} \underset{ℤ}{\otimes} K$ has a unique maximal proper $G -submodule$ (contained in the sum of all the weight-spaces except the highest one), the quotient by which is the irreducible module $M_{λ};$ and on the other extreme $V_{λ, ℤ}^{max} \underset{ℤ}{\otimes} K = V_{λ, K}^{max} = {(V_{λ^{*}, K}^{min})}^{*}$ has $M_{λ}$ as its unique irreducible submodule, on account of the duality given by the last equality. The injection $\begin{matrix} i_{λ, ℤ} : & V_{λ, ℤ}^{min} & ↪ & V_{λ, ℤ}^{max} \end{matrix}$ gives, on reduction mod $p,$ only a homomorphism $\begin{matrix} i_{λ, K} : & V_{λ, K}^{min} & ⟶ & V_{λ, K}^{max}, \end{matrix}$ whose kernel is the maximal proper submodule of $V_{λ, ℤ}^{min}$ and image is the unique copy of $M_{λ}$ in $V_{λ, K}^{max};$ in other words, $i_{λ, K}$ factors through $M_{λ} .$ Hence we have the "discriminant criterion": $V_{λ, K}$ is irreducible if and only if $p$ does not divide the index $[V_{λ, ℤ}^{max} : V_{λ, ℤ}^{min}];$ this index equals the discriminant of Wong's contragredient form $(,)$ on $V_{λ, ℤ}^{min}$ described above (cf. [Bor1970, Theorem 2J). It does not seem easy to give a formula for this number. (See ${Footnote}^{3} .)$ It would be particularly interesting to determine all "small" prime divisors of this index for each $λ;$ prime divisors not exceeding the Coxeter number $h_{Δ}$ of $Δ$ are of special interest, since the ideas of the next section provide an alternate method towards deciding the irreducibility of $V_{λ, K}$ particularly when $p > h_{Δ}$ (cf. Proposition 3).

It may be mentioned at this point that for the case of type $A_{1},$ i.e. for $G ≅ {S L}_{2},$ all $V_{λ, K}$ for $λ \in X^{♯}$ are irreducible. But this is an exceptional situation; it will be shown in a later paper that when $Δ$ is not of type $A_{1},$ there are values of $λ \in X^{♯}$ (in fact, for large $p,$ a sizeable percentage of values exist) such that $V_{λ, K}$ is not irreducible. The first example of a non-irreducible $V_{λ, K}$ with $λ \in X^{♯}$ was given by Curtis [Cur1960, p. 859], for type $A_{2}$ $(G ≅ {S L}_{3}) .$

4.4. Let us write ${\overline{V}}_{λ} = V_{λ, K}^{min},$ and for $μ \in X^{+}$ define $c_{λ μ}$ to be the multiplicity ${[{\overline{V}}_{λ} : M_{μ}]}_{G},$ of the occurrence of $M_{μ}$ as a composition factor of the $G -module$ ${\overline{V}}_{λ} .$ (See ${Footnote}^{4} .)$ It is clear that $c_{λ λ} = 1,$ and that $c_{λ μ} \neq 0$ implies $(μ$ is a weight of ${\overline{V}}_{λ}$ and therefore) $μ ≦ λ;$ in particular, $λ - μ \in X'$ is a necessary condition for $c_{λ μ} \neq 0 .$ We have ${\overline{V}}_{λ} \sim_{G} \sum_{μ \in X^{+}} c_{λ μ} M_{μ},$ where $\sim_{G}$ denotes equivalence in the Grothendieck ring of the category of $G -modules,$ and thus $\begin{matrix} \sum_{μ} c_{λ μ} \cdot char M_{μ} = char {\overline{V}}_{λ} = \frac{\sum_{σ \in W} det σ \cdot e^{({(λ + δ)}^{σ})}}{\sum_{σ \in W} det σ \cdot e^{(λ^{σ})}} & (4.2) \end{matrix}$ where the second equality comes from the Weyl's character formula for $V_{λ, ℂ} .$

The linear equations (4.2) for the "unknowns" $χ_{μ} = char M_{μ}$ can be solved in terms of the "known" quantities ${\overline{χ}}_{μ} = char {\overline{V}}_{μ},$ in the form $χ_{λ} = \sum_{μ} γ_{λ μ} {\overline{χ}}_{μ},$ where the matrix ${[γ_{λ μ}]}_{X^{+}}$ is the inverse of the infinite matrix ${[c_{λ μ}]}_{X^{+}},$ the outer subscript $X^{+}$ denoting the set over which the matrix indices $λ, μ$ run. With respect to any enumeration of the set $X^{+}$ which is stronger than the partial order $≦$ (definition at the end of § 4.1), the matrix ${[c_{λ μ}]}_{X^{+}}$ is unipotent in the sense that it is triangular with diagonal entries all 1. As a matter of fact, in order to compute $χ_{μ_{0}}$ for given $μ_{0} \in X^{+},$ one needs to consider (4.2) only for $λ$ ranging over the finite set $Y = Y (μ_{0})$ consisting of those $λ \in X^{+}$ such that there exists a sequence $μ_{1}, μ_{2}, \dots, μ_{k - 1}, μ_{k} = λ$ with $c_{μ_{j - 1} μ_{j}} \neq 0$ for $1 ≦ j ≦ k .$ Since for $λ \in Y,$ $μ \in X^{+},$ $c_{λ μ} \neq 0 implies μ \in Y (λ) ⫅ Y (μ_{0}) = Y,$ we have for $λ \in Y,$ $\sum_{μ \in Y} c_{λ μ} χ_{μ} = {\overline{χ}}_{λ},$ i.e. only the finite segment ${[c_{λ μ}]}_{Y}$ of ${[c_{λ μ}]}_{X^{+}}$ needs to be inverted.

Proposition 1 (Character formula). There exist unique integers $γ_{λ μ}$ for $λ, μ \in X^{+},$ such that for given $λ$ only finitely many $γ_{λ μ}$ are nonzero and $\begin{matrix} char M_{λ} = \frac{\sum_{μ} (\sum_{σ \in W} det σ \cdot γ_{λ μ} \cdot e^{({(μ + δ)}^{σ})})}{\sum_{σ \in W} det σ \cdot e^{(δ^{σ})}} . & (4.3) \end{matrix}$

By virtue of the standard fact that the set ${{\overline{χ}}_{λ} | λ \in X^{+}}$ is a $ℤ -free$ basis of the $W -invariants$ in the group-ring $ℤ \cdot e^{X}$ (see [Bou1968]: VI. 3.4, Proposition 3 and Example 2, on p. 187-188), and the fact that the matrix ${[γ_{λ μ}]}_{X^{+}}$ in $\sum_{μ} γ_{λ μ} {\overline{χ}}_{μ} = χ_{λ}$ is unipotent, one finds that ${χ_{λ} | λ \in X^{+}}$ is also a $ℤ -free$ basis of the formal characters of $G -modules.$ Hence we have

Proposition 2. The composition factors of a $G -module$ are determined by its formal character.

Because of (4.1) one is primarily interested in computing the formal characters of the $p^{l}$ basic irreducible $G -modules$ with highest weight in $X^{♯} .$ We claim that these are completely determined by our knowledge of the "decomposition" ${\overline{V}}_{λ} \sim_{G} \sum_{μ \in X^{+}} c_{λ μ} M_{μ}$ for $λ$ ranging in $X^{♯}$ alone. To see this, let us impose the new partial-order $≦^{♯}$ on $X^{♯},$ defined to be the weakest one such that whenever $μ_{0} + μ_{1} p + \dots + μ_{k} p^{k} ≦ λ$ with $λ, μ_{0}, μ_{1}, \dots, μ_{k}$ all in $X^{♯},$ one has $μ_{i} ≦^{♯} λ$ for $0 ≦ i ≦ k;$ and let $≦'$ denote any linear order on $X^{♯}$ stronger than $≦^{♯} .$ Then $χ_{λ}$ can be computed, recursively with respect to $≦',$ i.e. assuming that $χ_{μ}$ is known for all $μ ≦' λ,$ simply by substitution in $χ_{λ} = {\overline{χ}}_{λ} - \sum_{\underset{μ \neq λ}{μ \in X^{+}}} c_{λ μ} χ_{μ},$ where the $χ_{μ}'s$ appearing on the right-hand side are known by virtue of (4.1) and the definition of $≦^{♯} .$ Our claim is proved; and as a corollary we note that a knowledge of ${[c_{λ μ}]}_{λ \in X^{♯}, μ \in X^{+}}$ determines, by virtue of Proposition 2, the entire matrix ${[c_{λ μ}]}_{X^{+}} .$ It would be very interesting to devise a formula (short of a computer algorithm) which enables one to write down, efficiently and explicitly, the decompositions of ${\overline{V}}_{λ}$ for all $λ \in X^{+}$ in terms of those for $λ \in X^{♯}$ alone. (Here and elsewhere in this paper, by the "decomposition" of a module we mean that in the Grothendieck group of the category of modules at hand.)

4.5. When we substitute $χ_{μ} = \prod_{j} {(χ_{μ_{j}})}^{{Fr}^{j}}$ for $μ = \sum_{j} μ_{j} p^{j},$ with all $μ_{j}'s$ in $X^{♯},$ in $\sum_{μ} c_{λ μ} χ_{μ} = {\overline{χ}}_{λ},$ what we get is a set of nonlinear equations in ${χ_{ν} | ν \in X^{♯}} .$ However, under certain circumstances, it is possible to replace these by a set of linear equations $\begin{matrix} \sum_{μ \in X^{♯}} C_{λ μ} χ_{μ} = χ_{λ} for λ \in X^{♯}, & (4.4) \end{matrix}$ where the coefficients $C_{λ μ}$ are no longer integers, but instead lie in the group-ring $ℤ \cdot e^{X}$ itself; in fact $C_{λ μ}$ is a $W -invariant$ element of the subring $ℤ \cdot e^{p X}$ consisting of ${χ^{Fr} | χ \in e^{X}} .$ To this end let us define $Y_{0} = {ν \in X^{+} | c_{λ, μ + ν p} \neq 0 for some λ, μ \in X^{♯}};$ then $\begin{matrix} χ_{λ} & = & \sum_{\underset{ν \in Y_{0}}{μ \in X^{♯}}} c_{λ, μ + ν p} \cdot χ_{μ + ν p} \\ = & \sum_{μ \in X^{♯}} (\sum_{ν \in Y_{0}} c_{λ, μ + ν p} \cdot χ_{ν}^{Fr}) χ_{μ}, \end{matrix}$ which gives (4.4) with $\begin{matrix} C_{λ μ} = {(\sum_{ν \in Y_{0}} c_{λ, μ + ν p} \cdot χ_{ν})}^{Fr} . & (4.5) \end{matrix}$ Our "coefficients" in (4.4) till now are themselves polynomials in the unknowns ${χ_{ν} | ν \in X} .$ If for a given value of $p$ it can be shown that ${\overline{V}}_{ν}$ is irreducible for $ν \in Y_{0},$ one can replace $χ_{ν}$ in (4.5) by the known quantity ${\overline{χ}}_{ν};$ in Corollary to Proposition 3 in § 5, we shall see that such is indeed the case at least for $p ≧ 2 h_{Δ} - 3,$ using the fact that $\begin{matrix} Y_{0} ⫅ {\sum_{i} m_{i} λ_{i} \in X^{+} | \sum_{i = 1}^{l} m_{i} n_{i} ≦ h_{Δ} - 2}, & (4.6) \end{matrix}$ where $h_{Δ} = (\sum_{i} n_{i}) + 1$ is the Coxeter number as in § 1.

To see (4.6) we note that $\begin{matrix} Y_{0} & ⫅ & {ν \in X^{+} | μ + p ν ≦ λ for some λ, μ \in X^{♯}} \\ ⫅ & {ν \in X^{+} | ⟨ μ + p ν, - α_{0}^{\lor} ⟩ ≦ ⟨ λ, - α_{0}^{\lor} ⟩ for some λ, μ \in X^{♯}} \\ ⫅ & {ν \in X^{+} | ⟨ p ν, - α_{0}^{\lor} ⟩ ≦ ⟨ (p - 1) δ, - α_{0}^{\lor} ⟩} \\ = & {ν \in X^{+} | ⟨ ν, - α_{0}^{\lor} ⟩ ≦ \frac{p - 1}{p} (h_{Δ} - 1)}, \end{matrix}$ where the second (resp. the third) inclusion follows in view of the fact that $- α_{0}$ has non-negative inner product with all the fundamental roots (resp. all the fundamental weights), and hence also with all elements of $ℤ_{+} \cdot Δ_{+}$ (resp. all elements of $X^{♯} \subset X^{+},$ and so also all elements of the form $(p - 1) δ - λ$ for $λ \in X^{♯}).$ The inclusion (4.6) follows in view of the fact that $h_{Δ} = \frac{2}{r} l$ an integer. This also shows that $Y_{0} ⫅ X^{♯}$ if $p ≧ h_{Δ} - 2 .$ It is shown in § 5.4 (Example 2) that $Y_{0} = {0}$ for types $A_{2}$ and $B_{2}$ (and of course also for type $A_{1}).$ It seems likely that for all other types $Y_{0} ⫋ {0} .$

Once again, the linear equations (4.4) can be solved (in those cases where $C_{λ μ}'s$ are known) in the same way as (4.2) earlier, because the matrix ${[C_{λ μ}]}_{X^{♯}}$ is unipotent with respect to $≦'$ (any linear-order on $X^{♯}$ stronger than $≦^{♯}):$ ${\begin{matrix} C_{λ λ} = 1 = the identity element of ℤ \cdot e^{X}, and \\ C_{λ μ} \neq 0 implies (μ + ν p ≦ λ for some ν \in X^{+} and hence) μ ≦^{♯} λ . \end{matrix}$ The fact that $C_{λ μ}'s$ are themselves formal characters is no obstacle to inverting the unipotent matrix as the inverse can be explicitly written down as a polynomial in the $C_{λ μ}'s.$ In fact there is no need to handle the entire matrix of size as large as $| X^{♯} | = p^{l};$ in order to compute $χ_{μ_{0}}$ for $μ_{0} \in X^{♯}$ one need consider (4.4) simply for $λ$ ranging in the set $Y' = Y' (μ_{0})$ consisting of $λ \in X^{♯}$ such that there exists a sequence $μ_{1}, μ_{2}, \dots, μ_{k}$ with $C_{μ_{j - 1} μ_{j}} \neq 0$ for $1 ≦ j ≦ k .$ The validity of the "Harish-Chandra principle" to be discussed in § 5 implies that the cardinality of $Y'$ never exceeds $| W |,$ which is quite comfortable for large $p .$ Note that $Y (μ) ⫅ Y' (μ_{0}) + p Y_{0};$ the set $Y_{0}$ can also be "relativized", i.e. replaced by a subset $Y_{0} (μ_{0})$ of cardinality bounded by a number independent of $p,$ which would show that $| Y |$ is also bounded by a number independent of $p .$

$^{3}$ (Added in Proof.) This problem does not seem intractable now. First note that the required index $N_{λ} = [V_{λ, ℤ}^{max} : V_{λ, ℤ}^{min}]$ for $λ \in X^{+},$ is the product of the partial indices $N_{λ, μ} = [V_{λ, ℤ (μ)}^{max} : V_{λ, ℤ (μ)}^{min}]$ as $μ$ ranges over all the weights (since every $U_{ℤ} -stable$ $ℤ -form$ is compatible with the weight-space decomposition); clearly it suffices to determine $N_{λ, μ}$ for $μ \in X^{+} .$ I have solved this problem only when $μ$ is "sufficiently close" to $λ$ in this special sense: the dimension of the $μ -weightspace$ of $V_{λ, ℂ}$ equals $P (λ - μ)$ (which is the leading term in Kostant's multiplicity formula) where $P ()$ is Kostant's partition function. Based on that result it seems plausible that the value of $N_{λ, μ}$ is given in general by $N_{λ, μ} = \prod_{α \in Δ^{+}} \prod_{j \in ℤ^{+}} {(\binom{(λ + δ) (h_{α}) - 1}{j})}^{m_{j} (α)} ((\binom{\cdot}{\cdot}) being the usual binomial symbol)$ where $m_{j} (α) = m_{j}^{λ; μ} (α)$ is the dimension of the subspace of $V_{λ, ℂ (μ)}$ annihilated by $e_{α}^{j + 1}$ but not by $e_{α}^{j};$ when $μ$ is sufficiently close to $λ$ it is easily seen that $m_{j} (α) = P (λ - μ - j α) - P (λ - μ - (j + 1) α) .$ The suggested expression for $N_{λ, μ}$ is of particular interest for $μ = 0,$ since it appears in a crucial manner in the work of K. R. Parthasarathy, R. Ranga Rao and V. S. Varadarajan (Ann. Math. 85 (1967), Theorem 4.2 on page 424) on the principal series representations of complex semi-simple Lie groups; cf. § 8.5.10(b) in Dixmier's book cited in Footnote $^{5} .$ My proof of this formula (for $μ$ sufficiently close to $λ)$ is obtained by refining certain ideas of N. N. Sapovalov (cf. § 7.8.23 in Dixmier's book) that are not unrelated to the computations of the 3 authors quoted above; it will be published in due course.

In this connection, reader's attention may also be drawn to N. Burgoyne's ideas in § 1 of "Modular representations of some finite groups" in Proc. of Symposia in Pure Mathematics, Vol. XXI, AMS, Providence, 1971.

$^{4}$ (Added in Proof.) In a recent preprint (titled 'On the modular representations of the general linear and symmetric groups') R. W. Carter and G. Lusztig have introduced the imaginative term "Weyl Module" for ${\overline{V}}_{λ},$ albeit in the special situation of the general (or the special) linear groups. The allusion to Weyl is to emphasize the fact that the dimension is given by Weyl's formula as in characteristic zero: $dim {\overline{V}}_{λ} = D (λ + δ);$ in the special situation of Carter and Lusztig it has an additional association with the treatment given by H. Weyl (in his classic monograph "Classical groups") to the problem of decomposing the tensor spaces in accordance with certain symmetries. Our notation ${\overline{V}}_{λ}$ conforms that in [Hum1971].

Notes and References

This is an excerpt of the paper The Role of Affine Weyl groups in the representation theory of algebraic Chevalley groups and their Lie algebras by Daya-Nand Verma. It appeared in Lie Groups and their Representations, ed. I.M.Gelfand, Halsted, New York, pp. 653–705, (1975).

page history