The Schubert Calculus and Eigenvalue Inequalities for Sums of Hermitian Matrices

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last updated: 14 October 2014

The Symmetric Algebra and Schur Functions

The formulae and rules making up the Schubert Calculus appear in numerous other setting, many of which provide additional computational rules. These other settings include:

1)	Representation theory of the symmetric group;
2)	Representation theory of the full linear group;
3)	Multiplication of Schur functions in the algebra of symmetric polynomials.

In this section we consider topic 3), show the relationships between it and the Schubert calculus, and derive some new results which give another characterization of the set $V_{d}^{n} .$

Partitions, Young Diagrams

By a partition we mean a sequence of integers $λ = (λ_{1}, λ_{2}, \dots)$ satisfying $λ_{1} \geq λ_{2} \geq \dots \geq 0 .$ We say $λ$ is a partition of $w,$ or has weight $w,$ if $\sum_{i = 1}^{\infty} λ_{i} = w .$ The length of $λ$ is the number of nonzero integers among $λ_{1}, λ_{2}, \dots .$ If $λ$ has length $r,$ we write $λ = (λ_{1}, λ_{2}, \dots, λ_{r}) .$ We let $Π$ denote the set of all partitions, and $Π_{d}$ the set of partitions of length at most $d .$ The set $Π$ may be given a partial ordering $\leq,$ defined as follows: $μ \leq λ ⟺ μ_{i} \leq λ_{i} for i = 1, 2, 3, \dots .$

The diagram $Y (λ)$ associated with a partition $λ$ is a left-justified array of squares with $λ_{i}$ squares in the $i^{th}$ row.

Example: $λ = (4, 2, 1)$ $Y (λ) = \begin{matrix} \end{matrix}$ If $μ$ and $λ$ are partitions with $μ \leq λ,$ then the skew-diagram $Y (\frac{λ}{μ})$ is obtained by removing $Y (μ)$ from $Y (λ) .$ For example, if $λ = (5, 4, 3, 1, 1)$ and $μ = (4, 3, 1, 1, 0)$ then $Y (\frac{λ}{μ})$ looks like $\begin{matrix} \end{matrix}$

Lastly, the partition dual to $λ,$ denoted by $\tilde{λ},$ is the partition whose diagram is obtained from $Y (λ)$ by interchanging rows and columns. The "jump" function $δ_{X}$ can be used to express the components of $\tilde{λ}$ in terms of the components of $λ :$ ${\tilde{λ}}_{j} = \sum_{i = 1}^{\infty} δ_{λ_{i}} (j) .$

Symmetric functions

Let $A$ denote the ring of symmetric polynomials (also called symmetric functions) in an infinite number of variables $x_{1}, x_{2}, \dots$ with integer coefficients (properly called symmetric formal power series). We review some basis results about this ring (see [Sta1976] and [Lid1950]).

1)	Let $a_{r} = \sum_{σ \in Q_{r, \infty}} \prod_{i = 1}^{r} x_{σ (i)};$ $a_{r}$ is called the $r^{th}$ elementary symmetric function. The "fundamental theorem of symmetric functions" states that the elementary symmetric functions are algebraically independent and that $A = Z [a_{1}, a_{2}, \dots],$ i.e. every symmetric function in $A$ may be expressed uniquely as a polynomial in $a_{1}, a_{2}, \dots$ with integer coefficients.
2)	Let $G_{r, \infty}$ denote the set of all non-decreasing sequences of $r$ integers chosen from $1, 2, \dots .$ Define $h_{r} = \sum_{σ \in G_{r, \infty}} \prod_{i = 1}^{r} x_{σ (i)};$ $h_{r}$ is called the $r^{th}$ complete homogeneous product. It is the sum of all monomials in $A$ of degree $r .$
3)	Let $S_{r} = \sum_{i = 1}^{\infty} x_{i}^{r};$ $S_{r}$ is called the $r^{th}$ power symmetric function. By convention, $a_{0} = h_{0} = S_{0} = 1,$ and $a_{r} = h_{r} = S_{r} = 0$ for $r < 0 .$

The $a_{r}$ and $S_{r}$ are connected by the classical

Newton's identities:

$\begin{array}{rcl} S_{1} - a_{1} & = & 0, \\ S_{2} - S_{1} a_{1} + 2 a_{2} & = & 0, \\ S_{3} - S_{2} a_{1} + S_{1} a_{2} - 3 a_{3} & = & 0, \\ ⋮ \\ S_{N} - S_{N - 1} a_{1} + \dots + {(- 1)}^{N} N a_{N} & = & 0, \\ ⋮ \end{array}$

If one solves for the $a_{r}'s$ in terms of the $S_{r}'s,$ one obtains $r! \cdot a_{r} = det (Z_{r}),$ where $\begin{matrix} Z_{r} = (\begin{matrix} S_{1} & 1 & 0 & 0 & \dots & 0 \\ S_{2} & S_{1} & 2 & 0 & \dots & 0 \\ S_{3} & S_{2} & S_{1} & 3 & 0 & \dots & 0 \\ ⋮ & ⋮ \\ r - 1 \\ S_{r} & S_{r - 1} & S_{1} \end{matrix}) & (*) \end{matrix}$ The $S_{r}'s$ may be expressed in a similar fashion in terms of the $a_{r}'s;$ hence the power sums $S_{1}, S_{2}, \dots$ are also algebraically independent and $A = Z [S_{1}, S_{2}, \dots] .$

The following relations (due to Brioschi) connect the $S_{r}'s$ and the $h_{r}'s:$ $\begin{array}{rcl} S_{1} - h_{1} & = & 0, \\ S_{2} + S_{1} h_{1} - h_{2} & = & 0, \\ S_{3} + S_{2} h_{1} + S_{1} h_{2} - h_{3} & = & 0, \\ ⋮ \\ S_{N} + S_{N - 1} h_{1} + \dots - h_{N} & = & 0, \\ ⋮ \end{array}$

If one solves for the $h_{r}'s$ in terms of the $S_{r}'s,$ one obtains $\begin{array}{rclr} r! \cdot h_{r} & = & det ∣ \begin{matrix} S_{1} & - 1 & 0 & 0 & \dots & 0 \\ S_{2} & S_{1} & - 2 & 0 & \dots & 0 \\ S_{3} & S_{2} & S_{1} & - 3 & 0 & \dots & 0 \\ ⋮ & ⋮ \\ - r + 1 \\ S_{r} & S_{r - 1} & S_{1} \end{matrix} ∣ & (**) \\ = & permanent (Z_{r}) . \end{array}$ Again, the complete homogeneous product sums are algebraically independent, and $A = Z [h_{1}, h_{2}, \dots] .$

Another class of symmetric functions, the Schur symmetric functions, is also of particular interest. Let $λ$ be a partition. The Schur symmetric function $e_{λ}$ may be defined in numerous ways. We give the definition originally used by Schur. Let $S_{r}$ denote the symmetric group on $r$ symbols, and let $k^{(λ)}$ be the character of $S_{r}$ corresponding to $λ,$ as given in Young diagram theory. The $λ -immanent$ of an $r -by- r$ matrix $A = (A_{i j}),$ denoted by ${∣ A ∣}^{(λ)},$ is defined as follows: ${∣ A ∣}^{(λ)} = \sum_{σ \in S_{r}} k^{(λ)} (σ) \cdot \prod_{i = 1}^{r} A_{i, σ (i)} .$ The Schur function $e_{λ}$ is then defined by $r! \cdot e_{λ} = {∣ Z_{r} ∣}^{(λ)},$ where $Z_{r}$ is the matrix of power sums defined previously. We immediately note two special cases:

1)	If $λ = (1, 1, 1, \dots, 1)$ has weight $r,$ then ${∣ A ∣}^{(λ)} = det (A),$ so $e_{(1, 1, 1, \dots, 1)} = det (Z_{r}) / r! = a_{r} .$
2)	If $λ = (r, 0, 0, \dots, 0),$ then ${∣ A ∣}^{(λ)} = per (A),$ so $e_{(r, 0, 0, \dots, 0)} = per (Z_{r}) / r! = h_{r} .$

We list some classical theorems concerning Schur functions.

The set ${e_{λ} | λ \in Π}$ forms a $Z -basis$ for the $Z -module$ $A .$

(Jacobi-Trudi) If $λ$ is a partition of length $r,$ then $e_{λ} = det (H_{i j}),$ where $H_{i j}$ is the $r -by- r$ matrix given by $H_{i j} = h_{λ_{i} - i + j} .$

(Naegelsbach, Aitken) If $λ$ is a partition of length $r,$ then $e_{λ} = det (A_{i j}),$ where $A_{i j}$ is the $r -by- r$ matrix given by $A_{i j} = a_{{\tilde{λ}}_{i} - i + j},$ and $\tilde{λ}$ is the dual of $λ .$

As the reader might infer, identities 4.2 and 4.3 were proved for a definition of Schur functions other than that given by Schur. Classically, these results were obtained in the ring $A_{N}$ of symmetric polynomials in $N$ variables $x_{1}, x_{2}, \dots, x_{N} .$ In the ring $A_{N},$ definitions exist for the elementary symmetric functions, complete homogeneous symmetric functions, power sums, and Schur functions which are analogous to the definitions of $a_{r}, h_{r}, S_{r},$ and $e_{λ}$ in the ring $A .$ We will use the symbols $a_{r}, h_{r}, S_{r},$ and $e_{λ}$ to denote elements of $A_{N}$ as well as $A .$

In the ring $A_{N},$ another definition of $e_{λ}$ is possible (and in fact was the one used in the classical proofs of 4.2 and 4.3). Let $λ$ be a partition of length at most $N,$ and define $Δ_{λ} = det (X_{i j}),$ where $X_{i j} = x_{j}^{λ_{i} + N - i} .$ We then have

(Jacobi) $e_{λ} = \frac{Δ_{λ}}{Δ (0, 0, \dots, 0)}$

The quotients $\frac{Δ_{λ}}{Δ (0, 0, \dots, 0)}$ are referred to by Muir as bi-alternants.

Identity 4.4 holds in the ring $A_{N}$ but has no meaning for $A .$ Identities 4.2 and 4.3 hold in $A_{N}$ as well as $A .$ Statement 4.1 holds as stated for $A;$ but in $A_{N},$ $e_{λ} = 0$ if $λ$ has length more than $N .$ However, the set ${e_{λ} | weight λ \leq N}$ is a basis for $A_{N} .$

The mapping $h_{r} \to a_{r}$ extends to a ring automorphism $D$ of $A$ for which $D (e_{λ}) = e_{\tilde{λ}} .$

Proof.

Since $A ≅ Z [h_{1}, h_{2}, \dots] ≅ Z [a_{1}, a_{2}, \dots],$ any bijection from ${h_{1}, h_{2}, \dots}$ to ${a_{1}, a_{2}, \dots}$ extends to an automorphism of $A .$ From identities 4.2 and 4.3, $\begin{array}{rcl} D (e_{λ}) & = & D (det (h_{λ_{i} - i + j})) \\ = & det (D (h_{λ_{i} - i + j})) \\ = & det (a_{λ_{i} - i + j}) \\ = & e_{\tilde{λ}} . \end{array}$

$□$

The Littlewood-Richardson Rule

The Schur functions constitute a $Z -basis$ for $A;$ hence any product $e_{λ} \cdot e_{μ}$ may be expressed as a sum $\sum_{ν \in Π} c_{λ μ ν} \cdot e_{ν} .$ There is a certain well-studied rule for determining the integers $c_{λ, μ, ν} .$ Before describing the rule, we make a definition.

Let the sequence consisting of $μ_{1}$ $1's,$ $μ_{2}$ $2's,$ $\dots,$ $μ_{k}$ $k's,$ $\dots$ be denoted by $(1^{μ_{1}}, 2^{μ_{2}}, \dots, k^{μ_{k}}) .$ Suppose $(α_{1}, α_{2}, \dots)$ is a rearrangement of the sequence $(1^{μ_{1}}, 2^{μ_{2}}, \dots)$ such that for any pair of indices $i, i + 1$ the number of $i's$ appearing among $α_{1}, α_{2}, \dots, α_{j}$ is not less than the number of $(i + 1)'s$ appearing among $α_{1}, α_{2}, \dots, α_{j} .$ Then $(α_{1}, α_{2}, \dots)$ is called a lattice permutation of $(1^{μ_{1}}, 2^{μ_{2}}, \dots) .$

Example: $(1, 2, 3, 1, 4, 2)$ is a lattice permutation of $(1^{2}, 2^{2}, 3, 4),$ but $(1, 2, 3, 2, 1, 4)$ is not.

(The Littlewood-Richardson (LR) rule) Let $λ, μ,$ and $ν$ be partitions. The coefficient of $e_{ν}$ in the product $e_{λ} \cdot e_{μ}$ is equal to zero unless $λ \leq ν .$ If $λ \leq ν,$ it is equal to the number of ways of filling the squares of the skew-diagram $Y (\frac{ν}{λ})$ if $μ_{1}$ $1's,$ $μ_{2}$ $'s,$ $\dots$ subject to the following two conditions:

LR1:	The inserted numbers are non-decreasing in each row and strictly increasing in each column.
LR2:	The sequence $(α_{1}, α_{2}, \dots)$ obtained by reading from right to left across the first row, next right to left across the second row, etc., must be a lattice permutation of $(1^{μ_{1}}, 2^{μ_{2}}, \dots) .$

Example: Using the LR rule, one can compute that $e_{(2, 1)} \cdot e_{(2, 1)} = 2 e_{(3, 2, 1)} + e_{(4, 2)} + e_{(2, 2, 1, 1)} + e_{(3, 3)} .$

The LR rule was first stated in 1934 by D. E. Littlewood and A. R. Richardson ([Ll-R (Not included in Bibliography)]). The first proof is attributed to G. de B. Robinson in 1938, but that presentation is difficult to follow. Littlewood has offered proofs in [L1 (Not included in Bibliography)] and [L2 (Not included in Bibliography)] which seem nearly correct, but McConnell [McC1975] has pointed out problems in those efforts. Schutzenberger [Sch1976] has developed elegant (but non-elementary) combinatorial machinery which yields the LR rule as a corollary of a more general result. G. D. James has recently obtained the LR rule as a generalization of a well known result in the representation theory of the symmetric group [Jam1973]. The methods to be developed later in this section can be used to "fix" Littlewood's original argument and provide a fairly simple proof of the LR rule.

As a consequence of 4.7, we have the following

Let $λ = (λ_{1}, λ_{2}, \dots, λ_{k})$ be a partition of weight $N .$ Then $e_{λ} \cdot h_{r} = \sum_{ν} e_{ν},$ where $ν$ ranges over all partitions of weight $N + r$ and length at most $k + 1$ which satisfy $\begin{matrix} ν_{1} \geq λ_{1} \geq ν_{2} \geq λ_{2} \geq \dots \geq ν_{k} \geq λ_{k} \geq ν_{k + 1}, & (*) \end{matrix}$ i.e. $λ$ must interlace $ν .$

Proof.

Recall that $h_{r} = e_{(r, 0, \dots, 0)} .$ By the LR rule, $e_{λ} \cdot h_{r} = \sum c_{ν} e_{ν},$ where $c_{ν}$ equals the number of ways of inserting $r$ $1's$ into the skew diagram $Y (\frac{ν}{λ}),$ subject to the LR conditions. We must show that $c_{ν} = 1$ if $ν$ is of the form (*) and zero otherwise.

i)	If $λ ≰ ν$ or $weight (ν) \neq N + r,$ then $c_{ν} = 0$ is automatic.
ii)	If the skew diagram $Y (\frac{ν}{λ})$ has more than one square in any column, then any placement of $r$ 1's into $Y (\frac{ν}{λ})$ must violate condition LR1. This is the case if $ν$ has length greater than $k + 1$ or if $λ$ does interlace $ν;$ hence in these cases, $c_{ν} = 0 .$
iii)	If cases i) and ii) are excluded, then $λ \leq ν,$ $weight (ν) = N + r$ and $λ$ interlaces $ν .$ Then the obvious (and only) placement of $r$ 1's into $Y (\frac{ν}{λ})$ satisfies LR1 because all columns have one square only and this placement satisfies LR2 because only one symbol ("1") is involved. Hence $c_{ν} = 1 .$

$□$

We are ready to present a theorem which connects the multiplication of Schubert cycles in $A (G_{d, n})$ with the multiplication of Schur functions in $A .$ The connection is made with the help of a certain ideal in $A .$ Given $1 \leq d < n,$ let $p$ be the partition of $(n - d) d$ with $d$ parts equal to $n - d,$ and let $I_{d n}$ be the $Z -submodule$ of $A$ generated by all $e_{μ}$ for which $μ ≰ p .$ In other words, either $μ_{1} > n - d$ or $μ$ has more than $d$ parts. It is easy to verify using the LR rule that $I_{d n}$ is an ideal in $A .$ The quotient $A / I_{d n}$ is generated freely as a $Z -module,$ with basis ${e_{λ} | λ \leq p} .$ Products in $A / I_{d n}$ may be computed using the LR rule and ignoring terms involving those $e_{μ}$ in $I_{d n} .$ The reader may now anticipate that $A (G_{d, n})$ and $A / I_{d n}$ are isomorphic. We can describe an isomorphism using the map $θ_{d, n} : Π ⟶ Q_{d, n}$ defined by $θ_{d, n} (λ) = {\begin{cases} \emptyset, & if λ ≰ p, \\ n - d + 1 - λ_{1}, n - d + 2 - λ_{2}, \dots, n - λ_{d}, & otherwise. \end{cases}$ We note that in this instance $\emptyset$ refers to the empty sequence belonging to $Q_{d, n} .$ By convention, we shorten $θ_{d, n}$ to $θ$ where no ambiguity exists.

The rings $A / I_{d n}$ and $A (G_{d, n})$ are isomorphic. An isomorphism is given by $e_{λ} + I_{d n} ⟶ Ω (θ (λ))$ and linear extension.

Proof.

Let $T$ be the map from $A$ to $A (G_{d, n})$ defined by $T (e_{λ}) = Ω (θ (λ))$ and linear extension. We need only show that $T$ is a ring homomorphism onto $A (G_{d, n})$ with kernel $I_{d n} .$ Since $θ$ is a map onto $Q_{d, n},$ $T$ maps onto $A (G_{d, n}) .$ It is easy to see that the kernel of $T$ is $I_{d n} .$ Since the special Schur functions $h_{1}, h_{2}, \dots$ freely generated $A$ as a ring, $T$ is multiplicative if for each pair $i, j$ $T (h_{i} \cdot h_{j}) = T (h_{i}) \cdot T (h_{j}) .$ Let $Π_{i j} = {ν \in Π | \begin{array}{ll} 1) & length (ν) = 2, \\ 2) & ν_{1} \geq i \geq ν_{2}, \\ 3) & weight (ν) = i + j \end{array}} .$ We have $\begin{array}{rcl} T (h_{i} \cdot h_{j}) & = & T (\sum_{ν \in Π_{i j}} e_{ν}) \\ = & \sum_{ν \in Π_{i j}} Ω (θ (ν)) \\ = & \sum_{a \in θ (Π_{i j})} Ω (a) \\ = & σ (i) \cdot σ (j) \\ = & T (h_{i}) \cdot T (h_{j}) . \end{array}$

$□$

The connection described in Theorem 4.9 was first noticed by Lesieur [Le (Not included in Bibliography)] in 1947.

The rings $A (G_{d, n})$ and $A (G_{n - d, n})$ are isomorphic. An isomorphism is given by $Ω (a) \to Ω (\tilde{a})$ and linear extension.

Proof.

This follows from Theorem 4.5 and the fact that $\tilde{θ (λ)} = θ (\tilde{λ}) .$

$□$

(Duality for $V_{d}^{n})$ $(a, b, c) \in V_{d}^{n} ≅ (\tilde{a}, \tilde{b}, \tilde{c}) \in V_{n - d}^{n}$

Proof.

This follows from Theorems 4.10 and 3.9.

$□$

Array formulation of the LR Rule

The Littlewood-Richardson rule, as originally states, requires deft combinatorial reasoning in applications. In this section we present a new formulation of the LR rule which facilitates a more geometrical type of argument.

Let $e_{λ} \cdot e_{μ} = \sum c_{λ μ ν} e_{ν}$ and set $d = max (length (λ), length (μ), length (ν)) .$ Then $c_{λ, μ, ν}$ equals the number of $d -by- d$ integer matrices $[n_{i j}]$ satisfying the following system of linear inequalities:

LRA1:	$n_{i j} \geq 0$ for $1 \leq i, j \leq d .$
LRA2:	$\sum_{i = 1}^{d} n_{i j} = μ_{i},$ $i = 1, 2, \dots, d .$
LRA3:	$\sum_{i = 1}^{d} n_{i j} = ν_{j} - λ_{j},$ $j = 1, 2, \dots, d .$
LRA4:	$λ_{j} + \sum_{i = 1}^{t} n_{i j} \geq λ_{j + 1} + \sum_{i = 1}^{t + 1} n_{i, j + 1},$ $1 \leq i \leq d - 1,$ $0 \leq t \leq d - 1 .$
LRA5:	$\sum_{j = 1}^{t} n_{i j} \geq \sum_{j = 1}^{t + 1} n_{i + 1, j},$ $1 \leq i \leq d - 1,$ $0 \leq t \leq d - 1 .$

\begin{matrix} \begin{matrix} λ_{1} & λ_{2} & \dots & λ_{d} \end{matrix} \\ \begin{matrix} 0 \\ 0 \\ 0 \\ ⋮ \\ ⋮ \\ 0 \end{matrix} & \begin{matrix} n_{11} \\ ⋱ & n_{i j} \\ n_{i i} \\ ⋱ \\ ⋱ \\ 0 & n_{d d} \end{matrix} & \begin{matrix} μ_{1} \\ μ_{2} \\ ⋮ \\ ⋮ \\ ⋮ \\ μ_{d} \end{matrix} \\ \begin{matrix} ν_{1} & ν_{2} & \dots & ν_{d} \end{matrix} \end{matrix}

Proof.

First we dispose of the case $λ ≰ ν .$ By the LR rule, $c_{λ μ ν} = 0;$ this agrees with the theorem, since there are no integer matrices which could satisfy LRA1 and LRA3.

Next suppose $weight (λ) + weight (μ) \neq weight (ν) .$ By the LR rule, $c_{λ μ ν} = 0;$ this agrees with the theorem, since there are no matrices satisfying conditions LRA2 and LRA3 for which $weight (λ) + weight (μ) \neq weight (ν) .$

Now suppose $λ \leq ν$ and $weight (λ) + weight (μ) = weight (ν) .$ The theorem will be verified if we can establish a one-to-one correspondence between the ways of filling the squares of $Y (\frac{ν}{λ})$ with $μ_{1}$ 1's, $μ_{2}$ 2's, $\dots$ which conform to the LR restrictions, and the integer matrices $[n_{i j}]$ satisfying LRA1 through LRA5. The following is such a correspondence.

Given a way of filling $Y (\frac{ν}{λ}),$ let $n_{i j} = the number of symbols ' i' placed in row j of Y (\frac{ν}{λ}) .$ Clearly this defines a one-to-one correspondence between the set of ways of filling $Y (\frac{ν}{λ})$ and the set of integer arrays satisfying LRA1, LRA2, and LRA3. It remains to show that if an arbitrary way of filling $Y (\frac{ν}{λ})$ satisfies the two LR conditions, then the corresponding array satisfies LRA4 and LRA5, and conversely.

Suppose that $[n_{i j}]$ corresponds to a way of filling $Y (\frac{ν}{λ})$ which satisfies LR1 and LR2. Then $[n_{i j}]$ must satisfy LRA4 and LRA5, as the following two arguments show.

1) Suppose that LRA4 fails to hold for $[n_{i j}] .$ Then there is some $j_{0}$ that $λ_{j_{0}} + \sum_{i = 1}^{t_{0}} n_{i j_{0}} < λ_{j_{0} + 1} + \sum_{i = 1}^{t_{0} + 1} n_{i j_{0} + 1}$ for some $t_{0} .$ We assume that $j_{0}$ is the minimal such index, and that $t_{0}$ is the smallest is the smallest possible index corresponding to $j_{0} .$

Let $k_{1} = λ_{j_{0}} + \sum_{i = 1}^{t_{0}} n_{i j_{0}}$ and $k_{2} = λ_{j_{0} + 1} + \sum_{i = 1}^{t_{0} + 1} n_{i j_{0} + 1} .$ By assumption, $k_{1} < k_{2} .$ Now in row $j_{0}$ of $Y (\frac{ν}{λ}),$ the placement of 1's, 2's, $\dots,$ $t_{0}'s$ extends out to column $k_{1}$ at most; hence the square in row $j_{0},$ column $k_{2}$ of $Y (\frac{ν}{λ})$ contains a symbol greater than or equal to $t_{0} + 1 .$ However, the square in row $j_{0} + 1,$ column $k_{2}$ contains precisely the symbol $t_{0} + 1$ (because $n_{t_{0} + 1, j_{0} * 1}$ can not be zero by the minimality of $j_{0}$ and $t_{0}).$ But then this way of filling $Y (\frac{ν}{λ})$ does not satisfy LR1 ("symbols in a given column must be strictly increasing"), a contradiction.

2) Suppose LRA5 fails to hold for $[n_{i j}] .$ Then there exist indices $i_{0}, t_{0}$ such that $\sum_{j = 1}^{t_{0}} n_{i_{0} j} < \sum_{j = 1}^{t_{0} + 1} n_{i_{0} + 1, j} .$ We may assume $i_{0}$ is the least such index, and $t_{0}$ the least possible index corresponding to $i_{0} .$ In this case we must have $n_{i_{0} + 1, t_{0} + 1} \neq 0 .$ Let $k_{1} = \sum_{j = 1}^{t_{0}} n_{i_{0} j}$ and $k_{2} = \sum_{j = 1}^{t_{0} + 1} n_{i_{0} + 1, j} .$ Now for the way of filling $Y (\frac{ν}{λ})$ given by $[n_{i j}],$ let the sequence $(α_{1}, α_{2}, \dots)$ be obtained by reading the symbols $Y (\frac{ν}{λ})$ from right to left in the first, second, $\dots,$ $d^{th}$ rows. Since $n_{i_{0} + 1, t_{0} + 1} \neq 0,$ the symbol $i_{0} + 1$ occurs in row $t_{0} + 1$ of $Y (\frac{ν}{λ}) .$ Let the rightmost occurrence of $i_{0} + 1$ in row $t_{0} + 1$ appear in the sequence $(α_{1}, α_{2}, \dots)$ as $α_{N} .$ Then the number of occurrences of $i_{0} + 1$ among $α_{1}, α_{2}, \dots, α_{N}$ is $k_{2},$ while the number of occurences of $i_{0}$ among $α_{1}, α_{2}, \dots, α_{N}$ is $k_{1} .$ But $k_{1} < k_{2}$ contradicts the assumption that the given way of filling $Y (\frac{ν}{λ})$ satisfies LR2.

Arguments 1) and 2) may be more readily understood by working out a few examples. A reversal of the arguments shows that for any integer matrix satisfying LRA1–LRA5, the corresponding way of filling $Y (\frac{ν}{λ})$ satisfies LR1 and LR2. This establishes the one-to-one correspondence and completes the proof of Theorem 4.12.

$□$

An integer matrix $[n_{i j}]$ satisfying conditions LRA1 through LRA2, with respect to partitions $λ, ν,$ and $ν,$ will be called a $(λ, μ, ν)$ Littlewood-Richardson array $((λ, μ, ν) -LRA).$ Theorem 4.12 shows that if a $(λ, μ, ν) -LRA$ exists, then the coefficient $c_{λ μ ν}$ is non-zero. In order to determine more about the existence of LRA's for a given triple $(λ, μ, ν),$ we consider a slightly more general object.

Definition An element $(λ, μ, ν, N)$ belonging to $ℝ^{d} \times ℝ^{d} \times ℝ^{d} \times ℝ^{d \times d}$ will be called a Littlewood-Richardson design (LRD) if the following conditions hold:

1)	$λ_{1} \geq λ_{2} \geq \dots \geq λ_{d} \geq 0,$ $μ_{1} \geq μ_{2} \geq \dots \geq μ_{d} \geq 0,$ $ν_{1} \geq ν_{2} \geq \dots \geq ν_{d} \geq 0 .$
2)	$N = [n_{i j}]$ satisfies conditions LRA1-LRA5.

Comments:

i)	If $(λ, μ, ν, N)$ is an LRD which is integral, then $N$ is a $(λ, μ, ν) -LRA.$
ii)	If $N$ is a $(λ, μ, ν) -LRA,$ then $(λ, μ, ν, N)$ is an integral LRD.

We now make a useful observation.

The set of Littlewood-Richardson designs in $ℝ^{d} \times ℝ^{d} \times ℝ^{d} \times ℝ^{d \times d}$ is a closed convex cone.

Proof.

The theorem follows from the fact that the set of LRD's is defined by a system of homogeneous linear inequalities.

$□$

The following theorem is fundamental.

Let $(λ, μ, ν)$ be partitions of length $d,$ $d \leq 4 .$ If there exists a $(λ, μ, ν) -LRD,$ then there exists a $(λ, μ, ν) -LRA.$

Proof.

Let $(λ, μ, ν, N)$ be a $(λ, μ, ν) -LRD;$ we show how to successively "perturb" $N$ so that at each step we have a $(λ, μ, ν) -LRD$ and the final perturbed matrix is integral.

Let $N = (n_{i j}), 1 \leq i, j \leq d .$ Condition LRA5 guarantees that $N$ is upper triangular. This makes the result trivial if $d = 1$ or $d = 2 .$ For $d = 3,$ the only non-integral elements of $N$ must lie in the submatrix $N [1, 2 | 2, 3] .$ The presence of fractions means that there is some "play" in the inequalities LRA4 and LRA5. If we "rotate" the submatrix $N [1, 2 | 2, 3]$ (i.e. add $ξ$ to $n_{12},$ subtract $ξ$ from $n_{13},$ add $ξ$ to $n_{23},$ and subtract $ξ$ from $n_{22})$ the right amount, we obtain an integral matrix, and LRA1-LRA5 still hold.

Now consider the case $d = 4 .$ Since $N$ is upper triangular, $n_{11}$ and $n_{d d}$ are integral. We perturb $N$ until $n_{33}$ is integral as follows: decrease $n_{22}$ and $n_{33}$ by $ξ,$ increase $n_{12}, n_{23},$ and $n_{34}$ by $ξ,$ and decrease $n_{14}$ by $ξ,$ where $ξ = min {n_{14}, n_{33} - [n_{33}]} .$ As a result, either $n_{33}$ is integral or $n_{14}$ is zero.

Case 1. $n_{14} = 0$
Perturb $N$ further by decreasing $n_{22}$ and $n_{33}$ by $ξ;$ increasing $n_{12},$ $n_{23},$ and $n_{34}$ by $ξ$ and then increasing $n_{23}$ by another $ξ;$ and decreasing $n_{13}$ and $n_{24}$ by $ξ,$ where this time $ξ = min {n_{13}, n_{24}, n_{33} - [n_{33}]} .$ One possible result is that $n_{13} = 0 .$ Then $n_{12}$ and $n_{22}$ are integral, so a rotation of $N [2, 3 | 3, 4]$ yields a $(λ, μ, ν) -LRA.$ The other possible result is that $n_{24} = 0$ or $n_{33}$ is integral; in either case $n_{33},$ $n_{34},$ $n_{24},$ and $n_{14}$ are all integral, so a simple rotation of $N [1, 2 | 2, 3]$ yields a $(λ, μ, ν) -LRA.$

Case 2. $n_{33}$ is integral.
Then $n_{34}$ is also integral, so we rotate $N [1, 2 | 3, 4]$ until an integral element results, and still have a $(λ, μ, ν) -LRD.$ The last non-integral elements all lie in some 2-by-2 submatrix; we again rotate to remove them, and wind up with a $(λ, μ, ν) -LRA.$ This ends the proof.

$□$

Theorem 4.14 removes a difficult constraint in verifying the existence of a $(λ, μ, ν) -LRA;$ i.e. the existence of any real matrix satisfying conditions LRA1-LRA5 is enough to guarantee the existence of an integral matrix satisfying those conditions. Relaxing this constraint allows the use of standard arguments from convexity theory.

Additional Characterizations of $V_{d}^{n}$

We can use several of the preceding results to formulate new characterizations for the set $V_{d}^{n},$ introduced in section II. Theorem 3.9 states that $V_{d}^{n} = {(a, b, c) | \begin{array}{ll} 1) & a, b, c \in Q_{d, n}, \\ 2) & Ω (a) \cdot Ω (b) \cdot Ω (c) \neq 0 \end{array}} .$ Here $Ω (a), Ω (b),$ and $Ω (c)$ are Schubert cycles (elements of the ring $A (G_{d, n})).$ This definition of $V_{d}^{n}$ is based on a multiplication statement in the ring $A (G_{d, n}) .$ We know from Theorem 4.9 that $A (G_{d, n})$ is ring isomorphic to $A / I_{d n} .$ Using this isomorphism, we can translate the condition $Ω (a) \cdot Ω (b) \cdot Ω (c) \neq 0$ in $A (G_{d, n})$ to an equivalent condition in $A / I_{d n} .$ Application of the Littlewood-Richardson rule then gives information about elements of $V_{d}^{n} .$

Suppose $Ω (a) \cdot Ω (b) \cdot Ω (c) = n \cdot Ω (1, 2, \dots, d)$ and $Ω (a) \cdot Ω (b) = \sum_{s \in Q_{d, n}} n_{s} \cdot Ω (s) .$ Then $n_{\overline{c}} = n .$

Proof.

Easy consequence of Theorem 3.7.

$□$

Recall the definition of the map $θ_{d, n}$ which maps the set of partitions $Π_{d}$ onto the set $Q_{d, n} :$ $θ_{d, n} (λ) = {\begin{cases} \emptyset, & if λ ≰ p, \\ n - d + 1 - λ_{1}, n - d + 2 - λ_{2}, \dots, n - λ_{d}, & otherwise. \end{cases}$ We can define a one-sided inverse for $θ_{d, n}$ which maps elements of $Q_{d, n}$ into the set of partitions $Π .$ We will call this one-sided inverse $θ_{d, n}^{- 1}$ and define it by $θ_{d, n}^{- 1} (a) = (n - d + 1 - a_{1}, n - d + 2 - a_{2}, \dots, n - a_{d}) .$ By convention, $θ^{- 1} = θ_{d, n}^{- 1} .$

Let $Ω (a) \cdot Ω (b) = \sum_{s \in Q_{d, n}} n_{s} \cdot Ω (s)$ and $e_{θ^{- 1} (a)} \cdot e_{θ^{- 1} (b)} = \sum_{ν} m_{ν} \cdot e_{ν} .$ Then $n_{s} = m_{θ^{- 1} (s)} .$

Proof.

The isomorphism from $A / I_{d n}$ to $A (G_{d, n})$ given by $e_{λ} + I_{d n} ⟶ Ω (θ (λ))$ has an inverse given by $Ω (a) ⟶ e_{θ^{- 1} (a)} I_{d n} .$ We are given that $Ω (a) \cdot Ω (b) = \sum_{s} n_{s} \cdot Ω (s)$ in $A (G_{d, n}) .$ Applying $θ^{- 1}$ to each side yields $(e_{θ^{- 1} (a)} + I_{d n}) (e_{θ^{- 1} (b)} + I_{d n}) = \sum_{s} n_{s} \cdot (e_{θ^{- 1} (s)} + I_{d n})$ or $\begin{matrix} e_{θ^{- 1} (a)} \cdot e_{θ^{- 1} (b)} + I_{d n} = \sum_{s \in Q_{d, n}} n_{s} \cdot e_{θ^{- 1} (s)} + I_{d n} . & (*) \end{matrix}$ In the ring $A,$ we have $\begin{array}{rcl} e_{θ^{- 1} (a)} \cdot e_{θ^{- 1} (b)} & = & \sum_{ν \in Π} m_{ν} e_{ν} \\ = & \sum_{ν \leq p} m_{ν} e_{ν} + \sum_{ν ≰ p} m_{ν} e_{ν} . \end{array}$ Mapping this equation into $A / I_{d n}$ yields $e_{θ^{- 1} (a)} \cdot e_{θ^{- 1} (b)} + I_{d n} = \sum_{ν \leq p} m_{ν} e_{ν} + I_{d n}$ or $\begin{matrix} e_{θ^{- 1} (a)} \cdot e_{θ^{- 1} (b)} + I_{d n} = \sum_{s \in Q_{d, n}} m_{θ^{- 1} (s)} \cdot (e_{θ^{- 1} (s)} + I_{d n}) . & (**) \end{matrix}$ Comparing (*) to (**) shows that $n_{s} = m_{θ^{- 1} (s)} .$

$□$

$\begin{matrix} Ω (a) \cdot Ω (b) \cdot Ω (c) = n \cdot Ω (1, 2, \dots, d) \\ for some non-zero n \end{matrix} ⟺ there exists a (θ^{- 1} (a), θ^{- 1} (b), θ^{- 1} (\overline{c})) -LRA.$

Proof.

The theorem follows in a straightforward manner from lemmas 4.15 and 4.16 and Theorem 4.12.

$□$

We now have two methods of testing whether a product $Ω (a) \cdot Ω (b) \cdot Ω (c),$ known to be a multiple of $Ω (1, 2, \dots, d),$ is non-zero:

1)	Compute (using the determinantal and Pieri's formula) $Ω (a) \cdot Ω (b) \cdot Ω (c)$ in the ring $A (G_{d, n});$
2)	Determine (using the LR rule) whether a $(θ^{- 1} (a), θ^{- 1} (b), θ^{- 1} (\overline{c})) -LRA$ exists.

Note that method 1 yields more information than we seek: it identifies the multiple exactly. Method 2 can also be used to find the multiple exactly, but with less work can simply tell whether or not the multiple is non-zero.

We introduce some notation relevant to the next theorem: For $a \in Q_{d, n}$ and $a' \in Q_{d, n'},$ define $a \oplus a'$ by $(a \oplus a') (i) = a_{i} + a_{i}^{'} - i (a \oplus a' \in Q_{d, n + n' - d}) .$

("generalized pushing lemma") If $(a, b, c) \in V_{d}^{n}$ and $(a', b', c') \in V_{d}^{n'},$ then $(a, b, c) \oplus (a', b', c') \in V_{d}^{n + n' - d} .$

Proof.

The proof is done by translating from sequences in $Q_{d, n}, Q_{d, n'},$ and $Q_{d, n + n' - d}$ to partitions in $Π_{d} .$ Let $(a, b, c) \in V_{d}^{n}$ and $(a', b', c') \in V_{d}^{n'} .$ Then $\begin{array}{rcl} Ω (a) \cdot Ω (b) \cdot Ω (c) & \neq & 0 in A (G_{d, n}), \\ Ω (a') \cdot Ω (b') \cdot Ω (c') & \neq & 0 in A (G_{d, n'}) . \end{array}$ First assume both these products are non-zero multiples of $Ω (1, 2, \dots, d)$ in $A (G_{d, n})$ and $A (G_{d, n'}),$ respectively. From Theorem 4.17, there exists a $(θ_{d, n}^{- 1} (a), θ_{d, n}^{- 1} (b), θ_{d, n}^{- 1} (\overline{c})) -LRA,$ (which we will denote by $N),$ and a $(θ_{d, n}^{- 1} (a'), θ_{d, n}^{- 1} (b'), θ_{d, n}^{- 1} (\overline{c}')) -LRA,$ (which we will denote by $N').$ From Theorem 4.13, $N + N'$ is an LRA.

Let $\begin{array}{rcl} λ & = & θ_{d, n + n' - d}^{- 1} (a \oplus a'), \\ μ & = & θ_{d, n + n' - d}^{- 1} (b \oplus b'), \\ ν & = & θ_{d, n + n' - d}^{- 1} (\overline{c \oplus c'}) . \end{array}$ Again by Theorem 4.17, we can show that $(a \oplus a', b \oplus b', c \oplus c') \in V_{d}^{n + n' - d}$ if there exists a $(λ, μ, ν) -LRA,$ and this will establish the theorem. We claim that $N + N'$ is a $(λ, μ, ν) -LRA.$ The verification amounts to the following exercise: $\begin{array}{rcl} [θ_{d, n}^{- 1} (a)] (i) + [θ_{d, n}^{- 1} (a')] (i) & = & (n - d + i - a_{i}) + (n' - d + i - a_{i}^{'}) \\ = & (n + n' - d) - d + i - (a_{i} + a_{i}^{'} - i) \\ = & (n + n' - d) - d + i - [a \oplus a'] (i) \\ = & [θ_{n + n' - d}^{- 1} (a \oplus a')] (i) . \end{array}$ This identity, together with the fact that $\overline{c \oplus c'} = \overline{c} \oplus \overline{c'},$ easily establishes the claim.

The theorem is now established in the case where the products of Schubert cycles are multiples of $Ω (1, 2, \dots, d) .$ If this is not the case, then by Theorem 3.10 we can find $\begin{array}{rcl} (x, y, z) & \leq & (a, b, c), \\ (x', y', z') & \leq & (a', b', c') \end{array}$ such that the products $Ω (x) \cdot Ω (y) \cdot Ω (z)$ and $Ω (x') \cdot Ω (y') \cdot Ω (z')$ are non-zero multiples of $Ω (1, 2, \dots, d) .$ Hence $(x, y, z) \oplus (x', y', z') \in V_{d}^{n + n' - d}$ by our previous result. Since $(x, y, z) \oplus (x', y', z') \leq (a, b, c) \oplus (a', b', c'),$ Theorem 2.11 guarantees the desired result.

$□$

In the next section, we will show that Theorem 4.18 is indeed a generalization of Horn's original "pushing" lemma.

Notes and References

This is an excerpt from Steven Andrew Johnson's 1979 dissertation The Schubert Calculus and Eigenvalue Inequalities for Sums of Hermitian Matrices.

page history