Chapter 1: Permutations - Notes on Schubert Polynomials

Notes on Schubert Polynomials
Chapter 1

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last update: 25 June 2013

Permutations

For each integer $n \geq 1,$ let $S_{n}$ denote the symmetric group of degree $n,$ that is to say the group of all permutations of the set $[1, n] = {1, 2, \dots, n} .$ Each $w \in S_{n}$ is a mapping of $[1, n]$ onto itself. As is customary, we write all mappings on the left of their arguments, so that the image of $i \in [1, n]$ under $w$ is $w (i) .$ We shall sometimes denote $w$ by the sequence $(w (1), w (2), \dots, w (n)) .$ Thus for example $(53214)$ is the element of $S_{5}$ that sends 1 to 5, 2 to 3, 3 to 2, 4 to 1 and 5 to 4.

For $i = 1, 2, \dots, n - 1$ let $s_{i}$ denote the transposition that interchanges $i$ and $i + 1,$ and fixes all other elements of $[1, n] .$ We have

\begin{matrix} (1.1) & {\begin{matrix} s_{i}^{2} = 1, \\ s_{i} s_{j} = s_{j} s_{i} & if | i - j | > 1, \\ s_{i} s_{i + 1} s_{i} = s_{i + 1} s_{i} s_{i + 1} & (1 \leq i \leq n - 2) . \end{matrix} \end{matrix}

Also, for each $w \in S_{n},$ let

I (w) = {(i, j) : 1 \leq i < j \leq n and w (i) > w (j)} .

We regard $I (w)$ as a subset of the square $Σ_{n} = [1, n] \times [1, n],$ and we shall adopt the convention of matrices, that in $Σ_{n}$ the first coordinate increases from north to south, and the second coordinate from west to east. The group $S_{n} \times S_{n}$ acts on $Σ_{n} : (u \times v) (i, j) = (u (i), v (j)) .$ In particular, $S_{n}$ acts diagonally: $w (i, j) = (w \times w) (i, j) = (w (i), w (j)) .$

Let $w \in S_{n},$ $1 \leq r \leq n - 1 .$ Then $w s_{r}$ is the permutation

(w (1), \dots, w (r + 1), w (r), \dots, w (n))

and it is clear that

\begin{matrix} (1.2) & I (w s_{r}) = {\begin{matrix} s_{r} I (w) \cup {(r, r + 1)} & if w (r) < w (r + 1), \\ s_{r} I (w) - {(r + 1, r)} & if w (r) > w (r + 1) . \end{matrix} \end{matrix}

Let $ℓ (w) = Card I (w) .$ Then from (1.2) we have

\begin{matrix} (1.3) & ℓ (w s_{r}) = {\begin{matrix} ℓ (w) + 1 & if w (r) < w (r + 1), \\ ℓ (w) - 1 & if w (r) > w (r + 1) . \end{matrix} \end{matrix}

(1.4) $s_{1}, \dots, s_{n - 1}$ generate the group $S_{n} .$

Proof.

We shall show by induction on $ℓ (w)$ that each $w \in S_{n}$ is a product of $s's.$ If $ℓ (w) = 0,$ then $w = 1$ and there is nothing to prove. If $ℓ (w) > 0$ then $w (r) > w (r + 1)$ for some $r,$ and hence $ℓ (w s_{r}) = ℓ (w) - 1$ by (1.3). Hence $w s_{r} = s_{a_{1}} \dots s_{a_{p}}$ say, and therefore (as $s_{r}^{2} = 1)$ $w = s_{a_{1}} \dots s_{a_{p}} s_{r} .$

$□$

For each $w \in S_{n} .$ the length of $w$ is the minimal length of a sequence $(a_{1}, \dots, a_{p})$ such that $w = s_{a_{1}} \dots s_{a_{p}} .$

(1.5) The length of $w \in S_{n}$ is equal to $ℓ (w) = Card I (w) .$

Proof.

Let $ℓ' (w)$ temporarily denote the length of $w .$ The proof of (1.4) shows that $w$ can be written as a word of length $ℓ (w)$ in the $s_{i},$ so that $ℓ' (w) \leq ℓ (w) .$ Conversely, let $w = s_{a_{1}} \dots s_{a_{p}}$ be any expression of $w$ as a product of $s_{i} .$ To show that $ℓ (w) \leq ℓ' (w)$ it is enough to show that $ℓ (w) \leq p .$ Let $w' = s_{a_{1}} \dots s_{a_{p - 1}};$ from (1.3) we have $ℓ (w) \leq ℓ (w') + 1$ and hence

ℓ (w') \leq p - 1 \Rightarrow ℓ (w) \leq p .

Hence the proof is completed by induction on $p .$

$□$

(1.6) Let $w \in S_{n} .$ Then

(i)	$ℓ (w) = 0$ if and only if $w = 1 .$
(ii)	$ℓ (w) = 1$ if and only if $w = s_{r} (1 \leq r \leq n - 1) .$
(iii)	$ℓ (w^{- 1}) = ℓ (w) .$
(iv)	Let $w_{0} = (n, n - 1, \dots, 2, 1) \in S_{n} .$ Then $ℓ (w_{0} w) = ℓ (w w_{0}) = \frac{1}{2} n (n - 1) - ℓ (w) .$

Proof.

(i), (ii) require no comment. Also (iii) is clear, since $w = s_{a_{1}} \dots s_{a_{p}}$ if and only if $w^{- 1} = s_{a_{p}} \dots s_{a_{1}} .$

(iv) The set $I (w_{0})$ consists of all $(i, j) \in Σ_{n}$ such that $i < j,$ so that $ℓ (w_{0}) = \frac{1}{2} n (n - 1) .$ Next, we have

w w_{0} = (w (n), w (n - 1), \dots, w (1))

so that $I (w w_{0})$ is the complement of $I (w)$ in $I (w_{0}),$ and therefore

ℓ (w w_{0}) = \frac{1}{2} n (n - 1) - ℓ (w) .

Finally, since $w_{0}^{2} = 1$ we have, by virtue of (iii) above,

\begin{matrix} ℓ (w_{0} w) & = & ℓ (w^{- 1} w_{0}) \\ = & \frac{1}{2} n (n - 1) - ℓ (w^{- 1}) \\ = & \frac{1}{2} n (n - 1) - ℓ (w) . \end{matrix}

$□$

The element $w_{0}$ is called the longest element of $S_{n} .$

For each $w \in S_{n}$ let $R (w)$ denote the set of all sequences $(a_{1}, \dots, a_{p})$ of length $p = ℓ (w)$ such that $w = s_{a_{1}} \dots s_{a_{p}} .$ Such sequences are called reduced words for $w .$ Clearly,

(a_{1}, \dots, a_{p}) \in R (w) ⟺ (a_{p}, \dots, a_{1}) \in R (w^{- 1}) .

(1.7) Let $(a_{1}, \dots, a_{p}) \in R (w) .$ Then

I (w) = {s_{a_{p}} \dots s_{a_{r + 1}} (a_{ρ}, a_{ρ} + 1) : 1 \leq r \leq p} .

Proof.

Let $w' = w s_{a_{p}} = s_{a_{1}} \dots s_{a_{p - 1}} .$ Then $ℓ (w') = p - 1$ and hence by (1.2) and (1.3) we have

I (w) = s_{a_{p}} I (w') \cup {(a_{p}, a_{p} + 1)}

from which (1.7) follows by induction on $p .$

$□$

(1.8) (Exchange Lemma). Let $(a_{1}, \dots, a_{p}), (b_{1}, \dots, b_{p}) \in R (w) .$ Then

(b_{1}, a_{1}, \dots, â_{i}, \dots, a_{p}) \in R (w) for some i = 1, 2, \dots, p .

Proof.

By (1.7), applied to $w^{- 1},$ we have $(b_{1}, b_{1} + 1) \in I (w^{- 1})$ and hence

(b_{1}, b_{1} + 1) = s_{a_{1}} \dots s_{a_{i - 1}} (a_{i}, a_{i} + 1)

for some $i = 1, \dots, p .$ It follows that

s_{b_{1}} = s_{a_{1}} \dots s_{a_{i - 1}} s_{a_{i}} {(s_{a_{1}} \dots s_{a_{i - 1}})}^{- 1},

so that $s_{b_{1}} s_{a_{1}} \dots s_{a_{i - 1}} = s_{a_{1}} \dots s_{a_{i}}$ and therefore

s_{b_{1}} s_{a_{1}} \dots ŝ_{a_{i}} \dots s_{a_{p}} = s_{a_{1}} \dots s_{a_{p}} = w .

$□$

(1.9) Let $w = s_{a_{1}} \dots s_{a_{r}}$ where $r > ℓ (w) .$ Then

w = s_{a_{1}} \dots ŝ_{a_{p}} \dots ŝ_{a_{q}} \dots s_{a_{r}}

for some pair $(p, q)$ such that $1 \leq p < q \leq r .$

Proof.

Since $ℓ (s_{a_{1}}) = 1$ and $ℓ (s_{a_{1}} \dots s_{a_{r}}) < r$ there exists $q \geq 2$ such that

ℓ (s_{a_{1}} \dots s_{a_{q - 1}}) = q - 1, ℓ (s_{a_{1}} \dots s_{a_{q}}) < q .

Let $v = s_{a_{1}} \dots s_{a_{q - 1}},$ so that $ℓ (v) = q - 1$ and $ℓ (v s_{a_{q}}) \leq q - 1,$ whence by (1.3) we have $ℓ (v s_{a_{q}}) = q - 2 .$ Let $(b_{1}, \dots, b_{q - 2})$ be a reduced word for $v s_{a_{q}},$ then $(b_{1}, \dots, b_{q - 2}, a_{q})$ and $(a_{1}, \dots, a_{q - 1})$ are reduced words for $v .$ By (1.8) (applied to $v^{- 1})$ it follows that $v = s_{a_{1}} \dots ŝ_{a_{p}} \dots s_{a_{q - 1}}$ for some $p = 1, 2, \dots, q - 1,$ and hence

w = v s_{a_{q}} \dots s_{a_{r}} = s_{a_{1}} \dots ŝ_{a_{p}} \dots ŝ_{a_{q}} \dots s_{a_{r}} .

$□$

If $i < j,$ let $t_{i j}$ denote the transposition that interchanges $i$ and $j$ and fixes each $k \neq i, j .$ For each permutation $w,$ let $e_{i j} (w)$ denote the number of $k$ such that $i < k < j$ and $w (k)$ lies between $w (i)$ and $w (j) .$ Consideration of $I (w)$ and $I (w t_{i j})$ shows that

\begin{matrix} (1.10) & ℓ (w t_{i j}) = {\begin{matrix} ℓ (w) - 2 e_{i j} (w) - 1 & if w (i) > w (j), \\ ℓ (w) + 2 e_{i j} (w) + 1 & if w (i) < w (j) . \end{matrix} \end{matrix}

In particular, $ℓ (w t_{i j}) = ℓ (w) \pm 1$ if and only if $e_{i j} = 0 .$

(1.11) Let $v, w$ be permutations and let $(a_{1}, \dots, a_{p})$ be a reduced word for $w .$ Then the following conditions are equivalent:

(i)	$ℓ (v) < ℓ (w)$ and $v^{- 1} w$ is a transposition,
(ii)	$v = s_{a_{1}} \dots ŝ_{a_{r}} \dots s_{a_{p}}$ for some $r = 1, 2, \dots, p .$

Proof.

(i) $\Rightarrow$ (ii). Suppose that $v^{- 1} w = t_{i j},$ so that $v = w t_{i j} .$ Then (1.10) shows that $w (i) > w (j),$ so that $(i, j) \in I (w) .$ Hence by (1.7) we have $(i, j) = s_{a_{p}} \dots s_{a_{r + 1}} (a_{r}, a_{r + 1})$ for some $r = 1, 2, \dots, p,$ and therefore

\begin{matrix} (1) & \begin{matrix} t_{i j} & = & (s_{a_{p}} \dots s_{a_{r + 1}}) s_{a_{r}} {(s_{a_{p}} \dots s_{a_{r + 1}})}^{- 1} \\ = & s_{a_{p}} \dots s_{a_{r + 1}} s_{a_{r}} s_{a_{r + 1}} \dots s_{a_{p}} . \end{matrix} \end{matrix}

Consequently

\begin{matrix} v = w t_{i j} & = & (s_{a_{1}} \dots s_{a_{p}}) (s_{a_{p}} \dots s_{a_{r}} \dots s_{a_{p}}) \\ = & s_{a_{1}} \dots ŝ_{a_{r}} \dots s_{a_{p}} . \end{matrix}

(ii) $\Rightarrow$ (i). Clearly $ℓ (v) < ℓ (w),$ and the calculation above shows that $v^{- 1} w$ is the transposition (1).

$□$

The Bruhat order

Let $v, w$ be permutations such that

(a)	$ℓ (w) = ℓ (v) + 1,$
(b)	$w = t v$ where $t$ is a transposition.

Since

t v = v t'

with

t' = v^{- 1} t v

also a transposition, we can replace (b) by

(b')

$w = v t'$ where $t'$ is also a transposition.

If (a) and (b) (or

(b'))

are satisfied we shall say that

w

covers

v

and write

v \to w .

(1.12) Let $v, w \in S_{n}$ and let $w_{0}$ be the longest element of $S_{n} .$ Then the following conditions are equivalent

(a)	$v \to w;$
(b)	$v^{- 1} \to w^{- 1};$
(c)	$w w_{0} \to v w_{0};$
(d)	$w_{0} w \to w_{0} v .$

This follows from the definition and (1.6)(iii), (iv).

(1.13) Let $(a_{1}, \dots, a_{p})$ be a reduced word for $w .$ Then $v \to w$ if and only if $v = s_{a_{1}} \dots ŝ_{a_{i}} \dots s_{a_{p}}$ for some $i = 1, 2, \dots, p$ such that $(a_{1}, \dots, â_{i}, \dots, a_{p})$ is reduced.

This follows from (1.11).

(1.14) Let $w$ be a permutation and let $i \geq 1 .$ Then either $w \to s_{i} w$ or $s_{i} w \to w .$ Moreover we have $s_{i} w \to w$ if and only if there is a reduced word for $w$ starting with $i .$

Proof.

The first statement follows from (1.3) and (1.6)(iii). If $s_{i} w \to w,$ let $(a_{1}, \dots, a_{p})$ be a reduced word for $s_{i} w;$ then $w = s_{i} s_{a_{1}} \dots s_{a_{p}}$ is a reduced expression for $w .$ Conversely if $w = s_{i} s_{a_{1}} \dots s_{a_{p}}$ is reduced, it is clear that $ℓ (s_{i} w) = ℓ (w) - 1,$ and hence $s_{i} w \to w .$

$□$

(1.15) Let $v, w$ be permutations and let $i \geq 1$ be such that

v \to s_{i} v \neq w .

Then $v \to w$ if and only if both $w \to s_{i} w$ and $s_{i} v \to s_{i} w .$

Proof.

Assume that $v \to w,$ and let $(a_{1}, \dots, a_{p})$ be a reduced word for $w .$ Suppose that $a_{1} = i .$ By (1.13) we have $v = s_{a_{1}} \dots ŝ_{a_{r}} \dots s_{a_{p}}$ for some $r .$ If $r = 1$ then $s_{i} v = s_{a_{1}} v = w,$ and if $r > 1$ then $s_{i} v = s_{a_{2}} \dots ŝ_{a_{r}} \dots s_{a_{p}}$ has length $< p - 1 = ℓ (v),$ so that $s_{i} v \to v$ by (1.14). Since both these possibilities are excluded by our hypothesis, we can conclude that $a_{1} \neq i .$ Hence (1.14) shows that $w \to s_{i} w .$ It follows that $s_{i} s_{a_{1}} \dots s_{a_{p}}$ is a reduced expression for $s_{i} w,$ and $s_{i} s_{a_{1}} \dots ŝ_{a_{r}} \dots s_{a_{p}}$ is one for $s_{i} v .$ Hence (1.13) shows that $s_{i} v \to s_{i} w .$

Conversely, assume that $w \to s_{i} w$ and $s_{i} v \to s_{i} w .$ As before, let $w = s_{a_{1}} \dots s_{a_{p}}$ be a reduced expression. Then $s_{i} w = s_{i} s_{a_{1}} \dots s_{a_{p}}$ is reduced, and since $s_{i} v \neq w$ it follows from (1.13) that $s_{i} v = s_{i} s_{a_{1}} \dots ŝ_{a_{r}} \dots s_{a_{p}}$ for some $r = 1, 2, \dots, p .$ Hence $v = s_{a_{1}} \dots ŝ_{a_{r}} \dots s_{a_{p}}$ and so $v \to w$ by (1.13) again.

$□$

The Bruhat order, denoted by $\leq,$ is the partial order on $S_{n}$ that is the transitive closure of the relation $\to .$ In other words, if $v$ and $w$ are permutations, $v \leq w$ means that there exists $r \geq 0$ and $v_{0}, v_{1}, \dots, v_{r}$ in $S_{n}$ such that

v = v_{0} \to v_{1} \to \dots \to v_{r} = w

(which implies that $ℓ (w) = ℓ (v) + r).$

(1.16) Let $v, w \in S_{n}$ and $i \geq 1$ be such that $s_{i} v \to v$ and $s_{i} w \to w .$ Then the following conditions are equivalent:

(i)	$v \leq w,$
(ii)	$s_{i} v < w,$
(iii)	$s_{i} v \leq s_{i} w .$

Proof.

(i) $\Rightarrow$ (ii). We have $s_{i} v < v \leq w,$ hence $s_{i} v < w .$

(ii) $\Rightarrow$ (i). By definition there exist $v_{0}, v_{1}, \dots, v_{m},$ where $m \geq 1,$ such that

s_{i} v = v_{0} \to v_{1} \to \dots \to v_{m} = w .

We have $v_{0} \to s_{i} v_{0}$ and $s_{i} v_{m} \to v_{m} .$ Hence there exists $k = 1, 2, \dots, m$ such that $v_{j} \to s_{i} v_{j}$ for $0 \leq j \leq k - 1,$ and $s_{i} v_{k} \to v_{k} .$

Suppose $1 \leq j \leq k - 1 .$ Then $v_{j - 1} \to s_{i} v_{j - 1}$ and $v_{j - 1} \to v_{j};$ also $v_{j} \neq s_{i} v_{j - 1},$ otherwise we should have $s_{i} v_{j} = v_{j - 1}$ and hence $s_{i} v_{j} \to v_{j} .$ Hence by (1.15) we have

\begin{matrix} (1) & s_{i} v_{j - 1} \neq s_{i} v_{j} (1 \leq j \leq k - 1) . \end{matrix}

Next, we have $v_{k - 1} \to s_{i} v_{k - 1}$ and $v_{k - 1} \to v_{k} .$ If $v_{k} \neq s_{i} v_{k - 1}$ we should by (1.15) have $v_{k} \to s_{i} v_{k},$ contradicting the definition of $k .$ Hence

\begin{matrix} (2) & v_{k} = s_{i} v_{k - 1} . \end{matrix}

From (1) and (2) it follows that

v = s_{i} v_{0} \to s_{i} v_{1} \to \dots \to s_{i} v_{k - 1} = v_{k} \to \dots \to v_{m} = w

and hence $v \leq w .$

This shows that (i) and (iii) are equivalent. To show that (ii) and (iii) are equivalent, assume that $v, w \in S_{n}$ for some $n \geq 1,$ let $w_{0}$ be the longest element of $S_{n},$ and replace $v, w$ respectively by $s_{i} w w_{0}$ and $s_{i} v w_{0} .$ Then we have

\begin{matrix} s_{i} v \leq s_{i} w & ⟺ & s_{i} w w_{0} \leq s_{i} v w_{0} & (by (1.12)) \\ ⟺ & w w_{0} < s_{i} v w_{0} & (by (i) ⟺ (ii)) \\ ⟺ & s_{i} v < w & (by (1.12) again) \end{matrix}

and the proof is complete.

$□$

(1.17) Let $v, w$ be permutations and let $a = (a_{1}, \dots, a_{p})$ be a reduced word for $w .$ Then the following conditions are equivalent:

(i)	$v \leq w;$
(ii)	there exists a subsequence $b = (b_{1}, \dots, b_{q})$ of $a$ such that $v = s_{b_{1}} \dots s_{b_{q}};$
(iii)	there exits a reduced subsequence $b = (b_{1}, \dots, b_{q})$ of $a$ such that $v = s_{b_{1}} \dots s_{b_{q}} .$

Proof.

It follows from (1.13) that (i) $\Rightarrow$ (iii), and from (1.9) that (ii) and (iii) are equivalent. Thus it remains to prove that (iii) $\Rightarrow$ (i).

We proceed by induction on $r = p + q = ℓ (v) + ℓ (w) .$ If $r = 0,$ we have $v = w = 1,$ so assume that $r \geq 1 .$ We distinguish two cases:

(a) $v \to s_{a_{1}} v .$ In this case we have $b_{1} \neq a_{1},$ hence $(b_{1}, \dots, b_{q})$ is a subsequence of $(a_{2}, \dots, a_{p}),$ which is a reduced word for $s_{a_{1}} w .$ By the inductive hypothesis we have $v \leq s_{a_{1}} w < w,$ hence $v < w .$

(b) $s_{a_{1}} v \to v .$ In this case $ℓ (s_{a_{1}} v) + ℓ (w) = p - 1 + q = r - 1,$ and $s_{a_{1}} v = s_{a_{1}} s_{b_{1}} \dots s_{b_{q}} .$ If $a_{1} = b_{1}$ we have $s_{a_{1}} v = s_{b_{2}} \dots s_{b_{q}},$ and if $a_{1} \neq b_{1}$ then $(a_{1}, b_{1}, \dots, b_{q})$ is a non-reduced subsequence of $(a_{1}, \dots, a_{p}) .$ Hence the inductive hypothesis implies that $s_{a_{1}} v < w .$ But also $s_{a_{1}} w \to w,$ hence $v \leq w$ by (1.16).

$□$

(1.18) Let $w \in S_{n}$ and let $t$ be a transposition. Then

ℓ (w t) < ℓ (w) \Rightarrow w t < w .

This follows from (1.11) and (1.17).

To recognize when two permutations are comparable for the Bruhat order, the following rule may be used. For each $w \in S_{n}$ let $K (w)$ denote the column-strict tableau (of shape $δ = (n - 1, n - 2, \dots, 1))$ whose $j^{th}$ column, for $1 \leq j \leq n - 1,$ consists of the numbers $w (1), \dots, w (n - j)$ arranged in increasing order from north to south.

(1.19) Let $v, w \in S_{n} .$ Then $v \leq w$ if and only if $K (v) \leq K (w)$ (i.e., each entry in $K (v)$ is less than or equal to the corresponding entry in $K (w)).$

Proof.

If $v \to w$ it is easily seen that $K (v) \leq K (w),$ and hence $v \leq w$ implies $K (v) \leq K (w) .$

Conversely, suppose that $K (v) \leq K (w)$ and let $j = j (v, w)$ be the smallest integer $\geq 1$ such that $v (j) \neq w (j) .$ (If $v = w$ we define $j (v, w) = n .)$ We proceed by descending induction on $j (v, w) .$ If $j (v, w) = n$ we have $v = w,$ so assume $j (v, w) = j < n .$ Then $w (j)$ is not equal to any $v (1), \dots, v (j)$ and hence is equal to $v (k)$ for some $k > j .$ For each $i < j$ the ${(n - i)}^{th}$ columns of $K (v)$ and $K (w)$ are identical, and since $K (v) \leq K (w)$ it follows that $v (j) < w (j),$ i.e. $v (j) < v (k) .$ Let $v' = v t_{j k},$ then by (1.10) we have $ℓ (v) < ℓ (v')$ and hence $v < v'$ by (1.18). Also $v' (i) = v (i) = w (i)$ for $i < j,$ and $v' (j) = v (k) = w (j)$ so that $j (v', w) > j .$ Hence $v' \leq w$ by the inductive hypothesis, and therefore $v < w .$

$□$

The diagram of a permutation

We may regard $I (w)$ as a "diagram" of $w \in S_{n} .$ However, for many purposes it is more convenient to define the diagram of $w$ to be

D (w) = (1 \times w) I (w) .

Thus we have $(i, j) \in D (w)$ if and only if $(i, w^{- 1} j) \in I (w);$ that is

\begin{matrix} (1.20) & (i, j) \in D (w) ⟺ i < w^{- 1} j and j < w i . \end{matrix}

Hence the points $(i, j)$ in the square $Σ_{n} = {[1, n]}^{2}$ not in $D (w)$ are those for which either $i \geq w^{- 1} j$ or $j \geq w i .$

The graph $G (w)$ of $w$ is the set of points $(i, w (i))$ $(1 \leq i \leq n),$ or equivalently $(w^{- 1} j, j)$ $(1 \leq j \leq n) .$ The complement of $D (w)$ in $Σ_{n}$ therefore consists of all the lattice points due south or due east of some point of $G (w),$ hence is the union of the hooks with corners at the points of $G (w) .$ For example, if $w = (365142)$ and $n = 6,$ the diagram $D (w)$ consists of the points circled in the picture below:

If $m > n,$ we shall identify $S_{n}$ with the subgroup of permutations $w \in S_{m}$ that fix $n + 1, n + 2, \dots, m .$ We may then form the group

S_{\infty} = ⋃_{n \geq 1} S_{n}

consisting of all permutations of the set of positive integers that fix all but a finite number of them.

The diagram $D (w)$ of $w \in S_{n}$ is unchanged by this identification of $S_{n}$ with the subgroup of $S_{\infty}$ fixing all $m > n,$ and hence is well-defined for all $w \in S_{\infty} .$ Also, it is clear from the definitions and (1.7) that

(1.21)

(i)	$D (w^{- 1})$ is the transpose of $D (w)$ (i.e., we have $(i, j) \in D (w^{- 1})$ if and only if $(j, i) \in D (w)).$
(ii)	$Card D (w) = ℓ (w) .$
(iii)	If $(a_{1}, \dots, a_{p}) \in R (w),$ then $D (w)$ consists of the lattice points $(s_{a_{p}} \dots s_{a_{r + 1}} (a_{r}), s_{a_{1}} \dots s_{a_{r - 1}} (a_{r}))$ for $r = 1, 2, \dots, p .$

In particular, it follows from (iii) above that

(1.22)

(i)	If $ℓ (w s_{r}) > ℓ (w),$ then $D (w s_{r}) = (s_{r} \times 1) D (w) \cup {(r, w r)} .$
(ii)	If $ℓ (s_{r} w) > ℓ (w),$ then $D (w s_{r}) = (1 \times s_{r}) D (w) \cup {(w^{- 1} r, r)} .$

The code of a permutation

Let $w \in S_{n},$ and for each $i \geq 1$ let

c_{i} (w) = Card {j : j > i and w (j) < w (i)} .

Thus $c_{i} (w)$ is the number of points in the $i^{th}$ row of $I (w),$ or equivalently the number of points in the $i^{th}$ row of $D (w) .$ The vector

c (w) = (c_{1} (w), \dots, c_{n} (w)) \in ℕ^{n}

is called the code of $w .$ As with partitions, we may disregard any string of zeros at the right-hand end of $c (w),$ and with this convention the code $c (w)$ (like the diagram $D (w))$ is unchanged by the embedding of $S_{n}$ in $S_{m}$ where $m > n$ and is well-defined for all $w \in S_{\infty} .$

The permutation $w$ may be reconstructed from its code $c (w) = (c_{1}, c_{2}, \dots)$ as follows :- for each $i \geq 1,$ $w (i)$ is the ${(c_{i} + 1)}^{th}$ element, in increasing order, of the sequence of positive integers from which $w (1), w (2), \dots, w (i - 1)$ have been deleted. The sum $| c | = c_{1} + c_{2} + \dots$ is equal to $ℓ (w) .$ Each sequence $c = (c_{1}, c_{2}, \dots)$ of non-negative integers such that $| c | < \infty$ occurs as the code of a unique permutation $w \in S_{\infty} .$

The length of $c (w)$ is the largest $r$ such that $c_{r} (w) \neq 0 .$ From the definition, $r$ is the last descent of the permutation $w,$ that is to say $w (r) > w (r + 1)$ and $w (r + 1) < w (r + 2) < \dots .$

(1.23)

(i)	If $ℓ (w s_{r}) > ℓ (w)$ (i.e., if $w (r) < w (r + 1))$ then $c (w s_{r}) = s_{r} c (w) + ε_{r},$ where $ε_{r}$ is the sequence with $1$ in the $r^{th}$ place and $0$ elsewhere.
(ii)	If $(a_{1}, \dots, a_{p}) \in R (w)$ then $c (w) = \sum_{i = 1}^{p} s_{a_{p}} \dots s_{a_{i + 1}} (ε_{a_{i}}) .$

Proof.

(i) follows from (1.21)(i), and (ii) follows from (i) by induction on $p .$

$□$

(1.24) Let $i \geq 1 .$ Then

c_{i} (w) > c_{i + 1} (w) ⟺ w (i) > w (i + 1) .

Proof.

Suppose that $w (i) > w (i + 1) .$ Then the ${(i + 1)}^{th}$ row of $I (w)$ is strictly contained in the $i^{th}$ row, whence $c_{i} (w) > c_{i + 1} (w) .$ Conversely, if $w (i) < w (i + 1),$ then the $i^{th}$ row of $I (w)$ is contained in the ${(i + 1)}^{th}$ row, so that $c_{i} (w) \leq c_{i + 1} (w) .$

$□$

To compute the code of $w^{- 1}$ in terms of the code $(c_{1}, c_{2}, \dots)$ of $w,$ we introduce the following notation. If $u = (u_{1}, u_{2}, \dots)$ is any sequence and $r$ is an integer $\geq 0,$ let

ζ_{r} u = (u_{1}, u_{2}, \dots, u_{r}, 0, u_{r + 1}, u_{r + 2}, \dots)

so that the operation $ζ_{r}$ introduces a zero after the $r^{th}$ place. Then we have

\begin{matrix} (1.25) & c (w^{- 1}) = \sum_{i \geq 1} ζ_{c_{1}} \dots ζ_{c_{i - 1}} (1^{c_{i}}) \end{matrix}

where $(1^{c_{i}})$ is the sequence consisting of $c_{i}$ 1's.

Proof.

By induction on the length of $c (w)$ it is enough to show that if $w_{1}$ is the permutation whose code is $(c_{2}, c_{3}, \dots)$ then

\begin{matrix} (1) & c (w^{- 1}) = (1^{c_{1}}) + ζ_{c_{1}} c (w_{1}^{- 1}) . \end{matrix}

Now the diagram of $w_{1}$ is obtained from that of $w$ by deleting the first row (of length $c_{1})$ and then moving each column after the $c_{1}^{th}$ one space to the left. On reading the diagrams of $w$ and $w_{1}$ by columns, we obtain (1).

$□$

The shape $λ (w)$ of a permutation $w$ is the partition whose parts are the non-zero $c_{i} (w),$ arranged in weakly decreasing order. We have

| λ (w) | = Card D (w) = ℓ (w) .

Next, recall that for two partitions $λ = (λ_{1}, λ_{2}, \dots)$ and $μ = (μ_{1}, μ_{2}, \dots)$ the relation $λ \geq μ$ means that $| λ | = | μ |$ and $λ_{1} + \dots + λ_{i} \geq μ_{1} + \dots + μ_{i}$ for all $i \geq 1$ [Mac1979, Ch.I], With this understood, the shapes of $w$ and $w^{- 1}$ are related by

(1.26) $λ (w') \geq λ (w^{- 1}) .$

Proof.

Let $λ = λ (w),$ $μ = λ (w^{- 1}) .$ Define a matrix $M = (m_{i j})$ as follows: $m_{i j} = 1$ if $(i, j) \in D (w),$ and $m_{i j} = 0$ otherwise. Then $M$ is a $(0, 1)$ matrix with row-sums $λ_{1}, λ_{2}, \dots$ in some order, and column-sums $μ_{1}, μ_{2}, \dots$ in some order. Hence (see e.g. [Mac1979, Ch.I, §6]) we have $λ' \geq μ .$

$□$

Vexillary permutations

Special interest attaches to those permutations $w \in S_{\infty}$ for which $λ (w)' = λ (w^{- 1}) .$ They may be characterized in various ways:

(1.27) The following conditions on a permutation $w \in S_{\infty}$ are equivalent:

(i)	the set of rows of $D (w)$ is totally ordered by inclusion;
$(i)'$	the set of rows of $I (w)$ is totally ordered by inclusion;
(ii)	the set of columns of $D (w)$ is totally ordered by inclusion;
$(ii)'$	the set of columns of $I (w)$ is totally ordered by inclusion;
(iii)	there do not exist $a, b, c, d$ such that $1 \leq a < b < c < d$ and $w (b) < w (a) < w (d) < w (c);$
(iv)	there exist $u, v \in S_{\infty}$ such that $(u \times v) D (w)$ is the diagram $D (λ)$ of a partition $λ;$
(v)	$λ (w)' = λ (w^{- 1}) .$

Proof.

Since $D (w) = (1 \times w) I (w)$ it is clear that (i) $\Leftrightarrow$ $(i)'$ and (ii) $\Leftrightarrow$ $(ii)' .$ Morever (i) $\Leftrightarrow$ (ii), for either is false if and only if there exist $a, β, c, δ \in [1, n]$ such that $a < c,$ $β < δ$ and $(a, β), (c, δ)$ belong to $D (w),$ whilst $(a, δ)$ and $(c, β)$ do not. Let $b = w^{- 1} (β)$ and $d = w^{- 1} (δ);$ then we have $a < b < c < d$ and $w (b) < w (a) < w (d) < w (c) .$ Thus (i), (ii) and (iii) are all equivalent.

Next, it is clear that the conjunction of (i) and (ii) is equivalent to (iv). Thus it remains to show that (iv) and (v) are equivalent. If (iv) is satisfied, then $λ (w) = λ$ and $λ (w^{- 1}) = λ',$ whence (v) is satisfied. Conversely, if $λ (w) = λ$ and $λ (w^{- 1}) = λ',$ then $D (w)$ can be brought into coincidence with $D (λ)$ by suitable permutations of the rows and of the columns, whence (iv) is satisfied.

$□$

An element $w \in S_{\infty}$ is said to be vexillary if it satisfies the equivalent conditions of (1.27). By (1.27) (iii), the first non-vexillary permutation is $(2143)$ in $S_{4} .$

For each $w \in S_{n}$ let

\overline{w} = w_{0} w w_{0}

where as before $w_{0} = (n, n - 1, \dots, 2, 1)$ is the longest element of $S_{n} .$ Then

(1.28)

(i)	$ℓ (\overline{w}) = ℓ (w) .$
(ii)	$I (\overline{w})$ is the reflection of $I (w)$ in the "antidiagonal" $i + j = n + 1 .$
(iii)	$λ (\overline{w}) = λ (w)' .$

Proof.

(i) follows from (1.6) (or from (ii) below).

(ii) If $i < j$ then

\begin{matrix} (i, j) \in I (\overline{w}) & ⟺ & w_{0} w w_{0} (i) > w_{0} w w_{0} (j) \\ ⟺ & w (n + 1 - i) < w (n + 1 - j) \\ ⟺ & (n + 1 - j, n + 1 - i) \in I (w) . \end{matrix}

(iii) now follows from (ii).

$□$

From (1.27) and (1.28) it follows that

(1.29) $w$ is vexillary $⟺ w^{- 1}$ is vexillary $⟺$ $\overline{w}$ is vexillary.

Dominant permutations

We consider next two particular types of vexillary permutations.

(1.30) Let $w \in S_{\infty} .$ Then the following conditions are equivalent:

(i)	the code of $w$ is a partition;
(ii)	the code of $w^{- 1}$ is a partition;
(iii)	$D (w)$ is the diagram of a partition.

Proof.

Clearly (iii) implies (i) and (ii).

Conversely, suppose that $c (w)$ is a partition $λ = (λ_{1}, \dots, λ_{m}),$ where $λ_{1} \geq \dots \geq λ_{m} \geq 0 .$ We shall show by induction on $i$ that

(i, j) \in D (w) ⟺ 1 \leq j \leq λ_{i} .

This is true for $i = 1,$ so assume that $1 < i \leq m$ and that the statement is true for $i - 1 .$ Then we have $w (k) \leq λ_{i - 1}$ for $1 \leq k \leq i - 1,$ and $w (k) = λ_{i - 1}$ for some $k \leq i - 1 .$ Since $λ_{i} \leq λ_{i - 1}$ it follows that the $i^{th}$ row of $D (w)$ consists of the points $(i, j),$ $1 \leq j \leq λ_{i},$ as required. Hence (i) implies (iii), and the same argument applied to $w^{- 1}$ shows that if the code of $w^{- 1}$ is a partition, then $D (w^{- 1})$ is the diagram of a partition. Hence so is $D (w),$ by (1.21) (i), and the proof is complete.

$□$

A permutation is said to be dominant if it satisfies the equivalent conditions of (1.30). Dominant permutations are clearly vexillary, and $w$ is dominant if and only if $w^{- 1}$ is dominant.

Grassmannian permutations

(1.31) Let $w \in S_{\infty} .$ Then the following conditions are equivalent:

(i)	$c_{1} (w) \leq \dots \leq c_{r} (w)$ and $c_{i} (w) = 0$ for $i > r;$
(ii)	$w (i) < w (i + 1)$ unless $i = r .$

Proof.

(i) $\Rightarrow$ (ii). By (1.15) we have $w (1) < \dots < w (r)$ and $w (r + 1) < \dots < w (n) .$ (ii) $\Rightarrow$ (i). We have

c (w) = (w (1) - 1, \dots, w (r) - r) .

$□$

If $w$ satisfies the equivalent conditions of (1.31), $w$ is called a Grassmannian permutation. By (1.27)(iii), Grassmannian permutations are vexillary, and $w \in S_{n}$ is Grassmannian if and only if $\overline{w} = w_{0} w w_{0}$ is Grassmannian.

Enumeration of vexillary permutations

Let $w$ be a permutation, $c = c (w) = (c_{1}, c_{2}, \dots)$ its code. Consider the following two conditions on the sequence $c :$

(V1)	If $i < j$ and $c_{i} > c_{j},$ then $Card {k : i < k < j and c_{k} < c_{j}} \leq c_{i} - c_{j};$
(V2)	If $i < j$ and $c_{i} \leq c_{j},$ then $c_{k} \geq c_{i}$ whenever $i < k < j .$

(1.32) A permutation $w$ is vexillary if and only if its code $c (w)$ satisfies (V1) and (V2).

Proof.

For each $i \geq 1,$ let

ρ_{i} = {j : (i, j) \in D (w)}

be the $i^{th}$ row of $D (w) .$

Suppose first that $w$ is vexillary with code $c = (c_{1}, c_{2}, \dots) .$ Let $i < k < j$ be such that $c_{i} \geq c_{j} > c_{k} .$ Then $ρ_{i} \supseteq ρ_{j} \supset ρ_{k}$ (where $\supset$ denotes strict containment), hence there exists $t \in ρ_{j},$ $t \notin ρ_{k} .$ Let $s = w (k),$ then $s < t$ and (since $t \in ρ_{i})$ we have $s \in ρ_{i}$ and $s \notin ρ_{j} .$ Hence for fixed $(i, j)$ such that $i < j$ and $c_{i} > c_{j},$ the number of $k$ between $i$ and $j$ such that $c_{j} > c_{k}$ is at most $Card (ρ_{i} - ρ_{j}) = c_{i} - c_{j},$ so that (V1) is satisfied.

Next let $w$ be vexillary, $i < k < j$ and $c_{i} < c_{j},$ so that $ρ_{i} \subseteq ρ_{j} .$ Let $s \in ρ_{i} .$ If $s \notin ρ_{k}$ then $w (k) \leq s < w (i),$ so that $w (k)$ lies in $ρ_{i}$ but not in $ρ_{j},$ which is impossible. Hence $s \in ρ_{k}$ and therefore $ρ_{i} \subseteq ρ_{k} .$ So we have $c_{i} \leq c_{k},$ and (V2) is satisfied.

Conversely, suppose that the code $c$ of $w$ satisfies (V1) and (V2). Then so does the sequence $(c_{2}, c_{3}, \dots)$ and we may therefore assume that the set ${ρ_{2}, ρ_{3}, \dots}$ is totally ordered by inclusion.

Let $j > 1$ and suppose first that $c_{1} \geq c_{j} .$ If $ρ_{1} ⊉ ρ_{j},$ there exists $s \in ρ_{j}$ such that $s \notin ρ_{1},$ so that $w (1) < s < w (j) .$ There are at least $c_{1} - c_{j} + 1$ elements $t \in ρ_{1}$ such that $t \notin ρ_{j},$ and since each such $t$ satisfies $t < w (1) < w (j),$ it is of the form $t = w (k)$ for some $k$ between $1$ and $j .$ Since $w (k) = t < w (1) < s,$ it follows that $s \notin ρ_{k} .$ Since either $ρ_{k} \subseteq ρ_{j}$ or $ρ_{j} \subseteq ρ_{k},$ we conclude that $ρ_{k} \subset ρ_{j}$ (strict inclusion) and hence that $c_{k} < c_{j} .$ Hence there are at least $c_{1} - c_{j} + 1$ values of $k$ between $1$ and $j$ for which $c_{k} < c_{j},$ contradicting (V1). Hence $ρ_{1} \supseteq ρ_{j} .$

Finally, let $j > 1$ and $c_{1} < c_{j},$ so that $w (1) < w (j) .$ If $ρ_{1} ⊈ ρ_{j}$ there exists $s \in ρ_{1}$ such that $s \notin ρ_{j};$ we have $s = w (k)$ for some $k$ between $1$ and $j,$ and since $w (k) < w (1)$ we have $c_{k} < c_{1},$ contradicting (V2). Hence $ρ_{1} \subseteq ρ_{j}$ in this case, and the proof is complete.

$□$

Remark. It is stated in [LSc1985, prop. 2.4] that $w$ is vexillary if and only if $c (w)$ satisfies (V1) and

(V3)	If $c_{i} > c_{i + 1}$ for some $i \geq 1,$ then $c_{i} > c_{j}$ for all $j > i .$

Since (V3) is implied by (V2), it follows from (1.32) that every vexillary code satisfies (V1) and (V3). However, the conjuction of (V1) and (V3) is not sufficient for vexillarity: for example, the permutation $w = (2571634)$ is not vexillary (since e.g. it contains the subword 2163) but its code is $c = (13402),$ which satisfies (V1) and (V3) (but not (V2)).

Let $w$ be a permutation with code $c (w) = (c_{1}, c_{2}, \dots) .$ For each $i \geq 1$ such that $c_{i} \neq 0,$ let

e_{i} = max {j : j \geq i and c_{j} \geq c_{i}} .

Arrange the numbers $e_{i}$ in increasing order of magnitude, say $ϕ_{1} \leq \dots \leq ϕ_{m} .$ The sequence

ϕ (w) = (ϕ_{1}, \dots, ϕ_{m})

is called the flag of $w .$ It is a sequence of length equal to $ℓ (λ),$ where $λ$ is the shape of $w .$

Remark. There is another definition of the flag of a permutation $w,$ due to M.Wachs [Wac1985]. For each $i \geq 1$ such that $c_{i} \neq 0,$ let

d_{i} = min {j : j > i and w (j) < w (i)} .

Arrange the numbers $d_{i} - 1$ in increasing order of magnitude, say $ϕ_{1}^{*} \leq \dots \leq ϕ_{m}^{*},$ and let

ϕ^{*} (w) = (ϕ_{1}^{*}, \dots, ϕ_{m}^{*}) .

These two notions are not equivalent. In fact

(1.33) (J. Alfano) We have $ϕ (w) = ϕ^{*} (w)$ if and only if the permutation $w$ satisfies (V2).

Proof.

If $c_{i} \neq 0$ we have $w (j) > w (i)$ for $i < j < d_{i},$ and hence $c_{j} \geq c_{i}$ for these values of $j .$ Hence $d_{i} - 1 \leq e_{i}$ in all cases, and we shall have $ϕ (w) = ϕ^{*} (w)$ if and only if $d_{i} - 1 = e_{i}$ for each $i .$ But this condition means that, for each $i \geq 1,$ the set of $j \geq i$ such that $c_{j} \geq c_{i}$ is an interval; and this is just a restatement of the condition (V2).

$□$

We shall show that a vexillary permutation is uniquely determined by its shape $λ (w)$ and its flag $ϕ (w) .$

Let us write $λ = λ (w)$ in the form

\begin{matrix} (1.34) & λ = (p_{1}^{m_{1}}, p_{2}^{m_{2}}, \dots, p_{k}^{m_{k}}) \end{matrix}

where $p_{1} > p_{2} > \dots > p_{k} > 0$ and each $m_{i} \geq 1 .$ For $1 \leq r \leq k$ let

f_{r} = max {j : c_{j} \geq p_{r}}

so that $f_{1} \leq \dots \leq f_{k} .$ If $c = (c_{1}, c_{2}, \dots)$ is the code of $w,$ each nonzero $c_{i}$ is equal to $p_{r}$ for some $r,$ and

e_{i} = max {j : j \geq i and c_{j} \geq p_{r}} = f_{r} .

It follows that (whether $w$ is vexillary or not)

\begin{matrix} (1.35) & ϕ (w) = (f_{1}^{m_{1}}, f_{2}^{m_{2}}, \dots, f_{k}^{m_{k}}) . \end{matrix}

Moreover we must have

\begin{matrix} (1.36) & f_{r} \geq m_{1} + \dots + m_{r} (1 \leq r \leq k) \end{matrix}

since in the sequence $(c_{1}, c_{2}, \dots)$ there are $m_{1} + \dots + m_{r}$ terms $\geq p_{r},$ and they must all occur in the first $f_{r}$ places of the sequence.

(1.37) Suppose $w$ is a vexillary pennutation with shape $λ$ and flag $ϕ$ given by (1.34) and (1.35). Then the $f_{r}$ must satisfy the inequalities

0 \leq f_{r} - f_{r - 1} \leq m_{r} + p_{r - 1} - p_{r} .

Proof.

If $f_{r - 1} = f_{r}$ there is nothing to prove, so assume that $f_{r - 1} < f_{r}$ and therefore $c_{f_{r}} = p_{r} .$ Let

s = max {i : c_{i} = p_{r - 1}} \leq f_{r - 1} .

Since $c_{s} = p_{r - 1} > p_{r} = c_{f_{r}}$ and $w$ is vexillary, we have by (V1)

\begin{matrix} (1) & Card {k : s < k \leq f_{r} and c_{k} < p_{r}} \leq p_{r - 1} - p_{r} . \end{matrix}

Also

\begin{matrix} (2) & Card {k : s < k \leq f_{r} and c_{k} = p_{r}} \leq m_{r}, \end{matrix}

since exactly $m_{r}$ terms of the sequence $c$ are equal to $p_{r} .$

Finally we have

\begin{matrix} (3) & Card {k : s < k \leq f_{r} and c_{k} > p_{r}} = f_{r - 1} - s \end{matrix}

because $c_{k} \leq p_{r}$ for all $k > f_{r - 1},$ and $c_{k} \geq p_{r - 1}$ for all $k$ such that $s < k \leq f_{r - 1},$ by virtue of (V2).

From (1), (2), and (3) we deduce that

f_{r} - s \leq p_{r - 1} - p_{r} + m_{r} + f_{r - 1} - s

which proves (1.37).

$□$

(1.38) For each sequence $(f_{1}, \dots, f_{k})$ satisfying (1.36) and (1.37) there is a unique vexillary permutation $w$ with shape $λ$ and flag $ϕ = (f_{1}^{m_{1}}, \dots, f_{k}^{m_{k}}) .$ The code $c$ of $w$ is constructed as follows: first the $m_{1}$ entries equal to $p_{1}$ are inserted at the right-hand end of the interval $[1, f_{1}];$ then the $m_{2}$ entries in $c$ equal to $p_{2}$ are inserted in the rightmost available spaces in the interval $[1, f_{2}],$ and so on: for each $r \geq 1,$ when all the terms $> p_{r}$ in the sequence $c$ have been inserted, the $m_{r}$ entries equal to $p_{r}$ are inserted in the rightmost available spaces of the interval $[1, f_{r}] .$

Proof.

Suppose first that $w$ is vexillary. If $1 \leq i \leq f_{r}$ and $c_{i} = p_{r},$ then by (V2) we have $c_{j} \geq p_{r}$ for all $j$ such that $i < j < f_{r} .$ Hence the entries equal to $p_{r}$ in the sequence $c$ must be inserted as described above.

Conversely, if the sequence $c$ is constructed as above, we claim that $c$ satisfies (V1) and (V2), and hence $w$ is vexillary by (1.32). Suppose first that $i < j$ and $c_{i} \geq c_{j} :$ say $c_{i} = p_{r},$ $c_{j} = p_{s},$ $r \leq s .$ Then the number of $k$ such that $i < k < j$ and $c_{k} < p_{s}$ is equal to the number of blank spaces in the interval $[f_{r}, f_{s}]$ after all the entries $p_{i},$ $r + 1 \leq i \leq s$ have been inserted, hence is at most

f_{s} - f_{r} - (m_{r + 1} + \dots + m_{s})

which by (1.37) is $\leq p_{r} - p_{s} .$ Hence the sequence $c$ satisfies (V1). Suppose next that $i < j$ and $c_{i} < c_{j} :$ say $c_{i} = p_{s},$ $c_{j} = p_{r}$ with $r < s .$ Then we have $j \leq f_{r} \leq f_{s} .$ From the definition of the sequence $c,$ it follows that for each $k$ such that $i \leq k \leq f_{s}$ we have $c_{k} \geq p_{s},$ and hence $c_{k} \geq c_{i}$ whenever $i < k < j .$ Consequently the condition (V2) is satisfied, and the proof is complete.

$□$

If $w$ is a permutation and $r \geq 0,$ we denote by $1_{r} \times w$ the permutation

1_{r} \times w = (1, 2, \dots, r, r + w (1), r + w (2), \dots) .

Let us say that two permutations $w, w'$ are diagonally equivalent if either $w' = 1_{r} \times w$ or $w = 1_{r} \times w'$ for some $r \geq 0 .$ Graphically, this means that the diagram of $w'$ can be brought into coincidence with that of $w$ by a translation along the diagonal $i = j,$ and $w'$ is vexillary if and only if $w$ is vexillary. The equivalence classes of vexillary permutations of a given shape $λ$ are then determined by the differences $f_{r} - f_{r - 1}$ $(2 \leq r \leq k),$ and hence it follows from (1.37) and (1.38) that

(1.39) The number of diagonal equivalence classes of vexillary permutations of shape $λ = (p_{1}^{m_{1}}, \dots, p_{k}^{m_{k}})$ is

\prod_{r = 2}^{k} (p_{r - 1} - p_{r} + m_{r} + 1) .

We may remark that this number is the product of the hook lengths at the re-entrant nodes of the border of the diagram of $λ$ (i.e., the nodes with coordinates $(m_{1} + \dots + m_{r - 1}, p_{r}),$ $2 \leq r \leq k).$

Example. If $λ = (3 2^{2} 1)$ the flag $ϕ = (f_{1}, f_{2}^{2}, f_{3})$ must satisfy $0 \leq f_{2} - f_{1} \leq 3,$ $0 \leq f_{3} - f_{2} \leq 2 .$ Hence there are $(3 + 1) (2 + 1) = 12$ vexillary classes, and the representatives of these classes for which $w (1) \neq 1$ (or equivalently $c_{1} (w) \neq 0)$ are as follows:

\begin{matrix} ϕ (w) & c (w) & w \\ 4444 & 1223 & 2457136 \\ 3444 & 1232 & 246513 \\ 2444 & 1322 & 254613 \\ 1444 & 3122 & 425613 \\ 3334 & 2231 & 346215 \\ 2334 & 2321 & 35421 \\ 1334 & 3221 & 43521 \\ 1445 & 30221 & 415632 \\ 3335 & 22301 & 346152 \\ 2335 & 23201 & 354162 \\ 1335 & 32201 & 435162 \\ 1446 & 302201 & 4156273 \end{matrix}

\begin{matrix} 1223 & 2231 & 22301 \\ 1232 & 2321 & 23201 \\ 1322 & 3221 & 32201 \\ 3122 & 302201 & 302201 \end{matrix}

Let $λ = (p_{1}^{m_{1}}, \dots, p_{k}^{m_{k}})$ as before and let

λ' = (q_{1}^{n_{1}}, q_{2}^{n_{2}}, \dots, q_{k}^{n_{k}})

be the conjugate partition, where $q_{1} > q_{2} > \dots > q_{k} > 0$ and each $n_{i} \geq 1 .$ We have

\begin{matrix} (1.40) & {\begin{matrix} p_{r} = n_{1} + \dots + n_{s}, \\ q_{r} = m_{1} + \dots + m_{s}, \end{matrix} \end{matrix}

where $s = k + 1 - r$ $(1 \leq r \leq k) .$ The border of the diagram of $λ$ is a staircase with risers of heights $m_{1}, m_{2}, \dots, m_{k}$ (starting from the top) and treads of lengths $n_{1}, n_{2}, \dots, n_{k}$ (starting at the bottom).

Recall (1.27) that if $w$ is vexillary of shape $λ,$ then $w^{- 1}$ is vexillary of shape $λ' .$

(1.41) Let $w$ be a vexillary permutation of shape $λ$ and flag $ϕ (w) = (f_{1}^{m_{1}}, \dots, f_{k}^{m_{k}}) .$ Then the flag of $w^{- 1}$ is

ϕ (w^{- 1}) = (g_{1}^{n_{1}}, \dots, g_{k}^{n_{k}})

where

\begin{matrix} (*) & g_{i} + q_{i} = f_{k + 1 - i} + p_{k + 1 - i} (1 \leq i \leq k) . \end{matrix}

Proof.

We proceed by induction on $ℓ (w) = | λ | .$ Let $c = (c_{1}, c_{2}, \dots)$ be the code of $w,$ and let $w'$ be the permutation with code $c' = (c_{2}, c_{3}, \dots) .$ We may assume that $c_{1} \neq 0 .$ Then $c_{1} = p_{r}$ for some $r,$ and we have

\begin{matrix} λ (w') & = & (p_{1}^{m_{1}}, \dots, p_{r}^{m_{r} - 1}, \dots, p_{k}^{m_{k}}), \\ ϕ (w') & = & ({(f_{1} - 1)}^{m_{1}}, \dots, {(f_{r} - 1)}^{m_{r} - 1}, \dots, {(f_{k} - 1)}^{m_{k}}) . \end{matrix}

Since $w$ is vexillary, its code $c$ satisfies the conditions (V1) and (V2). Hence $c'$ also satisfies these conditions, and therefore $w'$ is vexillary. It follows that $λ ({w'}^{- 1}) = λ (w')',$ so that

λ ({w'}^{- 1}) = ({(q_{1} - 1)}^{n_{1}}, \dots, {(q_{s} - 1)}^{n_{s}}, q_{s + 1}^{n_{s} + 1}, \dots, q_{k}^{n_{k}})

where $s = k + 1 - r .$ We have $ℓ (w') = ℓ (w) - c_{1},$ so that the inductive hypothesis applies to $w' .$ Hence if $g_{1}, \dots, g_{k}$ are defined by the formula $(*),$ we have

\begin{matrix} (1) & ϕ ({w'}^{- 1}) = (g_{1}^{n_{1}}, \dots, g_{s}^{n_{s}}, {(g_{s + 1} - 1)}^{n_{s + 1}}, \dots, {(g_{k} - 1)}^{n_{k}}) . \end{matrix}

But if ${w'}^{- 1}$ has code $c ({w'}^{- 1}) = (d_{1}, d_{2}, \dots)$ then by (1.25) we have

\begin{matrix} (2) & c (w^{- 1}) = (d_{1} + 1, \dots, d_{p_{r}} + 1, 0, d_{p_{r} + 1}, d_{p_{r} + 2}, \dots) . \end{matrix}

From (1) and (2) and (1.40) it follows that

ϕ (w^{- 1}) = (g_{1}^{n_{1}}, \dots, g_{s}^{n_{s}}, g_{s + 1}^{n_{s + 1}}, \dots, g_{k}^{n_{k}})

as required.

$□$

If $w \in S_{n},$ let ${\overline{w}}_{n} = w_{0} w w_{0},$ where $w_{0}$ is the longest element in $S_{n} .$ If $w$ is vexillary, of shape $λ,$ then ${\overline{w}}_{n}$ is vexillary of shape $λ',$ by (1.27) and (1.28). Let

ϕ ({\overline{w}}_{n}) = ({\overline{f_{1}}}^{n_{1}}, \dots, {\overline{f_{k}}}^{n_{k}})

be the flag of ${\overline{w}}_{n} .$ Then we have

\begin{matrix} (1.42) & \overline{f_{i}} = n - f_{k + 1 - i} (1 \leq i \leq k) . \end{matrix}

For once we shall leave the proof to the reader.

Let $N_{n}$ denote the number of non-vexillary $w \in S_{n},$ and let

P_{n} = N_{n} / n!

be the probability that an element of $S_{n}$ is non-vexillary. The first few values of $N_{n}$ and $P_{n}$ are

\begin{matrix} n & N_{n} & P_{n} \\ 1 & 0 & 0 \\ 2 & 0 & 0 \\ 3 & 0 & 0 \\ 4 & 1 & .042 \\ 5 & 17 & .142 \\ 6 & 207 & .288 \\ 7 & 2279^{*} & .452 \end{matrix}

$^{*}$ was computed by A. Garsia. I would guess that $N_{8}$ is of the order of $24000 .$

If we divide up the sequence $(w (1), \dots, w (n))$ into consecutive blocks of length $4,$ and observe that the probability that such a block satisfies the vexillarity condition (1.27)(iii) is $23 / 24$ (because $S_{4}$ contains only one non-vexillary permutation), we see that the probability that $w \in S_{n}$ is vexillary is at most ${(23 / 24)}^{[n / 4]},$ hence decreases exponentially to zero. (A. Lascoux.) Thus the vexillary permutations in $S_{n}$ become sparser and sparser as $n$ increases.

Instead of counting non-vexillary permutations, we may attempt to count vexillary permutations. Let us say that a permutation $w \in S_{n}$ is primitive if $w (1) \neq 1$ and $w (n) \neq n .$ For each $n \geq 1,$ let $V_{n}$ (resp. $U_{n})$ denote the number of vexillary (resp. primitive vexillary) permutations $w \in S_{n} .$ Since each primitive vexillary $w \in S_{n}$ gives rise to $r + 1$ imprimitive vexillary permutations in $S_{n + r},$ namely $1_{p} \times w \times 1_{q}$ where $p, q \geq 0$ and $p + q = r,$ it follows that

V_{n} = 1 + U_{n} + 2 U_{n - 1} + 3 U_{n - 2} + \dots

Hence the generating functions

\begin{matrix} V (t) & = & \sum_{n \geq 1} V_{n} t^{n} \\ U (t) & = & \sum_{n \geq 1} U_{n} t^{n} \end{matrix}

are related by

\begin{matrix} (1.43) & V (t) = \frac{t}{1 - t} + \frac{U (t)}{{(1 - t)}^{2}} . \end{matrix}

For each partition $λ \neq 0,$ let $U_{n, λ}$ denote the number of primitive vexillary permutations of shape $λ$ in $S_{n},$ and let

U_{λ} (t) = \sum_{n \geq 1} U_{n, λ} t^{n},

so that

\begin{matrix} (1.44) & U (t) = \sum_{λ \neq 0} U_{λ} (t) . \end{matrix}

Each $U_{λ} (t)$ is a polynomial, and we shall now show how to compute it. Write $λ$ in the form

λ = (p_{1}^{m_{1}}, p_{2}^{m_{2}}, \dots, p_{k}^{m_{k}})

as before, where $p_{1} > p_{2} > \dots > p_{k} > 0 .$ By (1.37) a vexillary permutation $w$ of shape $λ$ is uniquely determined by its flag $ϕ (w) = (f_{1}^{m_{1}}, \dots, f_{k}^{m_{k}}),$ where $(f_{1}, \dots, f_{k})$ is any vector of positive integers satisfying the inequalities (1.36), (1.37):

\begin{matrix} f_{r} \geq m_{1} + \dots + m_{r} (1 \leq r \leq k), \\ 0 < f_{r} - f_{r - 1} \leq m_{r} + p_{r - 1} - p_{r} (2 \leq r \leq k) . \end{matrix}

Moreover we shall have $w (1) \neq 1$ if and only if the first element of the code of $w$ is not zero, and this will be the case if and only if

\begin{matrix} (1) & f_{r} = m_{1} + \dots + m_{r} for some r + 1, \dots, k . \end{matrix}

In general, if $c = (c_{1}, c_{2}, \dots)$ is the code of a permutation $w,$ then $w \in S_{n}$ if and only if $n \geq c_{i} + i$ for $1 \leq i \leq r,$ where $r$ is the length of $c .$ In other words, the least $n$ for which $w \in S_{n}$ is $n = max {c_{i} + i : 1 \leq i \leq r} .$ In the case of a vexillary permutation $w$ as above, with flag $(f_{1}^{m_{1}}, \dots, f_{k}^{m_{k}}),$ the numbers $c_{i} + i$ will increase strictly as $i$ runs through each non-empty interval $[f_{r - 1} + 1, f_{r}]$ $(r = 1, \dots, k),$ and hence $w$ will be primitive in $S_{n}$ if and only if $w$ satisfies (1) above and

\begin{matrix} (2) & n = max {p_{r} + f_{r} : 1 \leq r \leq k} . \end{matrix}

Let $π_{r} = m_{1} + \dots + m_{r}$ for $1 \leq r \leq k$ and put

u_{r} = f_{r} - π_{r}

so that $u_{r} \geq 0$ for each $r .$ From (1.36) we have

\begin{matrix} (3) & π_{1} + u_{1} \leq π_{2} + u_{2} \leq \dots \leq π_{k} + u_{k} \end{matrix}

and

\begin{matrix} m_{r} + p_{r - 1} - p_{r} & \geq & f_{r} - f_{r - 1} \\ = & (u_{r} + π_{r}) - (u_{r - 1} + π_{r - 1}) \\ = & m_{r} + u_{r} - r_{r - 1} \end{matrix}

so that

\begin{matrix} (4) & p_{1} + u_{1} \geq p_{2} + u_{2} \geq \dots \geq p_{k} + u_{k} . \end{matrix}

It now follows that

\begin{matrix} (1.45) & U_{λ} (t) = \sum_{u} t^{max {p_{r} + π_{r} + u_{r} : 1 \leq r \leq k}} \end{matrix}

summed over the integer vectors $u = (u_{1}, \dots, u_{k}) \in ℕ^{k}$ having at least one zero component, and satisfying the inequalities (3), (4) above. We have

U_{λ} (1) = \prod_{r = 2}^{k} (m_{r} + p_{r - 1} - p_{r} + 1)

and

U_{λ} (t) = U_{λ'} (t)

(since $w \in S_{n}$ is primitive vexillary of shape $λ$ if and only if $w^{- 1}$ is primitive vexillary of shape $λ').$

Added in proof

Julian West, a student of R. Stanley, has recently shown that

\begin{matrix} (1) & V_{n} = \sum_{\underset{ℓ (λ) \leq 3}{| λ | = n}} {(f^{λ})}^{2} \end{matrix}

where $f^{λ}$ is the degree of the irreducible representation of the symmetric group $S_{n}$ indexed by the partition $λ .$ From this and results of A. Regev (Advances in Math. 41 (1981) 115-136) it follows that

\begin{matrix} (2) & V_{n} \sim c 9^{n} n^{- 4} \end{matrix}

as $n \to \infty,$ where $c$ is a constant that Regev determines explicitly.

The formula (1) gives that $N_{8} = 24553 .$

Notes and References

This is a typed excerpt of the book Notes on Schubert Polynomials by I. G. Macdonald.

page history

Notes on Schubert PolynomialsChapter 1

Permutations

The Bruhat order

The diagram of a permutation

The code of a permutation

Vexillary permutations

Dominant permutations

Grassmannian permutations

Enumeration of vexillary permutations

Added in proof

Notes and References

Notes on Schubert Polynomials
Chapter 1