Chapter 2: Divided differences - Notes on Schubert Polynomials

Notes on Schubert Polynomials
Chapter 2

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last update: 27 June 2013

Divided differences

If $f$ is a function of $x$ and $y$ (and possibly other variables), let

\partial_{x y} f = \frac{f (x, y) - f (y, x)}{x - y}

("divided difference"). Equivalently

\partial_{x y} = {(x - y)}^{- 1} (1 - s_{x y})

where $s_{x y}$ interchanges $x$ and $y .$ The operator $\partial_{x y}$ takes polynomials to polynomials, and has degree $- 1$ (i.e., if $f$ is homogeneous of degree $d,$ then $\partial_{x y} f$ is homogeneous of degree $d - 1).$ Explicitly, if $f = x^{r} y^{s}$ we have

\begin{matrix} (2.1) & \begin{matrix} \partial_{x y} (x^{r} y^{s}) & = & \frac{x^{r} y^{s} - x^{s} y^{r}}{x - y} \\ & = & σ (r - s) \sum x^{p} y^{q} \end{matrix} \end{matrix}

where the sum is over $(p, q)$ such that $p + q = r + s - 1$ and $\max (p, q) < \max (r, s),$ and $σ (r - s)$ is $+ 1, 0$ or $- 1$ according as $r - s$ is positive, zero or negative.
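
As an illustrative check of (2.1) (the exponents are chosen here for convenience), take $r = 3,$ $s = 1 .$ Then $σ (r - s) = + 1$ and the admissible pairs are $(p, q) = (2, 1)$ and $(1, 2),$ in agreement with the direct computation

\partial_{x y} (x^{3} y) = \frac{x^{3} y - x y^{3}}{x - y} = x y (x + y) = x^{2} y + x y^{2} .

Interchanging the exponents changes the sign, $\partial_{x y} (x y^{3}) = - (x^{2} y + x y^{2}),$ and for $r = s$ the result is $0$ because $x^{r} y^{r}$ is symmetric.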

On a product $f g,$ the operator $\partial_{x y}$ acts according to the rule

\begin{matrix} (2.2) & \partial_{x y} (f g) = (\partial_{x y} f) g + (s_{x y} f) (\partial_{x y} g) . \end{matrix}

In particular we have

\begin{matrix} (2.2') & \partial_{x y} (f g) = f \partial_{x y} g \end{matrix}

if $f (x, y) = f (y, x) .$
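
For instance (an illustrative check, with $f = x$ and $g = y$ chosen for simplicity), we have $\partial_{x y} x = 1,$ $\partial_{x y} y = - 1$ and $s_{x y} x = y,$ so that (2.2) gives

\partial_{x y} (x y) = 1 \cdot y + y \cdot (- 1) = 0,

as it must, since $x y$ is symmetric in $x$ and $y .$ Similarly, (2.2') with the symmetric factor $x + y$ gives

\partial_{x y} ((x + y) x) = (x + y) \partial_{x y} x = x + y,

which agrees with $\partial_{x y} (x^{2}) + \partial_{x y} (x y) = (x + y) + 0 .$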

(2.3)

(i)	$\partial_{x y} s_{x y} = - \partial_{x y}, s_{x y} \partial_{x y} = \partial_{x y},$
(ii)	$\partial_{x y}^{2} = 0,$
(iii)	$\partial_{x y} \partial_{y z} \partial_{x y} = \partial_{y z} \partial_{x y} \partial_{y z} .$

Proof.

(i) and (ii) are immediate from the definitions, and (iii) is verified by direct calculation: each side is equal to

{(x - y)}^{- 1} {(x - z)}^{- 1} {(y - z)}^{- 1} \sum_{w \in S_{3}} ε (w) w,

where the symmetric group $S_{3}$ permutes $x, y$ and $z,$ and $ε (w)$ is the sign of the permutation $w .$

$□$
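
As an illustrative check of (iii) (the test polynomial $x^{2} y$ is chosen here for convenience), apply both sides to $x^{2} y,$ using (2.1) at each step:

\partial_{x y} \partial_{y z} \partial_{x y} (x^{2} y) = \partial_{x y} \partial_{y z} (x y) = \partial_{x y} (x) = 1,

\partial_{y z} \partial_{x y} \partial_{y z} (x^{2} y) = \partial_{y z} \partial_{x y} (x^{2}) = \partial_{y z} (x + y) = 1 .

Both values agree with the formula in the proof, since $\sum_{w \in S_{3}} ε (w) w (x^{2} y) = (x - y) (x - z) (y - z) .$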

Let $x_{1}, x_{2}, \dots, x_{n}, \dots$ be independent variables, and let

P_{n} = ℤ [x_{1}, x_{2}, \dots, x_{n}]

for each $n \geq 1,$ and

\begin{matrix} P_{\infty} & = & ℤ [x_{1}, x_{2}, \dots] \\ & = & ⋃_{n = 1}^{\infty} P_{n} . \end{matrix}

For each $i \geq 1$ let

\partial_{i} = \partial_{x_{i}, x_{i + 1}} .

Each $\partial_{i}$ is a linear operator on $P_{\infty}$ (and on $P_{n}$ for $n > i)$ of degree $- 1 .$ From (2.3) we have (compare with (1.1))

\begin{matrix} (2.4) & \left\{ \begin{matrix} \partial_{i}^{2} = 0, \\ \partial_{i} \partial_{j} = \partial_{j} \partial_{i} & \text{if } | i - j | > 1, \\ \partial_{i} \partial_{i + 1} \partial_{i} = \partial_{i + 1} \partial_{i} \partial_{i + 1} . \end{matrix} \right. \end{matrix}

For any sequence $a = (a_{1}, \dots, a_{p})$ of positive integers, we define

\partial_{a} = \partial_{a_{1}} \dots \partial_{a_{p}} .

Recall that if $w$ is any permutation, $R (w)$ denotes the set of reduced words for $w,$ i.e. sequences $(a_{1}, \dots, a_{p})$ such that $w = s_{a_{1}} \dots s_{a_{p}}$ and $p = ℓ (w) .$

(2.5) If $a, b \in R (w)$ then $\partial_{a} = \partial_{b} .$

Proof.

We proceed by induction on $p = ℓ (w) .$ Let us write $a \equiv b$ to mean that $\partial_{a} = \partial_{b} .$ The inductive hypothesis then implies that

\begin{matrix} (*) & a \equiv b \text{ if either } a_{1} = b_{1} \text{ or } a_{p} = b_{p} . \end{matrix}

By the exchange lemma (1.8) we have

c_{i} = (b_{1}, a_{1}, \dots, {\hat{a}}_{i}, \dots, a_{p}) \in R (w)

for some $i = 1, \dots, p .$ If $i \neq p$ then $b \equiv c_{i} \equiv a$ by virtue of $(*),$ so that $a \equiv b .$ If $i = p$ and $| b_{1} - a_{1} | > 1$ then by (2.4) and (1.1)

c_{p}^{'} = (a_{1}, b_{1}, a_{2}, \dots, a_{p - 1}) \in R (w)

and $a \equiv c_{p}^{'} \equiv c_{p} \equiv b,$ so that again $a \equiv b .$

Finally, if $i = p$ and $| b_{1} - a_{1} | = 1,$ we apply the exchange lemma again, this time to $c_{p}$ and $a;$ this shows that

d_{i} = (a_{1}, b_{1}, a_{1}, \dots, {\hat{a}}_{i}, \dots, a_{p - 1}) \in R (w)

for some $i = 2, \dots, p - 1 .$ But then by (2.4) and (1.1) we have

d_{i}^{'} = (b_{1}, a_{1}, b_{1}, a_{2}, \dots, {\hat{a}}_{i}, \dots, a_{p - 1}) \in R (w)

and $a \equiv d_{i} \equiv d_{i}^{'} \equiv b .$ Hence $a \equiv b$ in all cases.

$□$

Remark. For any permutation $w,$ let $G R (w)$ denote the graph whose vertices are the reduced words for $w,$ and in which a reduced word $a$ is joined by an edge to each of the words obtained from $a$ by either interchanging two consecutive terms $i, j$ such that $| i - j | > 1,$ or by replacing three consecutive terms $i, j, i$ such that $| i - j | = 1$ by $j, i, j .$ Then the proof of (2.5) shows that

(2.5') The graph $G R (w)$ is connected.
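
For example (two small cases given here for illustration, not taken from the original text), the sets of reduced words

R (s_{1} s_{2} s_{1}) = \{ (1, 2, 1), (2, 1, 2) \}, \quad R (s_{1} s_{3}) = \{ (1, 3), (3, 1) \}

each have two elements. In the first case the two words are joined by an edge of the second kind (replacing $1, 2, 1$ by $2, 1, 2),$ and in the second case by an edge of the first kind (interchanging $1$ and $3),$ so both graphs consist of a single edge.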

From (2.5) it follows that we may define

\partial_{w} = \partial_{a}

unambiguously, where $a$ is any reduced word for $w .$ By (2.2'), the operators $\partial_{w}$ for $w \in S_{n}$ are $Λ_{n}$-linear, where

Λ_{n} = ℤ {[x_{1}, \dots, x_{n}]}^{S_{n}} \subset P_{n}

is the ring of symmetric polynomials in $x_{1}, \dots, x_{n} .$

A sequence $a = (a_{1}, \dots, a_{p})$ will be said to be reduced if $a \in R (w)$ for some permutation $w .$

(2.6) If $a = (a_{1}, \dots, a_{p})$ is not reduced, then $\partial_{a} = 0 .$

Proof.

By induction on $p .$ If $a' = (a_{1}, \dots, a_{p - 1})$ is not reduced, then $\partial_{a'} = 0$ and hence $\partial_{a} = \partial_{a'} \partial_{a_{p}} = 0 .$ So we may assume that $a'$ is reduced. Let $v = s_{a_{1}} \dots s_{a_{p - 1}},$ $w = s_{a_{1}} \dots s_{a_{p}} .$ We have $ℓ (v) = p - 1$ and $ℓ (w) \leq p - 1,$ hence by (1.3) $ℓ (w) = p - 2,$ so that $ℓ (v) = ℓ (w s_{a_{p}}) = ℓ (w) + 1 .$ Consequently $\partial_{v} = \partial_{w} \partial_{a_{p}}$ and therefore $\partial_{a} = \partial_{v} \partial_{a_{p}} = \partial_{w} \partial_{a_{p}}^{2} = 0 .$

$□$
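
As an illustration (the sequence is chosen here as the smallest interesting case), take $a = (1, 2, 1, 2),$ which is not reduced since $s_{1} s_{2} s_{1} s_{2} = s_{2} s_{1}$ has length $2 .$ The relations (2.4) give directly

\partial_{1} \partial_{2} \partial_{1} \partial_{2} = (\partial_{2} \partial_{1} \partial_{2}) \partial_{2} = \partial_{2} \partial_{1} \partial_{2}^{2} = 0 .

In the notation of the proof, $v = s_{1} s_{2} s_{1},$ $w = s_{2} s_{1}$ and $\partial_{v} = \partial_{w} \partial_{2} .$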

(2.7) Let $u, v$ be permutations. Then

\partial_{u} \partial_{v} = \left\{ \begin{matrix} \partial_{u v} & \text{if } ℓ (u v) = ℓ (u) + ℓ (v), \\ 0 & \text{otherwise.} \end{matrix} \right.

Proof.

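Let $a \in R (u)$ and $b \in R (v),$ so that $\partial_{u} \partial_{v} = \partial_{a} \partial_{b}$ is the operator attached to the concatenation of $a$ and $b .$ If $ℓ (u v) = ℓ (u) + ℓ (v)$ this concatenation is a reduced word for $u v,$ and the result follows from (2.5); otherwise it is not reduced, and the product is zero by (2.6).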

$□$
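
For illustration in $S_{3}$ (the permutations are chosen here for convenience): if $u = s_{1}$ and $v = s_{2}$ then $ℓ (u v) = 2 = ℓ (u) + ℓ (v),$ whereas if $u = s_{1} s_{2}$ and $v = s_{2}$ then $u v = s_{1}$ has length $1 \neq 3 .$ Accordingly

\partial_{s_{1}} \partial_{s_{2}} = \partial_{s_{1} s_{2}}, \quad \partial_{s_{1} s_{2}} \partial_{s_{2}} = \partial_{1} \partial_{2}^{2} = 0 .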

(2.8) Let $w$ be a permutation, $i \geq 1 .$ Then

s_{i} \partial_{w} = \partial_{w} ⟺ ℓ (s_{i} w) = ℓ (w) - 1 .

Proof.

We have $s_{i} \partial_{w} = \partial_{w} ⟺ \partial_{i} \partial_{w} = 0,$ hence the result follows from (2.7).

$□$
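
For illustration, take $w = s_{1} .$ For $i = 1$ we have $ℓ (s_{1} w) = 0 = ℓ (w) - 1,$ and indeed $s_{1} \partial_{1} = \partial_{1}$ by (2.3)(i). For $i = 2$ we have $ℓ (s_{2} s_{1}) = 2,$ and correspondingly $s_{2} \partial_{1} \neq \partial_{1},$ as the test polynomial $x_{1}^{2}$ (chosen here for convenience) shows:

s_{2} \partial_{1} (x_{1}^{2}) = s_{2} (x_{1} + x_{2}) = x_{1} + x_{3} \neq x_{1} + x_{2} = \partial_{1} (x_{1}^{2}) .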

As before let $w_{0} = (n, n - 1, \dots, 2, 1)$ be the longest element of $S_{n} .$ One element of $R (w_{0})$ is the sequence

\begin{matrix} (2.9) & (1, 2, \dots, n - 1, 1, 2, \dots, n - 2, \dots, 1, 2, 3, 1, 2, 1) . \end{matrix}

(2.10) We have

\partial_{w_{0}} = a_{δ}^{- 1} \sum_{w \in S_{n}} ε (w) w

where $a_{δ} = \prod_{1 \leq i < j \leq n} (x_{i} - x_{j}),$ and $ε (w) = \pm 1$ is the sign of $w .$

Proof.

From the definition it follows that $\partial_{w_{0}}$ is of the form

\begin{matrix} (1) & \partial_{w_{0}} = \sum_{w \in S_{n}} c_{w} w \end{matrix}

with coefficients $c_{w}$ rational functions of $x_{1}, \dots, x_{n} .$ By (2.8) we have $s_{i} \partial_{w_{0}} = \partial_{w_{0}}$ for $1 \leq i \leq n - 1,$ so that $v \partial_{w_{0}} = \partial_{w_{0}}$ for all $v \in S_{n},$ and therefore

\begin{matrix} (2) & \partial_{w_{0}} = \sum_{w \in S_{n}} v (c_{w}) v w . \end{matrix}

Comparison of (1) and (2) shows that

\begin{matrix} (3) & c_{v w} = v (c_{w}) \quad (v, w \in S_{n}) . \end{matrix}

Hence all the coefficients $c_{w}$ are determined by one of them, say $c_{w_{0}} .$ From the sequence (2.9) for $w_{0}$ it is easily checked that the coefficient of $w_{0}$ in $\partial_{w_{0}}$ is

c_{w_{0}} = ε (w_{0}) a_{δ}^{- 1} .

Hence from (3) we have

c_{w} = w w_{0} (c_{w_{0}}) = ε (w) a_{δ}^{- 1}

which proves (2.10).

$□$

From (2.10) it follows that, for any $α = (α_{1}, \dots, α_{n}) \in ℕ^{n},$

\begin{matrix} (2.11) & \partial_{w_{0}} x^{α} = s_{α - δ} (x_{1}, \dots, x_{n}) \end{matrix}

where $x^{α}$ means $x_{1}^{α_{1}} \dots x_{n}^{α_{n}},$ $δ = (n - 1, n - 2, \dots, 1, 0)$ and $s_{α - δ}$ is the Schur function indexed by $α - δ .$ Thus $\partial_{w_{0}}$ is a $Λ_{n}$-linear mapping of $P_{n}$ onto $Λ_{n} .$
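
As an illustrative special case, take $n = 2,$ so that $δ = (1, 0)$ and $\partial_{w_{0}} = \partial_{1} .$ For $α = (3, 0)$ we have $α - δ = (2, 0)$ and

\partial_{1} (x_{1}^{3}) = x_{1}^{2} + x_{1} x_{2} + x_{2}^{2} = s_{(2, 0)} (x_{1}, x_{2}) .

For $α = (1, 1)$ the monomial $x_{1} x_{2}$ is symmetric, so $\partial_{1} (x_{1} x_{2}) = 0 .$ This is consistent with (2.10), which gives $\partial_{w_{0}} x^{α} = a_{δ}^{- 1} \sum_{w \in S_{n}} ε (w) w (x^{α}),$ provided $s_{α - δ}$ is read as this quotient (an assumption about the convention when $α - δ$ is not a partition): here the alternating sum is $x_{1} x_{2} - x_{1} x_{2} = 0 .$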

For $w \in S_{n},$ let $\overline{w} = w_{0} w w_{0} .$ Then

\begin{matrix} (2.12) & \partial_{\overline{w}} = ε (w) w_{0} \partial_{w} w_{0} . \end{matrix}

Proof.

From the definition of $\partial_{i}$ we have

w_{0} \partial_{i} w_{0} = - \partial_{n - i}

from which (2.12) follows easily, since $w_{0}^{2} = 1 .$

$□$
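
For $n = 2$ (an illustrative special case) we have $w_{0} = s_{1},$ and taking $w = s_{1}$ gives $\overline{w} = s_{1}$ and $ε (w) = - 1 .$ Then (2.12) reads $\partial_{1} = - s_{1} \partial_{1} s_{1},$ which can be confirmed from (2.3)(i):

s_{1} \partial_{1} s_{1} = \partial_{1} s_{1} = - \partial_{1} .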

If $f$ and $g$ are polynomials in $x_{1}, x_{2}, \dots,$ the expression of $\partial_{w} (f g)$ as a sum of polynomials $\partial_{u} f \cdot \partial_{v} g$ (i.e. the "Leibniz formula" for $\partial_{w})$ is in general rather complicated. However, there is one case in which it is reasonably simple, namely when one of the factors $f, g$ is linear:

(2.13) If $f = \sum α_{i} x_{i}$ then

\partial_{w} (f g) = w (f) \partial_{w} g + \sum (α_{i} - α_{j}) \partial_{w t_{i j}} g

summed over all pairs $i < j$ such that $ℓ (w t_{i j}) = ℓ (w) - 1,$ where $t_{i j}$ is the transposition that interchanges $i$ and $j .$

Proof.

Let $(a_{1}, \dots, a_{p})$ be a reduced word for $w .$ Since $f$ is linear it follows from (2.2) that

\begin{matrix} \partial_{w} (f g) & = & \partial_{a_{1}} \dots \partial_{a_{p}} (f g) \\ & = & s_{a_{1}} \dots s_{a_{p}} (f) \partial_{a_{1}} \dots \partial_{a_{p}} g + \sum_{r = 1}^{p} s_{a_{1}} \dots \partial_{a_{r}} \dots s_{a_{p}} (f) \partial_{a_{1}} \dots {\hat{\partial}}_{a_{r}} \dots \partial_{a_{p}} g . \end{matrix}

Now $\partial_{a_{1}} \dots {\hat{\partial}}_{a_{r}} \dots \partial_{a_{p}} = 0$ unless $(a_{1}, \dots, {\hat{a}}_{r}, \dots, a_{p})$ is reduced, and then by (1.11) it is equal to $\partial_{w t},$ where $w t = s_{a_{1}} \dots {\hat{s}}_{a_{r}} \dots s_{a_{p}}$ has length $p - 1 = ℓ (w) - 1,$ and $t = s_{a_{p}} \dots s_{a_{r}} \dots s_{a_{p}} = t_{i j}$ where $(i, j) = s_{a_{p}} \dots s_{a_{r + 1}} (a_{r}, a_{r} + 1),$ so that

s_{a_{1}} \dots s_{a_{r - 1}} \partial_{a_{r}} s_{a_{r + 1}} \dots s_{a_{p}} (f) = α_{i} - α_{j} .

$□$
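
In the simplest case $w = s_{1}$ (taken here as an illustration), the only pair $i < j$ with $ℓ (s_{1} t_{i j}) = 0$ is $(i, j) = (1, 2),$ and $\partial_{s_{1} t_{1 2}}$ is the identity operator. Thus (2.13) reads

\partial_{1} (f g) = s_{1} (f) \partial_{1} g + (α_{1} - α_{2}) g,

which agrees with (2.2), since $\partial_{1} f = α_{1} - α_{2}$ for $f = \sum α_{i} x_{i} .$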

We also introduce the operators $π_{i}$ $(i \geq 1)$ defined by

π_{i} f = \partial_{i} (x_{i} f) .

In place of (2.4) we have

\begin{matrix} (2.14) & \left\{ \begin{matrix} π_{i}^{2} = π_{i}, \\ π_{i} π_{j} = π_{j} π_{i} & \text{if } | i - j | > 1, \\ π_{i} π_{i + 1} π_{i} = π_{i + 1} π_{i} π_{i + 1} . \end{matrix} \right. \end{matrix}
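
The first of these relations can be checked directly (a short sketch, not spelled out in the source). By (2.3)(i) the polynomial $π_{i} f = \partial_{i} (x_{i} f)$ is symmetric in $x_{i}$ and $x_{i + 1},$ so (2.2') applies with $π_{i} f$ as the symmetric factor, and since $\partial_{i} (x_{i}) = 1$ we get

π_{i}^{2} f = \partial_{i} (x_{i} \cdot π_{i} f) = (π_{i} f) \partial_{i} (x_{i}) = π_{i} f .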

If we define $π_{a}$ to be $π_{a_{1}} \dots π_{a_{p}}$ for any sequence $a = (a_{1}, \dots, a_{p})$ of positive integers, then corresponding to (2.5) we have

(2.15) If $a, b \in R (w)$ then $π_{a} = π_{b} .$

The proof is the same as that of (2.5), and rests only on the second and third of the relations (2.14). From (2.15) it follows that we may define

π_{w} = π_{a}

unambiguously, where $a$ is any reduced word for $w .$

In place of (2.10) we have

(2.16) For any $f \in P_{n},$

π_{w_{0}} f = a_{δ}^{- 1} \sum_{w \in S_{n}} ε (w) w (x^{δ} f) = \partial_{w_{0}} (x^{δ} f) .

In particular, if $α \in ℕ^{n},$

\begin{matrix} (2.16') & π_{w_{0}} x^{α} = s_{α} (x_{1}, \dots, x_{n}) . \end{matrix}

Proof.

We have

\begin{matrix} π_{1} f = \partial_{1} (x_{1} f), \\ π_{1} π_{2} f = \partial_{1} (x_{1} \partial_{2} (x_{2} f)) = \partial_{1} \partial_{2} (x_{1} x_{2} f) \end{matrix}

and generally

π_{1} \dots π_{r} f = \partial_{1} \dots \partial_{r} (x_{1} \dots x_{r} f)

for each $r \geq 1 .$ From this and (2.10) it follows easily that $π_{w_{0}} f = \partial_{w_{0}} (x^{δ} f) .$

$□$
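
As an illustrative special case of (2.16'), take $n = 2,$ so that $π_{w_{0}} = π_{1} .$ Then

π_{1} (x_{1}^{2}) = \partial_{1} (x_{1}^{3}) = x_{1}^{2} + x_{1} x_{2} + x_{2}^{2} = s_{(2, 0)} (x_{1}, x_{2}),

π_{1} (x_{1} x_{2}) = \partial_{1} (x_{1}^{2} x_{2}) = x_{1} x_{2} = s_{(1, 1)} (x_{1}, x_{2}) .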

Let $(a_{1}, \dots, a_{p})$ be a reduced word for $w .$ Then

\begin{matrix} \partial_{w} & = & \partial_{a_{1}} \dots \partial_{a_{p}} \\ & = & {(x_{a_{1}} - x_{a_{1} + 1})}^{- 1} (1 - s_{a_{1}}) {(x_{a_{2}} - x_{a_{2} + 1})}^{- 1} (1 - s_{a_{2}}) \dots \end{matrix}

which shows on expansion that $\partial_{w}$ is of the form

\partial_{w} = \sum_{v \leq w} f_{v w} v

where $f_{v w}$ are rational functions of $x_{1}, x_{2}, \dots,$ and in particular (by (1.7))

f_{w w} = {(- 1)}^{p} \prod_{(i, j) \in I (w^{- 1})} {(x_{i} - x_{j})}^{- 1}

and thus is $\neq 0 .$ It follows that the $\partial_{w}$ are linearly independent over the field of rational functions $ℚ_{\infty} = ℚ (x_{1}, x_{2}, \dots) .$

Now from (2.2) we have

\partial_{a} (f g) = (\partial_{a} f) g + (s_{a} f) (\partial_{a} g)

or equivalently, if $μ : P_{\infty} \otimes P_{\infty} \to P_{\infty}$ is the multiplication map,

\partial_{a} \circ μ = μ \circ (\partial_{a} \otimes 1 + s_{a} \otimes \partial_{a}) .

From this it follows that

\partial_{w} \circ μ = μ \circ (\partial_{a_{1}} \otimes 1 + s_{a_{1}} \otimes \partial_{a_{1}}) \circ \dots \circ (\partial_{a_{p}} \otimes 1 + s_{a_{p}} \otimes \partial_{a_{p}})

On expansion this is a sum over subsequences $b$ of $a = (a_{1}, \dots, a_{p}),$ say

\begin{matrix} (1) & \partial_{w} \circ μ = μ \circ \sum_{b \subset a} ϕ (a, b) \otimes \partial_{b} \end{matrix}

where

ϕ (a, b) = ϕ_{1} (a, b) \circ \dots \circ ϕ_{p} (a, b)

and

ϕ_{i} (a, b) = \left\{ \begin{matrix} s_{a_{i}} & \text{if } a_{i} \in b, \\ \partial_{a_{i}} & \text{if } a_{i} \notin b . \end{matrix} \right.

Since $\partial_{b} = 0$ if $b$ is not reduced (2.6), the sum is over reduced subsequences $b$ of $a,$ and by (1.17) we can write

\begin{matrix} (2) & \partial_{w} \circ μ = μ \circ \sum_{v \leq w} v \partial_{w / v} \otimes \partial_{v} \end{matrix}

where for $v \leq w$

\begin{matrix} (3) & \partial_{w / v} = v^{- 1} \sum ϕ (a, b) \end{matrix}

summed over subsequences $b \subset a$ such that $b$ is a reduced word for $v .$

So for each pair of permutations $w, v$ such that $w \geq v$ we have a well-defined operator $\partial_{w / v}$ on $P_{\infty},$ defined by (3). Since the $\partial_{v}$ are linearly independent, the definition (3) is independent of the reduced word $a \in R (w) .$

(2.17) For each pair $w, v \in S_{\infty}$ such that $w \geq v$ there is a linear operator $\partial_{w / v}$ on $P_{\infty}$ such that

\partial_{w} (f g) = \sum_{v \leq w} v (\partial_{w / v} f) \cdot \partial_{v} g .

$\partial_{w / v}$ has degree $- ℓ (w) + ℓ (v) .$

Examples

Let $v = w,$ then

\partial_{w / w} = w^{- 1} ϕ (a, a) = w^{- 1} s_{a_{1}} \dots s_{a_{p}} = 1 .

Let $v = 1,$ then

\partial_{w / 1} = ϕ (a, \emptyset) = \partial_{a_{1}} \dots \partial_{a_{p}} = \partial_{w} .

Suppose that $v \to w,$ so that $v = s_{a_{1}} \dots {\hat{s}}_{a_{r}} \dots s_{a_{p}}$ for a unique $r \in [1, p] .$ Then $b = (a_{1}, \dots, {\hat{a}}_{r}, \dots, a_{p})$ and

\begin{matrix} \partial_{w / v} & = & v^{- 1} ϕ (a, b) \\ & = & v^{- 1} s_{a_{1}} \dots s_{a_{r - 1}} \partial_{a_{r}} s_{a_{r + 1}} \dots s_{a_{p}} \\ & = & s_{a_{p}} \dots s_{a_{r + 1}} \partial_{a_{r}} s_{a_{r + 1}} \dots s_{a_{p}} . \end{matrix}

Now $w = v t$ where $t$ is the transposition

t = t_{i j} = s_{a_{p}} \dots s_{a_{r}} \dots s_{a_{p}} \quad (i < j)

so that $(i, j) = s_{a_{p}} \dots s_{a_{r + 1}} (a_{r}, a_{r} + 1)$ and therefore

\begin{matrix} \partial_{w / v} & = & s_{a_{p}} \dots s_{a_{r + 1}} {(x_{a_{r}} - x_{a_{r} + 1})}^{- 1} (1 - s_{a_{r}}) s_{a_{r + 1}} \dots s_{a_{p}} \\ & = & {(x_{i} - x_{j})}^{- 1} (1 - t_{i j}) \end{matrix}

is the divided difference operator $\partial_{x_{i}, x_{j}} .$
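
As a further illustration (a worked case not in the original text), let $w = s_{1} s_{2}$ with reduced word $a = (1, 2) .$ The subsequences $b = (1, 2), (1), (2), \emptyset$ are reduced words for $v = w, s_{1}, s_{2}, 1$ respectively, and (3) yields

\partial_{w / w} = 1, \quad \partial_{w / s_{1}} = s_{1}^{- 1} s_{1} \partial_{2} = \partial_{2}, \quad \partial_{w / s_{2}} = s_{2} \partial_{1} s_{2} = \partial_{x_{1}, x_{3}}, \quad \partial_{w / 1} = \partial_{1} \partial_{2} .

Hence in this case (2.17) reads

\partial_{1} \partial_{2} (f g) = (s_{1} s_{2} f) (\partial_{1} \partial_{2} g) + (s_{1} \partial_{2} f) (\partial_{1} g) + (\partial_{1} s_{2} f) (\partial_{2} g) + (\partial_{1} \partial_{2} f) g,

which is also what two successive applications of (2.2) give.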

The product formula for $\partial_{w / u}$ is

\begin{matrix} (2.18) & \partial_{w / u} (f g) = \sum_{u \leq v \leq w} u^{- 1} v (\partial_{w / v} f) \partial_{v / u} g . \end{matrix}

Proof.

We have

\begin{matrix} \partial_{w} (f g h) = \sum_{u \leq w} u \partial_{w / u} (f g) \partial_{u} h & (1) \end{matrix}

and on the other hand

\begin{matrix} \partial_{w} (f g h) & = & \sum_{v \leq w} v \partial_{w / v} (f) \partial_{v} (g h) & \\ & = & \sum_{u \leq v \leq w} v \partial_{w / v} (f) \cdot u \partial_{v / u} (g) \cdot \partial_{u} h . & (2) \end{matrix}

Comparison of (1) and (2) gives

u \partial_{w / u} (f g) = \sum_{u \leq v \leq w} v \partial_{w / v} (f) \cdot u \partial_{v / u} (g)

which gives the result.

$□$

When $u = 1,$ this reduces to (2.17).
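
As a consistency check (an illustration, under the assumption that $u \to w$ with $w = u t_{i j}),$ the only $v$ with $u \leq v \leq w$ are $v = u$ and $v = w .$ Using $\partial_{u / u} = \partial_{w / w} = 1$ and $\partial_{w / u} = \partial_{x_{i}, x_{j}}$ from the Examples, together with $u^{- 1} w = t_{i j},$ the sum $\sum_{u \leq v \leq w} u^{- 1} v (\partial_{w / v} f) \partial_{v / u} g$ has just two terms, and the product formula becomes

\partial_{x_{i}, x_{j}} (f g) = (\partial_{x_{i}, x_{j}} f) g + (t_{i j} f) (\partial_{x_{i}, x_{j}} g),

which is the rule (2.2) for the variables $x_{i}$ and $x_{j} .$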

Notes and References

This is a typed excerpt of the book Notes on Schubert Polynomials by I. G. Macdonald.
