
SECTION 8.7 MULTIVECTOR DIFFERENTIATION 235
We first evaluate from the right, so we start with the directional differentiation
of G(y) = y
2
. For a general vector z, the directional derivative (z ∗ ∂
y
) y
2
= 2 z · y,
so with z = y(x) the result is 2 y(x) ·y = 2(x · a)(b ·y). Note that we kept y.Inthe
second step, this expression needs to be differentiated to x, giving
∂
x
2(x · a)(b ·
y)
= 2 a (b · y). That is the answer, but we prefer it in terms of x, so we should
substitute the expression for y in terms of x, giving the same result as before.
If instead we had evaluated from the left, we would first need to evaluate
`
∂
x
y(
`
x) ∗
∂
y
=
∂
x
(x ·a) b
∗
∂
y
= a (b ∗∂
y
). Do not be bothered by the presence
of
∂
y
in this derivation; since it is not differentiating anything, it behaves just like
a vector. Now we apply the resulting operator to G(y) = y
2
, giving 2 a (b · y) as in
the other evaluation order. Here, too, you would need to substitute the expression
y(x) to get the result in terms of x.
The operator we just evaluated can be rewritten using the definition of the
adjoint of the function y(x) = (x · a) b,whichis
y(x) = (x · b) a.Wethenrec-
ognize a (b ∗
∂
y
) as the adjoint of the y-function applied on ∂
y
, i.e.,
y[∂
y
].Wecan
also use the adjoint to write the actual answer for our differentiation of the squar-
ing function G as 2
y(y(x )), which actually holds for any function y usedtowrap
the argument x.
The implicit understanding of how to deal with the substitutions in the equation is a bit
cumbersome. A more proper notation for the process may be to keep the x in there at all
steps:
∂
x
G(y(x)) =
`
∂
x
y(
`
x) ∗
∂
y(x)
G(y(x)) =
y[∂
y(x)
] G(y(x)). (8.15)
The final rewriting uses the differential definition of the adjoint of (8.14) (which also
holds for nonlinear vector functions y). This usage was motivated in the example. It
means that we treat the differentiation operator
∂
y(x)
just as the vector it essentially is.
Then the differentiation with respect to y(x) should be understood as above, but the
lack of an accent denotes that that particular x-dependence should not be differentiated
by
`
∂
x
.
So in the end, the chain rule is essentially a transformation of the differentiation opera-
tor: when an argument gets wrapped into a function, the differentiation with respect to that
argument gets wrapped into the adjoint of that function.
8.7 MULTIVECTOR DIFFERENTIATION
We can extend these forms of differentiation beyond vectors to general multivectors,
though for geometric algebra, the extension to differentiation with respect to blades and
versors is most useful. Another extension is the differentiation with respect to a linear
function of multivectors, which finds uses in optimization. We will not treat that here,
but refer to Chapter 11 in [15].