0% found this document useful (0 votes)
36 views4 pages

Lecture 8

Den guo shen

Uploaded by

Ciprian Coman
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views4 pages

Lecture 8

Den guo shen

Uploaded by

Ciprian Coman
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

General Relativity Fall 2019

Lecture 8: covariant derivatives


Yacine Ali-Haı̈moud
September 26th 2019

METRIC IN NON-COORDINATE BASES

Last lecture we defined the metric tensor field g as a “special” tensor field, used to convey notions of infinitesimal
spacetime “lengths”. In a coordinate basis, we write ds2 = gµν dxµ dxν to mean g = gµν dx(µ) ⊗ dx(ν) . While we will
mostly use coordinate bases, we don’t always have to. In a non-coordinate basis, we would write explicitly
g = gµν e∗(µ) ⊗ e∗(ν) .
Let us consider for example flat 3-D space, in which the line element is
d`2 = dx2 + dy 2 + dz 2 = dr2 + r2 dθ2 + r2 sin2 θdϕ2
in cartesian and spherical polar coordinates, respectively. In homework 3, we assumed there existed some scalar
product h..i on the space of vectors; well, this scalar product is just the metric: hV , W i ≡ g(V , W ), and
||V ||2 ≡ g(V , V ). We can read off the norms of the coordinate basis vectors from the line element:
||∂(r) ||2 = 1, ||∂(θ) ||2 = r2 , ||∂(ϕ) ||2 = r2 sin2 θ.
Thus the unit-norm vectors along the coordinate basis vectors are
1 1
e(r) ≡ ∂(r) , e(θ) ≡ ∂(θ) , e(ϕ) ≡ ∂(ϕ) .
r r sin θ
The dual basis vectors are then
e∗(r) = dr, e∗(θ) = r dθ, e∗(ϕ) = r sin θ dϕ.
Indeed, you can check explictly that these vectors satisfy e∗(µ) · e(ν) = δνµ – again, the “·” operation represents the
action of dual vectors on vectors, not the scalar product. In this non-coordinate basis, the line element is then
d`2 ≡ g = e∗(r) ⊗ e∗(r) + e∗(θ) ⊗ e∗(θ) + e∗(ϕ) ⊗ e∗(ϕ) .

RAISING AND LOWERING INDICES WITH THE METRIC AND ITS INVERSE

We define the rank-(2, 0) inverse metric tensor field g αβ such that g αβ gβγ = δγα .

We saw last week that there isn’t any basis-independent mapping from Vp to Vp∗ (in contrast to the basis-independent
mapping between Vp and Vp∗∗ ). Now that we have a special tensor gαβ , we can use it and its inverse to go from Vp
to Vp∗ , without having to define a basis. Specifically, given a vector X α , we define the dual vector Xα (using the same
letter!)

Xα ≡ gαβ X β .

Similarly, given a dual vector Yα , we may define the vector Y α

Y α ≡ g αβ Yβ .

You can check that these two definitions are self-consistent. More generally, we can use the metric to define a rank
(k − 1, l + 1) tensor field from a rank-(k, l) tensor field, and the inverse metric to do the opposite. For instance, we
have
T αγ βδ ≡ gγρ gδλ T αρβλ .
You see that the position of the up and down indices matters. For instance,
T αβ γδ − T αγ βδ = gγρ gδλ T αβρλ − gγρ gδλ T αρβλ = gγρ gδλ T αβρλ − T αρβλ =

6 0 in general,
unless the rank-(4,0) tensor T αβγδ is symmetric in its middle two indices.
2

COVARIANT DERIVATIVES

Given a scalar field f , i.e. a smooth function f – which is a tensor of rank (0, 0), we have already defined the
dual vector ∇α f . We saw that, in a coordinate basis,
∂f
V α ∇α f = V µ ≡ ∇V f
∂xµ
gives the directional derivative of f along V .
We now want to generalize this idea of directional derivative to tensor fields of arbitrary rank, and we want to do
so in a geometric, basis-independent way. We cannot just recklessly take derivatives of a tensor’s components:
partial derivatives of components do not transform as tensors under coordinate transformations. Indeed,
given a vector field V α , under a coordinate transformation, the partial derivatives of its components transform as
0
" 0
# 0 0
∂V µ ∂xν ∂ ∂xµ µ ∂xµ ∂xν ∂V µ ∂xν ∂ 2 xµ
= V = + V µ.
∂xν 0 ∂xν 0 ∂xν ∂xµ ∂xµ ∂xν 0 ∂xν ∂xν 0 ∂xν ∂xµ

The presence of the second piece means that the partial derivatives of the components do not transform as a tensor.
We thus need to correct correct for this: we have to find some non-tensor coefficients Γµνσ such that

∇ν V µ ≡ V µ;ν ≡ ∂ν V µ + Γµνσ V σ

as a whole transforms as a tensor. These coefficients will thus transform as


0 0
0 ∂xµ ∂xν ∂xσ µ ∂xµ ∂ 2 xµ
Γµν 0 σ0 = 0 0 Γνσ + . (1)
µ ν
∂x ∂x ∂x σ ∂xµ ∂xν 0 ∂xσ0

The proof that the combination ∂ν V µ + Γµνσ V σ does indeed transform as a tensor is identical to Exercise 1 (iii) in
homework 2.

• Axiomatic definition. We will define a covariant derivative in an axiomatic way – we will see that this definition
does not uniquely specify the covariant derivative yet. Given a tensor T of rank (k, l), we define the tensor ∇T of
rank (k, l + 1), denoted as follows

(∇T )α1 ...αk β1 ...βl γ ≡ ∇γ T α1 ...αkβ1 ...βl ≡ T α1 ...αkβ1 ...βl ;γ ,

satisfying the following properties:

(i) the operator ∇ is linear, i.e. for two tensors T , S of equal rank (and the same index structure), and two real
numbers a, b, ∇(aT + bS) = a∇T + b∇S.

(ii) for a scalar field f , ∇f is just the usual gradient.

(iii) it satisfies Leibniz’s rule: given two tensors T , S of arbitrary ranks (not necessarily equal),

∇(T ⊗ S) = ∇T ⊗ S + T ⊗ ∇S,

(iv) it commutes with contractions: given a rank-(1, 1) tensor T αβ - or more generally, a tensor of rank
(k ≥ 1, l ≥ 1),

∇δ (T αα ) = ∇δ (δβα T β α ) = δβα (∇T )α βδ = (∇T )α αδ .

In words, we are requiring that the gradient of a scalar field (which is itself a contraction) is equal to the contraction
of the rank-(1,2) tensor (∇T )α βγ in its first two indices, which is a non-trivial requirement.

• Connection coefficients. Let us now apply our axiomatic definition to the covariant derivative of a vector
field. Suppose that we are given a coordinate basis {∂(µ) } that is smoothly defined around a neighborhood, so that
each one of the basis vectors is a smooth vector field. The dual basis {dx(µ) } will then also be defined around some
neighborhood, and consist of smooth dual vector fields. Now consider a vector field V = V µ ∂(µ) . The components of
3

V are V µ = V · dx(µ) , and are thus n smooth scalar fields. We can thus formally see the expression V = V µ ∂(µ)
as a sum of tensor products of rank-(0,0) tensors (the components) with rank (1, 0) tensors: V = V µ ⊗ ∂(µ) .
Let us now compute a covariant derivative of V , which is a rank (1, 1) tensor. Using Leibniz’s rule, we get

∇V = ∇(V µ ⊗ ∂(µ) ) = (∇V µ ) ⊗ ∂(µ) + V µ ⊗ (∇∂(µ) ).

By construction, any covariant derivative must just give the usual gradient when applied to scalar fields, thus ∇V µ =
∂ν V µ dx(ν) . Since ∇∂(µ) is a rank (1, 1) tensor, we may find its components on the basis {dx(ν) ⊗ ∂(σ) }:

∇∂(µ) = Γσνµ dx(ν) ⊗ ∂(σ) .

This defines the connection coeefficients Γσνµ . We thus found

∇V = ∂ν V µ dx(ν) ⊗ ∂(µ) + Γσνµ V µ dx(ν) ⊗ ∂(σ) = (∂ν V µ + Γµνσ V σ ) dx(ν) ⊗ ∂(µ) ,

after renaming dummy indices. Thus we have

∇ν V µ ≡ V µ;ν = ∂ν V µ + Γµνσ V σ .

You see that the connection coefficients “connect” the covariant derivative to the partial derivative.

• Covariant derivative of a dual vector field. Consider a dual vector field Wα . For any vector field V α , the
contraction V α Wα is a scalar field. Thus, in a coordinate basis,

∇ν (V µ Wµ ) = ∂ν (V µ Wµ ) = (∂ν V µ )Wµ + V µ (∂ν Wµ ),

per property (ii) of a covariant derivative, followed by Leibniz’s rule for the usual partial derivative.
On the other hand, from property (iii) combined with (iv), we also have

∇ν (V µ Wµ ) = (∇ν V µ )Wµ + V µ (∇ν Wµ ) = (∂ν V µ )Wµ + Γµνσ V σ Wµ + V µ (∇ν Wµ ).

Equating the two, and renaming dummy indices, we get

V µ ∇ν Wµ + Γσνµ Wσ = V µ ∂ν Wµ .


This equality must hold for any vector field V α . Thus we found the components of the rank-(0, 2) tensor ∇W :

∇ν Wµ ≡ Wµ;ν = ∂ν Wµ − Γσνµ Wσ .

Note the minus sign!

• Covariant derivative of a general tensor field. It is straightforward to generalize this to arbitrary tensor
fields – you’ll do this explicitly in the homework for rank-(0,2) and rank(1,1):

µ1 ···µk−1 λ
∇σ T µ1 ···µk ν1 ···νl = ∂σ T µ1 ···µk ν1 ···νl + Γµσλ1 T λµ2 ···µkν1 ···νl + · · · Γµσλk T ν1 ···νl

− Γλσν1 T µ1 ···µk λν2 ···νl − λ


· · · Γσνl T µ1 ···µk
ν1 ν2 ···νl−1 λ

In words, the covariant derivative is the partial derivative plus k + l “corrections” proportional to a connection
coefficient and the tensor itself, with a plus sign for all upper indices, and a minus sign for all lower indices.

THE TORSION-FREE, METRIC-COMPATIBLE COVARIANT DERIVATIVE

The properties that we have imposed on the covariant derivative so far are not enough to fully determine it.
In fact, there is an infinite number of covariant derivatives: pick some coordinate basis, chose the 43 = 64
connection coefficients in this basis as you wis. This thus defines the covariant derivative in this basis, hence defines
in it any basis since ∇T is a tensor – all you have to do is transform components from the original basis to any other
4

basis. We will impose two more conditions on the covariant derivative to fully specify it.

• Torsion-free. From the transformation law of connection coefficients, you see that the non-tensorial part (the
second derivative) drops out of the antisymmetric component:
0
0 ∂xµ ∂xν ∂xσ µ
Γµ[ν 0 σ0 ] = Γ .
∂xµ ∂xν 0 ∂xσ0 [νσ]
Thus, while Γµνσ is not a tensor, its antisymmetric part (in the lower two indices) Γµ[νσ] is indeed a tensor. This
is called the torsion tensor field. We will only consider covariant derivatives for which the torsion tensor field
vanishes,

Γµ[νσ] = 0 .

The most straightforward implication is the commutation of double covariant derivatives of scalars:

∇µ ∇ν f − ∇ν ∇µ f = ∂µ (∇ν f ) − Γσµν ∇σ f − ∂ν (∇µ f ) + Γσνµ ∇σ f = ∂µ (∇ν f ) − ∂ν (∇µ f ) + 2Γσ[νµ] ∇σ f.

Now ∇µ f = ∂µ f in a coordinate basis. Thus the two first terms cancel out, and we see that

∇µ ∇ν f − ∇ν ∇µ f = 2Γσ[νµ] ∇σ f = 0 .

• Metric-compatible. We want the covariant derivative to just recover the usual partial derivative in LICS. Thus
we want to impose that Γµνσ = 0 in a LICS. Note that if this is true in one LICS around p ∈ M, it is true in all
other LICS, which are just related by a Lorentz transformation. Indeed,
0
for such transformations,
0
the non-tensor 0part
(the second derivatives) in the transformation (1) vanishes, since xµ = const + Λµ µ xµ . This means that, if {xµ } is
a LICS and {xµ } is a general coordinate system, we have
0
∂xµ ∂ 2 xµ 0
Γµνσ = , {xµ } LICS.
∂xµ0 ∂xν ∂xσ
You already showed in homework 2 that this implies that the connection coefficients are just equal to the Christoffel
symbols:

1 µλ
Γµνσ = g (gνλ,σ + gσλ,ν − gνσ,λ ) .
2

The fact that LICS are tied to the metric tensor ties the connection, hence covariant derivative to the metric tensor.
Another, equivalent way to arrive at the same conclusion, is to require that

∇σ gµν = 0 .

You will show in the homework that this requirement indeed uniquely specifies the connection to be equal to the
Christoffel symbols. We will see in a little what this requirement means in an intuitive way. The connection of the
metric-compatible, torsion-free covariant derivative is also called the Levi-Civita connection.
Note: if we wanted to keep only metric-compatibility, but have a non-zero torsion, we could impose that Γµ(νσ) = 0
in a LICS. Equivalently, ∇µ gνσ = 0 only determines the symmetric part of the connection coefficients. So torsion-
free and metric compatible are independent conditions on the antisymmetric and symmetric parts of the connection
coefficients, respectively.

You might also like