Mennucci. Metrics of Curves in Shape Optimization and Analysis (2011) (95s)
Mennucci. Metrics of Curves in Shape Optimization and Analysis (2011) (95s)
analysis
A. Mennucci1
Lectures at Scuola Normale Superiore,
Feb 2011
Contents
2 Shapes in applications 9
2.1 Images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 Shape Optimization & Active Contours . . . . . . . . . . . . . . 10
2.3 Geometric Active Contour Method . . . . . . . . . . . . . . . . . 12
2.4 Shape Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.5 Problems & Goals . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1
4.3 Geometric curves, and functionals . . . . . . . . . . . . . . . . . 33
4.4 The two categories of “Shape Spaces” . . . . . . . . . . . . . . . 34
4.5 Geometric curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
4.5.1 Bi,f is a Fréchet manifold . . . . . . . . . . . . . . . . . . 35
4.6 Goals (revisited) . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
6 Metrics of sets 39
6.1 Some more maths on distance and geodesics . . . . . . . . . . . . 39
6.2 Hausdorff metric . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
6.3 A Hausdorff like distance of compact sets . . . . . . . . . . . . . 42
2
V Riemannian manifolds of curves 75
12 Horizontality 77
12.1 Horizontal and vertical space . . . . . . . . . . . . . . . . . . . . 77
12.2 From curve-wise p.i to homotopy-wise p.i . . . . . . . . . . . . . 79
13 Momenta 81
14 Existence of Geodesics 83
14.1 The H 0 distance is degenerate . . . . . . . . . . . . . . . . . . . 83
14.2 (Critical) geodesics for H j . . . . . . . . . . . . . . . . . . . . . . 84
14.3 The direct method . . . . . . . . . . . . . . . . . . . . . . . . . . 84
14.4 Winding and pulley . . . . . . . . . . . . . . . . . . . . . . . . . 85
14.5 Curvature(s) as measures . . . . . . . . . . . . . . . . . . . . . . 86
14.6 Existence of geodesics for H 0 . . . . . . . . . . . . . . . . . . . . 86
3
Start of lecture: I. The matter at hand
Part I
The matter at hand
1 Shapes & Curves
What is this course about? We begin by summarizing in a simpler form the
definitions, review the goals, and infer what are the needed mathematical tools.
1.1 Shapes
1. Examples
Some examples of “Shape Spaces”.
• The family of all collections of k points in lRn .
• The family of all non empty compact subsets of lRn .
• The family of all closed curves in lRn .
This is the shape space that we will study in the main part of these
lectures.
• Other: surfaces, measures, metric spaces, etc.
2. Shape as a variable
In all of the above models, shapes are collected in a Shape Space M , where
the shape is a point, i.e. a variable x ∈ M .
A tentative and vague list of operations.
calculation/calculus → math. analysis , differential geometry
nearness → topology, distance
optimization → analysis, topology, differential geometry...
categorization →...distance...
compression (reduce
complexity/dimensionality) → ...quantization...PCA...
estimation of unobservables → probability/statistics
3. Computer vision
A more precise list of applications of Shape Theory in Computer Vision.
There are two different (but interconnected) fields.
Shape Optimization where we want to find the shape that best satisfies a
design goal; a topic of interest in Engineering at large;
e.g. segmentation, tracking
Shape Analysis where we study a family of Shapes for purposes of statistics,
(automatic) cataloging, probabilistic modeling, among others, and possi-
bly create an a-priori model for a better Shape Optimization.
4
1.2 Curves
4. Curves
(Most of the theory following will be developed for curves in lRn , when this does
not complicate the math).
S 1 = {x ∈ lR2 | |x| = 1} is the circle in the plane. It is the template for all
possible (closed) curves. (Open curves will be called paths).
A C 1 curve is a continuously differentiable map c : S 1 → lRn such that
0
c (θ) := ∂θ c(θ) exists at all points θ ∈ S 1 .
Definition 1.1. An immersed curve is a C 1 curve c such that c0 (θ) 6= 0 at
all points θ ∈ S 1 .
: S1 →
c c(S 1 )
7→
c(S 1 ) is the image, or trace, of the curve.
5. Manifold of curves
For the purposes of this introduction, we define the manifold of (paramet-
ric) curves M to be the set of all closed immersed curves. It is an open subset
of all C 1 curves (so it is manifold). Let’s see what this means, practically.
Suppose that c : S 1 → lRn is a closed immersed curve.
A deformation of c is a function h : S 1 → lRn .
The set of all such h is the tangent space Tc M of M at c.
An infinitesimal deformation of the curve c0 in “direction” h will yield (on
first order) the curve c0 (u) + εh(u).
A homotopy C connecting c0 to c1 is a continuous function
h C : [0, 1] × S 1 → S 1 such that c0 (θ) = C(0, θ) and c1 (θ) =
c1 C(1, θ).
M c0 By defining γ(t) = C(t, ·) we can associate C to a path
γ : [0, 1] → M in the space of curves M , connecting c0
to c1 .
When dealing with homotopies C = C(t, θ), we will use the “prime notation”
C 0 for the derivative ∂θ C in the “curve parameter” θ, and the “dot notation” Ċ
for the derivative ∂t C in the “time parameter” t.
5
(We will sometimes also write Ds instead of ∂s .)
Definition 1.3. At all points where c0 (θ) 6= 0, we define the tangent vector
c0 (θ)
T (θ) = = ∂s c(θ) .
|c0 (θ)|
10. Length
Definition 1.4. The length of the curve c is
Z
len(c) := |c0 (θ)| dθ . (1)
S1
ds := |c0 (θ)| dθ ;
1.2.2 Curvature(s)
There are two different definitions of curvature of an immersed curve: mean
curvature H and signed curvature κ, which is defined when c is valued in lR2 .
H and k are extrinsic curvatures, they are properties of the embedding of c
into lRn .
13. Curvature H
Definition 1.6 (H). If c is C 2 regular and immersed, we can define the (mean)
curvature H of c as
H = ∂s ∂s c = ∂s T
1
where ∂s = |c0 | ∂θ is the derivation with respect to the arc parameter.
6
• 1/|H| is the radious of a tangent circle that best approximates the curve
to second order.
14. Curvature κ
Let c be an immersed curve.
Definition 1.7 (N). When the curve c is in lR2 we can define a normal vector
N to the curve, by requiring that |N | = 1, N ⊥ T and N is rotated π/2 degree
anticlockwise with respect to T .
Definition 1.8 (κ). If c is in lR2 and C 2 , then we can define a signed scalar
curvature κ = hH, N i, so that
∂s T = κN = H and ∂s N = −κT .
κ>0
H N
N
H
H
N
κ<0
Figure 1: curvature
7
In all above cases, the evaluated operator O(c) is itself a function; for this
reason, to avoid confusion in the notation, in this proposition we write the
Gâteaux derivation as Dc,h O instead of DO(c; h). These rules following are
the building blocks that are used (by means of standard calculus rules for deriva-
tives) to compute the derivation of all other geometric operators usualy found
in computer vision applications.
Proposition 1.9.
since (Dss c · Ds c) = 0.
8
2 Shapes in applications
A number of methods have been proposed in Shape Analysis to define distances
between shapes, averages of shapes and optimal morphings between shapes. At
the same time, there has been much previous work in Shape Optimization, for
example Image Segmentation via Active Contours, 3D Stereo Reconstruction via
Deformable Surfaces; in these later methods, many authors have defined Energy
Functionals E(c) on curves (or on surfaces), whose minima represent the de-
sired Segmentation/Reconstruction; and then utilized the Calculus of Variations
to derive curve evolutions to search minima of E(c), often referring to these
evolutions as Gradient Flows. The reference to these flows as gradient flows
implies a certain Riemannian metric on the space of curves; but this fact has
been largely overlooked. We call this metric H 0 , and define it by
Z
hh, kiH 0 := hh(s), k(s)i ds
S1
2.1 Images
20. Cosa è un’immagine
Prendiamo una
immagine:
9
Ingrandiamo molto un dettaglio: notiamo
dei “quadratini”, che sono detti “pixel”.
Ogni pixel P è identificato da un’ascissa i
e un’ordinata j nel piano dello schermo.
verde
Tutti i colori che si possono visualizzare G giallo
sono contenuti in un cubo. Il vertice
(0, 0, 0) rappresenta il nero, il (255, 255, 255) grigio
blu
il bianco. B magenta
10
Active contours, introduced by [Kass et al., 1987], have been widely used for the
segmentation problem. The idea is to minimize an energy, defined on contours
or curves, that contains an image based edge attraction term and a smoothness
term, which becomes large when the curve is irregular. An evolution is derived to
minimize the energy based on principles from the calculus of variations. There
have been many variations to original model of [Kass et al., 1987]; for example
[Cohen, 1991], and a survey in [Blake and Isard, 1998]. An unjustified feature
of the model of [Kass et al., 1987] is that the evolution is dependent on the way
the contour is parameterized. Hence there have been geometric evolutions sim-
ilar to the idea of [Kass et al., 1987] in [Caselles et al., 1993; Malladi et al.,
1995], which can be implemented by level set methods [Osher and Sethian, 1988].
Thereafter, [Kichenassamy et al., 1995] [Caselles et al., 1995] considered min-
imizing a geometric energy, which is a generalization of Euclidean arclength,
defined on curves for the edge-detection problem. The authors derived the gra-
dient descent flow in order to minimize the geometric energy. In the works of
[Siddiqi et al., 1998] and [Xu and Prince, 1998], the authors build on the active
contour edge-detection approach to segmentation.
In contrast to the edge-based approaches for active contours (mentioned above),
region-based energies [Ronfard, 1994] [Zhu et al., 1995] [Yezzi et al., 1999]
[Chan and Vese, 2001] for active contours have provided many desirable fea-
tures; for example, they provide less sensitivity to noise, better ability to cap-
ture concavities of objects, more dependence on global features of the image,
and less sensitivity to the initial contour placement. In these approaches, an
energy is designed to be minimized when the curve partitions the image into
statistically distinct regions. In [Mumford and Shah, 1985, 1989], the authors
introduce and rigorously study a region-based energy that is designed to both
extract the boundary of distinct regions while also smoothing the image within
these regions. Subsequently, [Tsai et al., 2001b] [Vese and Chan, 2002] gave a
curve evolution implementation of minimizing the energy functional considered
by Mumford&Shah in a level set framework [Osher and Sethian, 1988]. Another
work that makes use of a region-based energy is [Paragios and Deriche, 2002a,b],
which also combines edge-based information. Like the edge-based methods, the
gradient descent flows are calculated to minimize these energies.
More recent works that use active contours for segmentation are not only
based on information from the image to be segmented (edge-based or region-
based), but also a-priori information known about the shape of the desired object
to be segmented. The work of [Leventon et al., 2000] showed how to incorporate
prior information into the active contour paradigm. Subsequently, there have
been a number of works, for example [Tsai et al., 2001a; Rousson and Paragios,
2002; Chen et al., 2002; Cremers and Soatto, 2003; Raviv et al., 2004], which
design energy functionals that incorporate prior shape information of the desired
object. In these works, the main idea is designing a prior term of the energy
that is small when the curve is close, in some sense, to a pre-specified shape.
27. Energies in Computer Vision
A variational approach to solving Computer Vision problems: minimizing
geometric energy functionals F (c) where c is a curve enclosing the region R.
11
R2
• Boundary Based Energies:
Z
F (c) = g(c(s)) ds
c
ds=arclength element
c
Rc R • Region Based Energies:
Z Z
F (c) = fin (x) dx + fout (x) dx
R Rc
dx=area element
R and Rc are the interior and exterior areas.
28. Examples of geometric energy functionals for segmentation
I : Ω → [0, 1] is the image (in black & white).
• Chan-Vese
R2
Z 2
F (c) = I(x) − avgR (I) dx +
ZR 2
c
+ I(x) − avgRc (I) dx
Rc R
Rc
Z
1
where avgR I = I(x)dx is the average
|R| R
on the region.
• Geodesic “Edge detector”
Z
1
F (c) = g(c(s)) ds where g(x) =
c 1 + |∇I(x)|2
12
4. Consider v to be the gradient, the direction which increases E fastest.
5. Evolve ∂t c = −v; this is called the gradient descent flow.
(The part of the flow tangent to the curve is always zero – this will be
properly explained in the fourth lecture, in slide 247).
4. Write the gradient Flow:
∂c
= gκN − (∇g · N )N
∂t
13
2.4 Shape Analysis
32. Shape Analysis.
Many method and tools comprise the shape analysis.
uA
uB
x2
B
A
x1
bA
bB
x2
B
A
x1
Examples:
• Hausdorff Distance
dH (c1 , c2 ) := sup uc2 (x) ∨ sup uc1 (x)
x∈c1 x∈c2
14
35. Shape Averages
Many definitions have also been proposed for the average c̄ of a finite set of
shapes c1 , . . . , cN .
Definition 2.1. Suppose that X is a random vector taking values in lRn ; let
X = E(X) be the mean of X. The principal component analysis is the
representation of X as
Xn
X=X+ Yi Si
i=1
where Si are constant vectors, and Yi are uncorrelated real random variables
with zero mean, and with decreasing variance. Si is known as the i-th mode
of principal variation.
The PCA is possible in general in any finite dimensional linear space equipped
with an inner product. In infinite dimensional spaces, or equivalently in case of
random processes, the PCA is obtained by the Karhunen-Loève theorem.
38. Principal Component Analysis (PCA)
With “linear” shape representations, PCA has become popular for imposing
shape priors in various shape optimization problems.
N
X
c̄ + αn Sn
n=1
15
where
c̄ = mean shape
Sn = n’th mode of variation
αn ∈ lR
| {z } | {z } | {z } | {z }
mean first mode second mode third mode
Segmentation:
16
on geometric active contours use the same geometric L2 -type inner product, which we
refer to as H 0 , to define gradient. However, [Michor and Mumford, 2003; Yezzi and
Mennucci, 2005a] have shown a surprising property: the Riemannian metric on the
space of curves induced by the H 0 inner product is pathological, i.e., the “distance”
between any two curves is zero.
In addition to the pathologies of the Riemannian structure induced by H 0 , there
are also undesirable features of H 0 gradient flows as we will demonstrate in this paper.
Some of these features are listed below.
1. There are no regularity terms in the definition of the H 0 inner product. That
is, there is nothing in the definition of H 0 that discourages flows that are not
smooth in the space of curves. By smooth in the spaces of curves, we mean that
the surface formed by plotting the evolving curve as a function of time is smooth.
Thus, when energies are designed to depend on the image that is to be segmented,
the H 0 gradient is very sensitive to noise in the image. As a result, the curve
becomes non-smooth instantly. Therefore, in geometric active contours models,
a penalty on the curve’s length is added to keep the curve smooth in addition to
producing a variational problem that is well-posed. However, this changes the
energy that is being optimized. Moreover, as we shall demonstrate, the length
penalty is sometimes insufficient, and leads to other problems.
2. H 0 gradients, evaluated at a particular point on the curve, depend locally on
derivatives of the curve. Therefore, as the curve becomes non-smooth, as men-
tioned above, the derivative estimates become inaccurate, and thus, the curve
evolution becomes inaccurate. Moreover, for region-based and edge-based active
contours, the H 0 gradient at a particular point on the curve depends locally on
image data at the particular point. Although region-based energies may depend
on global statistics, such as mean values of regions, the H 0 gradient still depends
on local image data. These facts imply that the H 0 is sensitive to noise and local
features.
3. The H 0 norm gives non-preferential treatment to arbitrary deformations regard-
less of whether the deformations are global motions (not changing the shape of
the curve) such as translations, rotations and scales or whether they are more
local deformations. This implies, as we shall show, that the H 0 -gradient of
image dependent energies “encourages” points on the evolving curve to move
“independently” to decrease energy rather than encouraging the points to move
more collectively. This restricts the gradient at a particular point from “seeing”
information located at other points of the curve. Therefore, the curve evolving
according to the H 0 gradient flow is susceptible to local minima of the energy.
4. Many geometric active contours (such as edge and region-based active contours)
require that the unit normal to the evolving curve be defined. As such, the evolu-
tion does not make sense for polygons. Moreover, since in general, an H 0 active
contour does not remain smooth, one needs special numerical schemes based on
viscosity theory in a level set framework to define the flow.
5. If the energy depends on n derivatives of the curve, then the H 0 gradient has 2n
derivatives of the curve. Since the corresponding level set flows with higher than
two derivatives are not known to have a maximum principle, level set methods
[Osher and Sethian, 1988] are difficult to use for the numerical implementation
[Chopp and Sethian, 1999] Therefore, one may try to use particle methods to im-
plement the flow. However, flows with higher than two derivatives are generally
difficult to implement because of numerical artifacts.
6. Lastly, as a specific example, the H 0 gradient ascent for arclength, i.e., backward
heat flow, is ill-posed. This is quite odd in an intuitive manner because there
is nothing in the definition of length itself that indicates that a flow to increase
17
length should be ill-posed. We shall see that this ill-posedness is a property of
H 0 not just the energy itself.
41. Troubles
As of year ∼2000, a number of troubles were affecting Shape Theory.
• The active contours method was not robust against noise; as in this ex-
ample of segmentation using Chan-Vese
18
• The flow is well posed only for positive times , that is, for increasing t
(similarly to the usual heat equation).
supposing that g is reasonably constant in the region where c is running, E(c) '
g len(c), due to the integral being performed wrt the arc parameter ds. So one
way to reduce E is to shrink the curve. Consequently, when the image is smooth
and featureless (as in medical imaging), the usual edge based energies would
often drive the curve to a point. To avoid it, we may normalize the energy to
obtain a more length–independent energy
Z
1
avgc (g(c)) := g(c(s)) ds
len(c) c
In the above, we omit the argument (s) for brevity; ∇g is the gradient of g wrt
x. If the curve is in the plane, that is n = 2, then we define the normal vector
N ⊥ T by rotating T counterclockwise, and define scalar curvature κ so that
∂s ∂s c = κN ; then, integrating by parts, the above becomes
Z
1
DE(c; h) = ∇g(c) · h − ∇g(c) · T hh · T i −
L c
− g(c) − avgc (g(c)) hh · Ds2 ci ds =
Z
= ∇g(c) · N − κ g(c) − avgc (g(c)) hh · N i ds
c
19
unfortunately the above flow is ill defined: it is a backward heat flow on roughly
half of the curve. We will now propose two simple such examples.
45. Example: A weird H 0 Active Contours
Let c = avgc (c) be the center of mass of the curve.
Let us fix a target point v ∈ lR2 .
E(c) := 12 |c − v|2 penalizes the distance from the center of mass to v.
The H 0 gradient descent flow is
∂c
= −∇H 0 E(c) = h(v − c) · N iN − κN (c − c) · (v − c)
∂t
P
v
The first term in this gradient descent flow tries to move
the whole curve towards v.
Let P := {w : h(w−c)·(v−c) ≥ 0} be the halfplane “on
c the v side” . The second term in this gradient descent
c flow tries to decrease the curve length out of P and
increase the curve length in P : and this is ill posed.
The need for prior information terms arose from several factors such as
20
• the energies had too many extraneous local minima, and the gradient
flows to minimize these energies allowed for arbitrary deformations that
gave rise to unlikely shapes; as in the example in the previous figure of
segmentation using the Chan-Vese energy, where the flowing contour gets
trapped into noise.
If one wishes to have a consistent view of the geometry of the space of curves
in both Shape Optimization and Shape Analysis, then one should use the H 0
metric when computing distances, averages and morphs between shapes.
Surprisingly, H 0 does not yield a well define metric structure, since the
associated distance is identically zero. (This striking fact was first described
in [Mumford]; it is generalized to spaces of submanifolds in [Michor and Mum-
ford, 2005]).
So to achieve our goal, we will need to devise a new metric.
Start of lecture: II. Basic notions, Deeper review
Part II
Basic notions, Deeper review
In this part we review all the program that was expressed in the first lecture,
providing the needed mathematical definitions in full detail.
21
Example 3.3. • lRn has distance d(x, y) = |x − y|, and is complete.
• Any closed subset of a complete space is complete.
• If we cut away an accumulation point out of those spaces, the result is not
complete.
d(x, y) := kx − yk.
Let I ⊂ lRk be open; let p ∈ [1, ∞]. A standard example of Banach space is the
Lp space of functions f : I → lRn with norm
sZ
kf kLp := p
|f (x)|p dx for p ∈ [1, ∞) , kf kL∞ := sup |f (x)| ;
I I
Note that, in this spaces, f = g iff (by definition) the set {f 6= g} has measure
zero.
The following citations [Hamilton, 1982] are referred to the first part of
Hamilton’s 1982 survey on the Nash&Moser theorem.
55. Fréchet space [Hamilton, 1982]
22
A locally-convex topological vector space (l.c.t.v.s.) E has a collection
of seminorms k · kk . The seminorms induce a topology, such that cn → c iff
kcn − ckk →n 0 for all k. All vector space operations are continuous w.r.t. this
topology.
A Fréchet space E is a complete Hausdorff metrizable locally-convex topo-
logical vector space; where
complete when, for any sequence (cn ), the fact that kcm − cn kk →m,n 0 for all
k implies that cn converges;
Hausdorff if kckk = 0 for all k then c = 0;
metrizable there are countably many seminorms associated to E.
The reason for the last definition is that, if E is metrizable, then we can define
a distance
∞
X 2−k kx − ykk
d(x, y) :=
1 + kx − ykk
k=0
that generates the same topology as the family of seminorms k · kk ; and the vice
versa is true as well, see [Rudin, 1973].
56. More examples of spaces of functions
Let I ⊂ lRk be open, or I be a finite dimensional manifold.
• The Banach spaces of C j functions; where
kf k = sup sup |f (k) (t)| ;
k≤j t
(k)
so cn → c when supt |cn (t) − c(k) (t)| →n 0 for all k ≤ j;
• A standard choice of Hilbert spaces of functions is provided by the Sobolev
spaces H j (I → lRn ), with scalar product
Z
hf, giH n := f (t) · g(t) + · · · + f (j) · g (j) dt .
c
23
58. Example
One example is F = C 0 (S 1 ) and the map L : lR × F → F where L(t; f ) = g
with g defined as the translation g(x) = f (t + x) of f . This map is linear in the
second argument; it is easy to prove that it is jointly continuous; but the linear
operator map
t 7→ (f 7→ L(t; f ))
(9)
lR → L(F, F )
is not continuous, since the operator norm is
59. Derivatives
An example is the Gâteaux differential of map P : U ⊂ F → G, where F, G
are Fréchet spaces, U ⊂ F is open, and h ∈ F
P (f + th) − P (f )
DP (f ; h) := lim = [∂t P (f + th)]|t=0 . (10)
t→0 t
We say that P is C 1 if DP : U × F → G exists and is jointly continuous. (This
is weaker than what is usually required in Banach spaces).
The basics of the usual calculus hold.
Theorem 3.5 ([Hamilton, 1982, Thm. 3.2.5]). If P is C 1 then DP (f ; h) is
linear in h.
Theorem 3.6 (Chain rule [Hamilton, 1982, Thm. 3.3.4]). If P, Q are C 1 then
P ◦ Q is C 1 and
D(P ◦ Q)(f ; h) = DP Q(f ); DQ(f ; h) .
A consequence (that we will discuss more later on) is that the exponential
map in Riemannian manifolds may fail to be locally surjective. See [Hamilton,
1982, Sec. 5.5.2].
24
3.3 Manifold
To build a manifold of curves, we have ahead two main definitions of manifold
to choose from.
62. Differentiable Manifold.
An (abstract) differentiable manifold is a topological space M associ-
ated to a model l.c.t.v.s. E and equipped with an atlas of charts φk : Uk → Vk .
V2
φk are homeomorphisms , with c
V1
Uk ⊂ E open sets, and Vk ⊂ M
open sets that cover M . M
The composition φ−1 1 ◦φ2 restricted
to V1 ∩V2 will satisfy some require- φ1 φ2
ments, as e.g. being a diffeomor-
phism of class C r .
U U1 U2
X
φk are diffeomorphisms, and φk
maps U ∩ Uk onto M ∩ Vk . U1 U
25
3.4 Fréchet Manifold
When studying the space of all curces, we will deal with Fréchet manifolds,
where the model space E will be a Fréchet space, and the composition of local
charts φ−1
1 ◦ φ2 is smooth.
66. Examples of Fréchet manifolds
Given two finite-dimensional manifolds S, R, with S compact and n dimen-
sional,
• [Hamilton, 1982, Example 4.1.3]. the space C ∞ (S; R) of smooth maps
f : S → R is a Fréchet manifold. It is modeled on E = C ∞ (lRn ; R).
• [Hamilton, 1982, Example 4.4.6]. The space of diffeomorphisms Diff(S) of
S onto itself is a Fréchet manifold. The group operation φ, ψ → φ ◦ ψ is
smooth.
• But if we try to model Diff(S) on C k (S; S), then the group operation is
not even differentiable.
C ∞ (S; R)/Diff(S)
We have found a math theory that seems apt to define and operate on the
manifold of geometric curves.
26
(Sometimes F is called a “Minkowsky norm”).
As for the case of norms, a Finsler metric may be asymmetric; but, for sake
of simplicity, we will only consider the symmetric case.
Using the norm |v|c we can then again define the length of paths by
Z 1 Z 1
F
Len (γ) = |γ̇(t)|γ(t) dt = F (γ(t), γ̇(t)) dt
0 0
The action E is a smooth quadratic integral, and we can compute the Euler–
Lagrange equations; its minima are more regular, since they are guaranteed to
have constant speed; consequently, when trying to find geodesics, we will try to
minimize the action and not the lengh.
72. Exponential map
Definition 3.13 (exp map). Let γ̈ + Γ(γ̇, γ) = 0 be the Euler–Lagrange ODE
R1
of E(γ) = 0 |γ̇(v)|2 dv. Any solution of this ODE is a critical geodesic. (Note
that a minimal geodesic is a critical geodesic).
Define the exponential map expc : Tc M → M as expc (η) = γ(1), the
solution of
n
γ̈(v) + Γ(γ̇(v), γ(v)) = 0, γ(0) = c, γ̇(0) = η (11)
27
74. exp as a local linear chart
Proposition 3.15. Suppose that M is a smooth differentiable manifold modeled
on a Hilbert space with a smooth Riemannian metric. The derivative of the
exponential map expc : Tc M → M at the origin is an isometry, hence expc is a
local diffeomorphism between a neighborhood of 0 ∈ Tc M and a neighborhood of
c ∈ M.
The exponential map can then “linearize” small portions of M , and so it
will enable us to use linear methods such as the principal component analysis.
Unfortunately, the above result does not hold if M is modeled on a Fréchet
space.
75. Hopf–Rinow theorem
Theorem 3.16 (Hopf–Rinow). Suppose M is finite dimensional Riemannian
or Finsler manifold. The following are equivalent
• (M, d) is complete ⇐⇒
• (11) can be solved for all c ∈ M and v ∈ lR ⇐⇒
• the map expc (η) is surjective;
and all those =⇒ ∀x, y ∈ M there exist a minimal geodesic connecting x to y.
28
Example 3.20 ([Grossman, 1965] example (or, see sec. VIII.§6 in [Lang, 1999])).
e0
N Build an ellipsoid
( )
X
γ1 e1 M= x ∈ `2 | x20 + x2i /(1 + 1/i)2 = 1
e3 e2 i=1
29
81. Gradient
Suppose now that we add a Riemannian geometry to M ; this defines an
inner product h·, ·ic on Tc M , so we can then define the gradient.
Definition 3.23 (Gradient). The gradient ∇E(c) of E at c is the unique
vector v ∈ Tc M such that
hv, hic = DE(c; h) ∀h ∈ Tc M .
If M is modeled on a Hilbert space H, and the inner product h·, ·ic used on
Tc M is equivalent to the inner product in H then the above equation uniquely
defines what the gradient is. When M is modeled on a Fréchet space, though,
in some cases the gradient does not exist.
Example 3.24. Let M = C ∞ ([−1, 1] → lR); since M is a vector space, then it is
trivially a manifold, with a single identity chart; and Tc M = M . Let E : M → lR
be the evaluation functional E(f ) = f (0), then DE(f ; h) = h(0); let
Z 1
hf, gi = f (t)g(t) dt ;
−1
then the gradient of E would be δ0 (the Dirac’s delta), that is a distribution (or
more simply a probability measure) but not an element of M .
30
Definition 3.26. A (semi)distance may be defined on B = M/G by
31
Proposition 4.4. Imm(S 1 , lRn ) , Immf (S 1 , lRn ) , Emb(S 1 , lRn ) , are open
subset of the Banach space C r (S 1 → lRn ) when r ≥ 1
Remark 4.5. • Imm(S 1 , lRn ) , Immf (S 1 , lRn ) , Emb(S 1 , lRn ) are (triv-
ially) submanifolds. (In particular, the charts are identity maps).
• They are closed w.r.t. reparameterizations.
• They are connected iff n ≥ 3; whereas in the case n = 2 of planar curves,
they are divided in connected components each containing curves with the
same rotation index (see Prop. 8.1 for the definition).
4.1.1 Gradients
The quotient spaces M/G are the spaces of curves up to rotation and/or
translations and/or scaling (. . . ).
92. Diffeomorphisms and reparameterization
Definition 4.6. Let Diff(S 1 ) be the family of diffeomorphisms of S 1 : all the
maps φ : S 1 → S 1 that are C 1 and invertible, and the inverse φ−1 is C 1 .
32
Diff(S 1 ) enjoys some important mathematical properties:
• Diff(S 1 ) is a group, and its group operation is the composition φ, ψ 7→
φ ◦ ψ.
• Diff(S 1 ) acts on M , the action is the right composition c, φ 7→ c◦φ. This
action is a reparameterization of c.
Note that Diff(S 1 ) has two connected components
where
• Diff+ (S 1 ) is the family of diffeomorphisms with φ0 > 0;
• Diff− (S 1 ) is the family of diffeomorphisms with φ0 < 0.
Diff+ (S 1 ) is a group, its action does not change the orientation of a curve.
We may also consider the quotient w.r.t Diff+ (S 1 ). The quotient space
M/Diff+ (S 1 ) is the space of geometric oriented curves.
95. Geometric functionals
Definition 4.8. A functional F (c) defined on curves will be called geometric
if it is invariant w.r.t. reparameterization of c; this may take two forms,
• F (c) = F (c ◦ φ) for all curves c and for all φ ∈ Diff(S 1 ).
• For all curves c and all φ, if φ ∈ Diff+ (S 1 ) then F (c) = F (c ◦ φ), whereas
if φ ∈ Diff− (S 1 ) then F (c) = −F (c ◦ φ)
In the first case, F may be “projected” to B = M/Diff(S 1 ), that is, it may
be considered as a function F : B → lR.
“Geometrical” theories have often provided the best results in Computer
Vision.
96. Gâteaux differential of rep. invariant operator
33
Consider the case of operators when O(c) is a curve or a vector field, and O
is reparameterization invariant i.e. ∀ϕ ∈ Diff+ (S 1 )
Dc,h O(c)(θ) = a(θ) . ∂θ O(c)(θ) for any h(θ) = a(θ).∂θ c(θ) (15)
Dc,h O(c)(s) = a(s) . Ds O(c)(s) for any h(s) = a(s).Ds c(s) . (16)
Proof. We define ϕ : lR × S 1 → S 1 by
ϕ̇(t, θ) = a(ϕ(t, θ))
ϕ(0, θ) = θ
(ϕ(t, ·) = exp(ta), the exponential map in Diff(S 1 )) We define γ(t, θ) = c(t, ϕ(t, θ)).
To compute Dc,h O(c) we compute ∂t O(γ)|t=0 but by invariance of O O(γ)(θ) =
O(c)(ϕ(t, θ)) so ∂t O(γ) = aO(c)(ϕ(t, θ)) hence we set t = 0.
so when h = aT we get
Similarly
Proposition 4.11. if E : M → lR is a reparameterization invariant energy,
then
Dc,h E = 0 for any h = a.Ds c . (17)
34
4.5 Geometric curves
99. Main definition: “geometric curves” [Michor and Mumford, 2006]
Unfortunately the quotient
is the quotient of Immf (S 1 , lR2 ) (free immersions) by the diffeos Diff(S 1 ) (repa-
rameterization).
The good news is that
Theorem 4.13 (§2.4.3 in [Michor and Mumford, 2006]). If Immf and Diff(S 1 )
have the topology of the Fréchet space of C ∞ functions, then Bi,f is a Fréchet
manifold modeled on C ∞ .
The bad news is that
• when we add a simple Riemannian metric to Bi,f , the resulting metric
space is usually not metrically complete;
• Some calculus is lost (Cauchy thm, exponential map).
Remark 4.14. It seems that this is the only way to properly define the man-
ifold. If otherwise we choose M = C k (S 1 → lR2 ) to be the manifold of curves,
then if c ∈ M , c0 6∈ Tc M . Hence we (must?) use M = C ∞ functions.
that is, it moves c(u) in direction k(u); and it is easily seen that the chart does
not depend on the choice of c such that Π(c) = [c].
We will prove that the family Φ[c] is a smooth atlas for Bi,f .
102. Lemmas for proof of the theorem 4.13
Let c be a C 2 immersed planar curve. We recall these properties.
Lemma 4.15. A curve c ∈ C 2 can be reparameterized to c̃ = c ◦ ϕ with
|c̃0 | =constant.
35
Note that, since the curve is closed, then κ cannot be identically zero.
Proposition 4.16 (Local embedding). Let δ := π/(4 max |κ|). For any a, b ∈
S 1 with L := len c|[a,b] ≤ δ; then |c(b) − c(a)| ≥ L/2; so c|[a,b] is embedded.
Proposition 4.17 (tubolar neighbourhood). Let δ := π/(4 max |κ|), τ :=
1/(2 max |κ|). Fix a, b ∈ S 1 with len c|[a,b] ≤ δ. Let φ(s, t) = c(s) + tN (s),
for s ∈ [a, b] and t ∈ [−τ, τ ]. Then φ is a diffeomorphism.
As a corollary, there is a neighbourhood of the image of c where the “projection
to c” is a C 2 multivalued map (with finitely many projections).
Lemma 4.18. Let c be a C 2 immersed planar curve. There exists a r > 0 such
that, if
c(s) + a(s)N (s) = c(φ(s))
with φ ∈ Diff(S 1 ) and a : S 1 → lR continuous with kak ≤ r, then a ≡ 0.
(Note that if c is a circle, then c(s) + 2N (s) = c(s + φ)).
Lemma 4.19. Immf is open in Imm
36
2. [euclidean invariance] apply euclidean transformation A to c (with R
is its rotational part) then
hRh1 , Rh2 iG|Ac = hh1 , h2 iG|c
Implies that distance is invariant w.r.t. euclidean transformation,
d(Ax, Ay) = d(x, y) ∀A roto-traslation
and what is the relationship between the degrees i, j and the properties in this
section?
Also said: can we design metrics?.
Start of lecture: III. Literature Overview
Part III
Literature Overview
In this third lecture we will present some current literature, and discuss it
37
5 Representation/embedding/quotient
110. The representation/embedding/quotient paradigm
A common way to model shapes is by representation/embedding:
• we represent the shape A by a function uA
• and then we embed this representation in a space E, so that we can
operate on the shapes A by operating on the representations uA .
Most often, representation/embedding alone do not directly provide a sat-
isfactory Shape Space. In particular, in many cases it happens that the repre-
sentation is “redundant”, that is, the same shape has many different possible
representations. An appropriate quotient is then introduced.
There are many examples of Shape Spaces in the literature that are studied
by means of the representation/embedding/quotient scheme. Understanding the
basic math properties of these 3 operations is then a key step in understanding
Shape Spaces and designing/improving them.
We now present a rapid overview of how this scheme is exploited in the
current literature of Shape Spaces; then we will come back to some of them to
explain more in depth.
Remark 5.1. When we started talking of geometric curves, we proposed the
quotient B = M/Diff(S 1 ) (the space of curves up to reparameterization); and
this had to be followed by other quotients w.r.to Euclidean motions and/or
scaling, to obtain M/Diff(S 1 )/E(n)/lR+ . In practice, though, the space B
happens to be more difficult to study; hence most shape space theories that deal
with curves prefer a different order: the quotient M/E(n)/lR+ is modeled and
studied first; then the quotient M/E(n)/lR+ /Diff(S 1 ) is performed (often, only
numerically).
38
as a subset N in L2 (0, 2π) or W 1,2 (0, 2π). Since the goal is to obtain a Shape
Space representation for Shape Analysis purposes, a quotient is then introduced
on N .
Example 5.5. Another representation of planar curves for Shape Analysis is
found in [Younes, 1998b]. In this case the angle function is considered mod(π).
This representation is both simple and very powerful at the same time. In-
deed, it is possible to prove that geodesics do exist and to explicitly show
examples of geodesics: see [Younes et al., 2008].
114. More examples
Example 5.6 (Harmonic representation). A. Duci et al (see [Duci et al., 2003,
2006]) represent a closed planar contour as the zero level of a harmonic function.
This novel representation for contours is explicitly designed to possess a linear
structure, which greatly simplifies linear operations such as averaging, principal
component analysis or differentiation in the space of shapes.
And, of course, we have in this list the spaces of embedded curves.
Example 5.7. When studying embedded curves, usually, for the sake of math-
ematical analysis, the curves are modeled as immersed parametric curves; a
quotient w.r.t the group of possible reparameterizations of the curve c (that
coincides with the group of diffeomorphisms Diff(S 1 )) is applied afterward to
all the mathematical structures that are defined (such as the manifold of curves,
the Riemannian metric, the induced distance, etc.).
6 Metrics of sets
We now present two example of Shape Theories where a shape may be a generic
subset of the plane; with particular attention to how they behave wrt curves.
39
Definition 6.3. If in the definition dg (x, y) := inf γ Lend γ there is a curve γ ∗
providing the minimum, then γ ∗ is called a (minimal) geodesic connecting x
to y.
Proposition 6.4. In Riemann and Finsler manifolds, the integral length Len(γ)
that we defined in slide 68 coincides with the total variation length Lend γ that
we defined in 6.1. As a consequence, d = dg : d is path-metric.
In general, though, it is easy to devise examples where d 6= dg .
Example 6.5 (for M = S 1 ⊂ lR2 ).
if d(x, y) = |x − y|,
then dg (x, y) = | arg(x) − arg(y)|.
Dg (x, ρ) := {y | dg (x, y) ≤ ρ}
40
6.2 Hausdorff metric
122. Hausdorff distance
Let Ξ be the collection of all compact subsets of lRN . (This is sometimes called
the“topological hyperspace” of lRN ). Let A, B ⊂ lRN compact non-empty. We
defined the distance function uA (x) := inf y∈A |x − y|. We defined the
Hausdorff distance of two sets A, B as
dH (A, B) := sup uB (x) ∨ sup uA (x) .
x∈A x∈B
• (Ξ, dH ) is path–metric;
• each family of compact sets that are contained in a fixed large closed ball
in lRN is compact in (Ξ, dH ); so
• any closed ball D(A, ρ) := {B | dH (A, B) ≤ ρ} is compact in (Ξ, dH ), and
moreover
• by Theorem 6.11, minimal geodesics exist in (Ξ, dH ).
123. An alternative definition
Let Dr (x) be the closed ball of center x and radius r > 0 in lRN , and
Dr = Dr (0). We define the fattened set to be
[
A + Dr := {x + y | x ∈ A, |y| ≤ r} = Dr (x) = {y | uA (y) ≤ r}.
x∈A
41
and in the picture we represent (dashed) the fattened sets A + D√5/2 and
√ √
B + D√5/2 . Note that dH (A, B) = 5 while dH (A, Ct ) = dH (B, Ct ) = 5/2:
so Ct are all midpoints that are on different geodesics between A and B.
125. Curves and connected sets
Let Ξc be the class of compact connected A ⊂ lRN ; Ξc is a closed subset
of (Ξ, dH ); Ξc is the closure in (Ξ, dH ) of the class of (images of) all embedded
curves. Ξc is Lipschitz-path-connected 1 ; for all above reasons, it is possible
to connect any two A, B ∈ Ξc by a minimal geodesic moving in Ξc . So, if we
try to find a minimal geodesic connecting two curves using the metric dH , we
will end up finding a geodesic in (Ξc , dH ); and similarly if we try to optimize
an energy defined on curves. But note that Ξc is not geodesically convex in Ξ,
that is, there exist two points A, B ∈ Ξc and a minimal geodesics ξ connecting
A to B in the metric space (Ξ, d), such that the image of ξ is not contained
inside Ξc . We don’t know if (Ξc , dH ) is path-metric.
126. Applications in Computer Vision
[Charpiat et al., 2004] proposed an approximation method to compute lenH (ξ)
by means of a family of energies defined using a smooth integrand; the approx-
imation is mainly based on the property kf kLp →p kf kL∞ , for any measurable
function f defined on a bounded domain; they successively devise a method to
find approximation of geodesics.
42
Remark 6.13. Given any f, g ∈ Lp with p ∈ (1, ∞), the segment connecting
f to g is the unique minimal geodesic connecting them. Suppose now that the
dimension of L∞ (Ω, A, µ) is greater than 1. Given generically f, g ∈ L∞ , there
is an uncountable number of minimal geodesics connecting them. (“Generically”
is meant in the Baire sense).
(For the proof see 2.11 & 2.13 in [Duci and Mennucci, 2007]). The above
result suggests that it may be possible to shoot geodesics in the “Riemannian
metric” associated to this distance.
129. Contingent cone
Given a v ∈ Nc , let Tv Nc ⊂ Lp be the contingent cone
where it is intended that the above limits are in the sense of strong convergence
in Lp . Tv Nc contains all directions in which it is possible “in the Lp sense” to
infinitesimally deform a compact set. For example, if Φ(x, t) : lRN × (−ε, ε) →
lRN is smooth diffeomorphical motion of lRN , and At := Φ(A, t), then vAt is
(locally) Lipschitz, so it is almost everywhere differentiable, and the derivative
is in T Nc . This includes all perspective deformations. Unfortunately the
contingent cone is not capable of expressing some shape deformations.
Example 6.14. We consider the removing motion; let A be compact, and suppose
that x is in the internal part of A; let At := A \ B(x, t) be the removal of a
small ball from A. The motion vAt inside Nc is Lipschitz, but the limit
vA t − vA
lim
t→0+ kvAt − vA k Lp
43
The proof is in 3.22 in [Duci and Mennucci, 2007].
To conclude, we propose an explicit computation of the Riemannian Metric
for the case of compact sets in the plane with smooth boundaries; we then pull
back the metric to obtain a metric of closed embedded planar curves. We start
with the case of convex sets.We fix p = 2, N = 2.
133. Polar coordinates of smooth convex sets
Let Ω ⊂ lR2 be a convex set with smooth boundary; let y(θ) : [0, L] → ∂Ω
be a parameterization of the boundary (by arc-parameter), ν(θ) the unit vector
normal to ∂Ω and pointing external to Ω: then the following “polar” change
of coordinates ψ holds:
y(θ)
where
πν (w) := νhν, wi = w − ∂s yh∂s y, wi
is the projection of w parallel to ν. Supposing now that ρ = ρ(t) as well, we
can express the point ψ(ρ, y) in a first order approximation as
dψ = (∂t y) + ρ0 ν + ρJπν (∂s (∂t y)) dt + ∂θ y + ρ∂θ ν dθ
where moreover
∂θ y + ρ∂θ ν dθ = ∂s y + ρ∂s ν ds = 1 + ρκ ∂s y ds .
44
If y(θ, t), ρ(t) are expressing a constant point x = ψ(ρ, y), then dψ = 0; we
apply scalar products w.r.t. ν and ∂s y to the above relations
hν, (∂t y)i + ρ0 = 0 , h∂s y, (∂t y)i − ρhν, ∂s (∂t y)idt + (1 + ρκ) ds = 0 .
that is, Z
hα, βi = (a + bκ(s))α(s)β(s)ds (19)
∂Ω
with Z Z
0
a= (ϕ (ρ)) dρ 2
, b= (ϕ0 (ρ))2 ρ dρ .
lR+ lR+
moreover R(s) is Lipschitz (by results in [Itoh and Tanaka, 2000],[Li and Niren-
berg, 2005]).
Cut
45
The pullback metric for deformations of the contour ∂Ω is
Z "Z R(s)
#
hh, ki = (ϕ0 (ρ))2 (1 + ρκ(s)) dρ α(s)β(s) ds
∂Ω 0
142. Conclusion
In this case we have found (aposteriori) a Riemannian metric of closed em-
bedded planar curves, and we know the structure of the completion, and the
completion admits minimal geodesics. On the down side, the completion is
not really “a smooth Riemannian manifold”. For example, it is difficult to study
the minimal geodesics, and to prove any property about them (for the reasons
already explained in slide 125). Anyway, the fact that Nc is locally compact
and the regularity property 6.13 of Lp spaces suggest that it may be possible
to “shoot geodesics” (in some weak form). Unfortunately the topology has too
few open sets to be used for Shape Optimization: many simple functionals are
not continuous.
We present two examples of Finsler geometries of curves that have been used
(sometimes covertly) in the literature.
46
7.1 L∞ metric and Fréchet distance
144. Tangent and normal
Let v, T ∈ lRN , let N = T ⊥ .2 We define the projection onto the normal space
N = T⊥
πN : lRN → lRN , πN v = v − hv, T iT
and on the tangent T
this metric weights for the maximum normal motion of the curve.
This Finsler metric is (homotopy-wise) geometric. The length of a homotopy
is then Z 1
Len(C) = sup |πN ∂t C(t, θ)| dv
0 θ
2 There is a slight abuse of notation here, since in the definition N = T ⊥ given for planar
47
The length of a homotopy is then
ZZ
Len(C) = |πN ∂t C(t, θ)||C 0 (t, θ)| dt dθ
This last is easily recognizable as the surface area of the homotopy (up to
multiplicity); the problem of finding a minimal geodesic connecting c0 and c1 in
the F 1 metric may be reconducted to the Plateau problem of finding a surface
which is an immersion of I = S 1 × [0, 1] and which has fixed borders to the
curves c0 and c1 . The Plateau problem is a wide and well studied subject upon
which Fomenko expounds in the monograph [Fomenko, 1990].
48
Proposition 8.1 (Continuous lifting, rotation index). If ξ ∈ C 1 and by arc-
parameter, then ξ 0 is continuous and |ξ 0 | = 1, so there exists a continuous
function α : lR → lR satisfying
3
1 −2
0
152. Shape Representation using Direction Functions
The addition of a generic constant to α is equivalent to a rotation of ξ.
We then understand that we may represent arc-parameterized curves ξ(s), up
to translation, scaling, and rotations, by considering a suitable class of liftings
α(s) for s ∈ [0, 2π].
Two spaces are defined in [Klassen et al., 2004]; we present the case of
“Shape Representation using Direction Functions”; let L2 := L2 ([0, 2π] → lR);
Φ : L2 → lR3 is defined by
Z 2π Z 2π Z 2π
Φ1 (α) := α(s) ds , Φ2 (α) := cos α(s) ds , Φ3 (α) := sin α(s) ds .
0 0 0
S := α ∈ L2 | Φ(α) = (2π 2 , 0, 0) ;
This means that α identifies an unique curve (of length 2π, and arc-parameterized)
up to rotations and translations, and to the choice of the base point ξ(0); for
this last reason, S is called in [Klassen et al., 2004] a preshape space.
Inside the family of arc-parameterized curves, the reparameterization of ξ is
ˆ
restricted to the transformation ξ(u) = ξ(u − s0 ), that is a different choice of
basepoint for the parameterization along the curve. Inside the pre shape space
S, the same operation is encoded by the relation α̂(s) = α(s − s0 ). To obtain
a model of immersed curves up to reparameterization, rotation, translation,
scaling (recalling slide 98), we need to quotient S by the relation α ∼ α̂, for all
possible s0 ∈ lR. We do not discuss this quotient here.
154. S \ Z is a manifold
49
Definition 8.3 (Flat curves). Let Z be the set of all α ∈ L2 ([0,R2π]) such that
α(s) = a + k(s)π where k(s) ∈ Z and k is measurable, a = 2π − k ∈ lR, and
Z is closed (by thm. 4.9 in [Brezis, 1986]). We see that Z contains the
(representations α of) flat curves ξ, that is, curves ξ whose image is contained
in a line;
Example 8.4. one such curve is
( √ (
s/ 2 s ∈ [0, π] π/2 s ∈ [0, π]
ξ1 (s) = ξ2 (s) = √ , α=
(2π − s)/ 2 s ∈ (π, 2π] 3π/2 s ∈ (π, 2π]
a1 cos(α(s)) + a2 sin(α(s)) + a3 = 0
detailed proof
50
This implies that the family Aξ of measurable α ∈ S that represent the same
curve ξ is infinite. It may be then advisable to define a quotient distance dˆ
(see Defn. 3.26) as follows:
ˆ ξ 0 ) :=
d(ξ, inf d(α, α0 ) (22)
α∈Aξ ,α0 ∈Aξ0
Proposition 8.7. For any h ∈ Z, the set of closed smooth curves ξ with rotation
index h, when represented in S using the continuous lifting (20), is dense in
S \ Z.
It implies that we cannot properly extend the concept of rotation index to
S.
The proof is based on this lemma.
Lemma 8.8. Suppose that ξ is not flat, let τ be one of the measurable liftings
of ξ. There exists a smooth projection π : V → S defined in a neighborhood
V ⊂ L2 of τ such that, if f ∈ L2 ∩ C ∞ then π(f ) is in L2 ∩ C ∞ .
This is the proof of both the lemma and the proposition.Fix α0 ∈ S \ Z.
Let T = Tα0 S be the tangent at α0 . T is the vector space orthogonal to ∇φi (α0 )
for i = 1, 2, 3. Let ei = ei (s) ∈ L2 ∩ Cc∞ be near ∇φi (α0 ) in L2 , so that the map
(x, y) : T × lR3 → L2
3
X
(x, y) 7→ α = α0 + x + ei yi (23)
i=1
Then
3
X
π(α)(s) − α(s) = ei (s)ai , ai := (Fi (x(α)) − yi (α)) ∈ lR (24)
i=1
so if α(s) is smooth, then π(α)(s) is smooth. This ends the proof of the lemma.
4 Indeed, the continuous lifting is unique up to addition of a constant to α(s), which is
51
Let αn be smooth functions such that αn → α in L2 , then π(αn ) → α0 ; if we choose
them to satisfy αn (2π)−αn (0) = 2πh, then, by the formula (24), π(α)(2π)−π(α)(0) =
2πh so that π(αn ) ∈ S and it represents a smooth curve with the assigned rotation
index h.
52
Proof. By ξ 0 = (e+if )2 /2 we obtain |ξ 0 | = |e+if |2 /2 and δξ 0 = (e+if )(δe+iδf )
so that Z 2π
|δξ 0 |2
Z Z
2
|Ds δξ| ds = 0
dθ = 2 |δe + iδf |2 dθ
ξ S 1 |ξ | 0
53
8.2 Riemannian spaces of immersed curves
Metrics of “geometric” curves have been studied by Michor–Mumford [Michor
and Mumford, 2006, 2007, 2005] and Yezzi–Mennucci [Yezzi and Mennucci,
2005b, 2004a,b]; more recently, Yezzi–Mennucci–Sundaramoorthi [Sundaramoor-
thi et al., 2007b, 2006; Mennucci et al., 2008; Sundaramoorthi et al., 2005,
2008a, 2007a] have studied Sobolev–like metrics of curves and shown many good
properties for applications to Shape Optimization; similar results have also been
shown independently by Charpiat et al [Charpiat et al., 2004].
We now discuss some Riemannian metrics on immersed curves.
• H 0 (the simplest)
Z
hh1 , h2 iH 0 = hh1 (s), h2 (s)i ds
c
8.2.1 H 0 (traditional)
• The H 0 distance is identically zero (We wil prove this in sec. 14.1.)
• In section 14.6 we will see a possible result of existence of geodesics for
this metric, when restricted to curves with bounded curvature.
54
8.2.2 H A (Michor-Mumford)
171. H A Metric
[Michor and Mumford, 2006] propose the metric H A
Z L
hh1 , h2 iH A = (1 + A|κc |2 )hh1 , h2 i ds
|c
0
BV 2 ⊂ M ⊂ Lip
where BV 2 are the curves that admit curvature as a measure (see section
14.5) and Lip are the rectifiable curves.
• Compactness bounds (prevents the pulley example).
∂C 1
= −∇ψ E(C) = − ∇E(C)
∂t ψ(C)
55
8.3 Sobolev type Riemannian metrics
174. Sobolev Metrics [Sundaramoorthi et al., 2005]
Let Ds := |c10 | ∂θ be “the derivative with respect to arclength”. Let
Z
hh1 , h2 iH j := hDsj h1 , Dsj h2 i ds
0
S1
56
Let again L = len(c). Recall the definition 3.23 of gradient ∇E by means of
the identity
h∇E, hic = DE(c)(h) ∀h ∈ Tc M .
Let f = ∇H 0 E, g = ∇H 1 E be the gradients wrt the inner products H 0 and H 1 ;
by the definition of gradient, we obtain that
that is Z Z
h · f ds = h · g + λL(Ds h · Ds g) ds ∀h
c c
with some integration by parts this becomes
Z
h · (f − g + λLDs2 g) ds = 0 ∀h
c
∇H 0 E = ∇H 1 E − λLDs2 (∇H 1 E) .
∇H 1 E = ∇H 0 E ∗ Kλ , ∇H̃ 1 E = ∇H 0 E ∗ K̃λ .
∇
\H 0 E(l)
∇
\H n E(l) = for l ∈ Z (30)
1 + λ(2πl)2n
57
6 10
λ=0.01 λ=0.01
λ=0.04 λ=0.04
5 λ=0.08 8 λ=0.08
6
4
4
3
2
2
0
1
−2
0 −4
−0.5 0 0.5 −0.5 0 0.5
Figure 3: Plots of Kλ (left) and K̃λ (right) for various λ with L = 1. The plots
show the kernels over one period.
and
∇
\H 0 E(l)
∇
\
H̃ n E(0) = ∇H 0 E(0),
\ ∇
\
H̃ n E(l) = for l ∈ Z\{0}, (31)
λ(2πl)2n
It is clear from the previous expressions that high frequency components of
∇H 0 E(c) are increasingly less pronounced in the various forms of the H n gradi-
ents.
Note that we are smoothing gradients, so we will obtain smoother minimiza-
tion flows. But we are not smoothing the curves themselves.
182. Flow regularization
It is possible to show mathematically that the Sobolev–type gradients regu-
larize the flows of well known energies, by reducing the degree of the P.D.E., as
shown in this example:
Example 8.11. in the case of the elastic energy E(c) = c κ2 ds = c |Ds2 c|2 ds,
R R
(3)
the H 0 gradient is ∇H 0 E = LDs (2Ds c+3|Ds2 c|2 Ds c) that includes fourth order
derivatives; whereas the H̃ 1 -gradient is
∇H 0 E = L(∇φ(c) · N )N − Lφ(c)κN
58
where φs := (φ◦c)0 . Since K̃λ is the fundamental solution of the gradient prob-
lem, that is, K 00 (s) = L1 L1 − δ(s) (that is the eqn. (18) in [Sundaramoorthi
et al., 2007a]), we find that
φc − avgc (φc)
∇H̃ 1 E = − L(φs c) ∗ K̃λ0 + L∇φ ∗ K̃λ . (33)
λL
The above does not require that the curve be twice differentiable, and then we
may prove that
1,1
Proposition 8.12. Suppose that φ ∈ Cloc (that is, φ ∈ C 1 and its derivative is
n
Lipschitz on compact subsets of lR ), let the initial curve be c0 ∈ C 1 ; then the
gradient flow
dc
= −∇H̃ 1 E(c)
dt
exists and is unique in C 1 for small times.
If lim inf x→∞ φ(x)|x| > 0, then it exists for all (positive) times.
Of particular interest is when φ = 1, that is E = L, the length of the curve.
In this case, we already know that the H 0 flow is the geometric heat flow.
We see that
c − avgc (c)
∇H̃ 1 L = .
λL
so H̃ 1 gradient flow constitutes a simple rescaling of the curve about its centroid
(!).
It is interesting to notice that the H 1 and H̃ 1 gradient flows are well-posed
for both ascent and descent while the H 0 gradient flow is only well-posed for
descent.
While the H 0 gradient descent flow smooths the curve, the H̃ 1 gradient
descent (or ascent) has neither a beneficial nor a detrimental effect on the reg-
ularity of the curve.
59
Proof. We rapidly sketch the proof of local existence and uniqueness (for more
details, look into section 10.6 in [Mennucci, 2009]).
We show that ∇H̃ 1 E(c) is locally Lipschitz in the C 1 Banach space (of para-
1,1
metric curves). Indeed, since φ ∈ Cloc , the maps
c 7→ ∇φ(c)
c 7→ φ(c)Ds c
are locally Lipschitz as maps from C 1 to C 0 , so (see Lemmas 10.25 and 10.26
in [Mennucci, 2009]) we obtain that
c 7→ avgc (∇φ(c))
are locally Lipschitz as maps from C 1 to C 1 ; combining all together (and using
the Lemma 10.25 again)
1 1
∇H̃ 1 E(c) = Lavgc (∇φ(c)) − Pc Pc ∇φ(c) + Pc φ(c)Ds c
λL λL
is locally Lipschitz as maps from C 1 to C 1 ; so by the Cauchy–Lipschitz theorem
we know that the gradient flow exists for small times.
When φ ∈ C 1 , φ > 0, it is nonetheless possible to prove that the flow exists
for small (positive) times.
the above (by using the Euler method) implies existence, in the class of curve
evolutions c(θ, t) that are Lipschitz in both variables.
We propose the trace of the proof, in case λ > 24, for H̃ 1 flow.
Let B(0, r) ⊂ Rd be the ball centered in 0 of radious r > 0.
Let Mr > 0 be a supremum to φ and |∇φ| in B(0, r).
Since
L inf φ(x) ≤ E(c) ≤ L sup φ(x)
|x|<r |x|<r
then
1 1
E(c) ≤ L ≤ E
Mr mr
60
By using the Poincaré–type inequality (8) in [Sundaramoorthi et al., 2007a]
we prove that
Z Z Z
φc − avgc (φc) ≤ |(φc) | ds ≤ (∇φ · c )|c| ds + φ|c0 | ds ≤ Mr (1 + r)L
0 0
moreover Z L
1
|K̃λ0 | ds =
0 2λ
then, for all s,
1 1
|∇H̃ 1 E(s)| ≤ α(r, L) := Mr (1 + r) + LMr r + LMr
λ 2λ
Suppose now that the curve c(·, t) is contained in B(0, R) with R = R(t):
then R0 (t) ≤ α(R(t), L);
We use the Euler method, in the class of curve evolutions c(θ, t) that are
Lipschitz in both variables. We pick a N ; let tj := j2−N ; let so that, defining
for N big enough, we will have that the energy E(c(θ, tj ) is decreasing in j; so
L will be bounded from above, and then
c(θ, t) − c(θ, s) ≤ α
61
188. Related works
A family of metrics similar to H j above (but for the length dependent scale
factors) was studied in [Michor and Mumford, 2007]: the Sobolev-type weak
Riemannian metric on Imm(S 1 , R2 )
Z X n Z
hh, kiGnc = hDsi h, Dsi ki ds = hAn (h), ki ds where
S 1 i=0 S1
n
X
An (h) = An,c (h) = (−1)i Ds2i (h)
i=0
192. Metric
Definition 8.19. If h, k ∈ Tc M are decomposed as above, then we define the
Riemannian metric H as
Z
. t t
hh, kiH = h · k + λl h k + λd Ds hd · Ds k d ds,
l l
(38)
c
62
where the first two products are the Euclidean dot products, the last term is a
normalized geometric Sobolev metric, and λl , λd > 0 are (constant) weights.
The third term of the metric may be rewritten directly as a function of
h ∈ Tc M by using the identity
Z Z Z Z
d d
Ds h · Ds k ds = Ds h · Ds k ds − Ds h · Ds c ds Ds c · Ds k ds . (39)
c c c c
194. Md
We will find it useful to define a submanifold Md of M .
Definition 8.20. Let
that is the space of all smooth immersed curves with unit length and with
centroid at the origin.
This is a smooth submanifold of M , when modeled on C ∞ (the proof follows
from a corollary of Nash–Moser theorem, see [Hamilton, 1982]).
Its tangent space at ξ ∈ M is
195. Isometry
Theorem 8.21. Let λl = λd = 1 for simplicity. We define a map and its
inverse
c − avgc (c)
c∈M 7→ v = avgc (c) , l = log L(c) , ξ =
L(c)
with (v, l, ξ) ∈ R2 × R × Md
2
(v, l, ξ) ∈ lR × lR × Md 7→ v + el ξ ∈ M
C(t, ·) = L1−t (c0 )Lt (c1 )C̃(t, ·) + (1 − t)avgc (c0 ) + tavgc (c1 ) .
63
Corollary: along a geodesic of curves, the centroid translates uni-
formly, and the scale/length changes exponentially.
So we have to compute geodesics in Md , that is, in St(L2 , 2).
197. Square root lifting
Since the space Md is the same studied in [Younes et al., 2008] (and with
the same metric) we again use the square root lifting 8.9.
Let e, f be real and imaginary part of the square root lifting, so that
1
ξ 0 (θ) = (e + if )2 .
2
We map ξ ∈ Md to (e, f ) ∈ St(L2 , 2) the Stiefel manifold of orthonormal
pairs in L2 = L2 ([0, 2π] → lR).
Z 2π Z 2π Z 2π
St(L2 , 2) := (e, f ) | e2 dθ = 1 = f 2 dθ, ef dθ = 0
0 0 0
2 2
with the induced Riemannian metric from L × L ; and again by Prop. 8.10 this
is an isometry.
198. Computing Geodesics in St(L2 , 2)
n
St(R , p) is the set of n × p matrixes with orthonormal columns.
Proposition 8.22 (Lippert in [Edelman et al., 1998]). Let Y : [0, 1] → St(Rn , p)
be a path, then the geodesic equation is
Ÿ + Y (Ẏ T Ẏ ) = 0.
The solution is
A −S
(Y (t)eAt , Ẏ (t)eAt ) = (Y (0), Ẏ (0)) exp t
I A
64
202. Shape Space(s)
This is the scheme of Shape spaces (of smooth curves) and their quotients.
lR ⊗ lR2 ⊗ Md o / lR ⊗ lR2 ⊗ St(C ∞ , 2)/{±1} immersed curves
Md o / St(C ∞ , 2)/{±1} immersed curves up to scale,
translation
immersed curves up to scale,
Md /Rot o / Gras(C ∞ , 2)/{±1}
translation, rotation
geometric curves up to scale,
Md /Rot/Diff(S 1 ) o / Gras(C ∞ , 2)/{±1}/?
translation, rotation
(Note that, to ensure that all quotients are manifolds, only freely immersed
curves should be considered)
65
Part IV
Sobolev Active Contours
9 Sobolev Active Contours
Most of what follows was originally presented in [Sundaramoorthi et al., 2007a]
In this part we present the Sobolev Active Contours, with Applications &
Experiments.
206. The Important Properties of Sobolev Active Contours
• PDE order reducing property: Sobolev gradient flows are lower order than
H 0 (we already saw an example, and more to come)
• Coarse-to-fine deformation property
– automatically favors coarse-scale deformations before moving to fine-
scale deformations
– ideal for visual tracking
• Sobolev-type norms induce well-defined Riemannian metrics on space of
curves [Michor and Mumford, 2007],[Mennucci et al., 2008]
9.1 Gradients
We now add some more comments on the gradient ∇E(c) that was defined in
3.23
207. Interpretation of the Gradient
DE(c)(v) |DE(c)(h)|
= sup . (43)
kvkc h∈Tc M \{0} khkc
In a sense, the focus of research for 20 years has been on the numerator in
eqn. (43) – whereas we now focus on the denominator.
66
9.2 Why H̃ j is better than H j
208. H̃ 1 : Easier gradients
Let E(c) be an energy of curves. (Let λ = 1, len(c) = 1 for simplicity.)
Gradients are implicitely define by:
Tc M = lR2 ⊕ Dc M
with n o
Dc M := h : S 1 → lR2 | avgc (h) = 0 .
where g = ∇H 1 E and f = ∇H 0 E
• nearly same computational speed as H 0 gradient; moreover the result-
ing flow is more stable, so it works fine with larger choice of time step
discretization. For this reason, H̃ 1 Sobolev Active Contours are very fast.
67
• Add a regularization term E(c) + εR(c) and minimize this by H 0 gradient
descent. The numerical complexity of the ∇H 0 R is of order n.
9.3 Experiments
212. Sobolev Norms Robust to Noise and Local Minima
Region-Based
Z Segmentation:
Z E =Chan-Vese, + regularization for H 0 flows.
E(c) = (I − u)2 dA + (I − v)2 dA
cin cout
−∇H 0 E + ακN
−∇H 0 E + βκN
(β >> α)
−∇H̃ 1 E
68
. . . . . . . . . . . . . . . . . . . . . . . . time → . . . . . . . . . . . . . . . . . . . . . . . .
H0
. . . . . . . . . . . . . . . . . . . . . . . . time → . . . . . . . . . . . . . . . . . . . . . . . .
“Rigidified”
(Translation favored)
. . . . . . . . . . . . . . . . . . . . . . . . time → . . . . . . . . . . . . . . . . . . . . . . . .
Sobolev
Note: The same result holds for Sobolev even without perturbing the local
minimum (due to numerical noise).
Numerically, these local minima no longer exist!!!
215. Experiment for Tracking Benefits
Region-based energy using means and variances.
H 0 active contour
69
10 New Applications With Sobolev Active Con-
tours
Most of what follows was originally presented in [Sundaramoorthi et al., 2007b]
and [Sundaramoorthi et al., 2008a]
• Both energies have ill-posed H 0 gradient descents (backward heat flow compo-
nents arising from length increasing terms).
• Increasing length is well-posed with Sobolev gradient descent.
Notation: φ : R2 → R, κ = signed curvature, N = inward normal vector.
c − avgc (c)
∇H̃ 1 E1 = E1 − (φcs ) ∗ Ks − ∇φ ∗ K
L
70
(where cs = Ds c, css = Dss c).
The second term, which was the cause of ill-posedness of the H 0 gradient
c−avgc (c)
descent, is converted to E1 L , which is a stable rescaling of curve.
220. Another Example of Energy Precluded by H 0
Weighted Scale-Invariant Type Elastic Energy:
Z
E(c) = L φ(c(s))κ2 (s) ds
c
∇H 0 E(c) = −Ecss + 2L2 ∂ss (φcss ) + 3L2 ∂s (φκ2 cs ) + L2 κ2 ∇φ
E 2
∇H̃ 1 E(c) = − 2 (avgc (c) − c) + (avgc ((φcss )) − φcss )−
λL λ
− 3L2 K 0 ∗ (φκ2 cs ) + L2 K ∗ (κ2 ∇φ).
Notation: φ : R2 → R, κ = signed curvature, cs = Ds c, css = Dss c.
• If φ = constant, then the energy is scale-invariant.
• Fourth order H 0 gradient descent (numerically difficult to implement; level
set methods?)
• Sobolev gradient descent is second order; numerically easier to implement
(H 0 gradient descent)
71
E(c) = Edata (c)
Z
+ αL(c) κ2 (s) ds
c
Z Z
φ(c(s)) L−1 + αLκ2 (s) ds
Eold (c) = φ(c(s)) ds, Enew (c) =
c c
225. Length-Increasing
Z Edge-Based
Z Model
Length Increasing Model: Einc (c) = φ(c(s)) ds − α κ2 (s) ds Typical Edge-
Z c Z c
72
Left to right: initial contour, minimizing Ebal α = 0.2, 0.25, 0.4 using H 0 , and increas-
ing weighted Einc α = 0.1 using Sobolev. The contour expands to enclose the entire
image (fifth image).
Input x y
System Mesurement
e
Error
System x̂ Measurement ŷ
Model Model
pk = pk−1 + vk−1 pk = exppk−1 (vk−1 ) ∈ M
→
vk = vk−1 + ηk−1 vk = Pk−1 (vk−1 + ηk−1 ) ∈ Tpk M
73
(noise affect only velocity; no input, that is, u = 0; Pk−1 is parallel transport
along the geodesic related to exppk−1 (vk−1 )).
The measurement model becomes
yk = p k + ξk → yk = exppk (ξk ) ∈ M
(in the estimator we only update the velocity). The goal is to choose Kp , Kv > 0
so that the observer tracks the system.
230.
Comparison between frame-by-frame segmentation, i.e., no dynamics (left),
dynamical model only on the motion (scales and translation) in the middle col-
umn and dynamical model on both the motion and deformation (right column).
Red: p̂k , blue: v̂k , and green: yk .
74
Frame=36 frame34 frame34
0 0 0
50 50 50
50 0 50 100 150 200 250 50 0 300 50 100 150 200 250 50 0 300 50 100 150 200 250 300
50 0 50 100 150 200 250 50 0 300 50 100 150 200 250 50 0 300 50 100 150 200 250 300
50 0 50 100 150 200 250 50 0 300 50 100 150 200 250 50 0 300 50 100 150 200 250 300
50 0 50 100 150 200 250 50 0 300 50 100 150 200 250 50 0 300 50 100 150 200 250 300
50 0 50 100 150 200 250 50 0 300 50 100 150 200 250 50 0 300 50 100 150 200 250 300
0 50 100 150 200 250 0 300 50 100 150 200 250 0 300 50 100 150 200 250 300
75
Part V
Riemannian manifolds of curves
In this part we study some mathematical properties of the Riemannian mani-
fold of (freely)immersed curves, when it is endowed with the metrics that were
presented previously.
We recall the definitions: the projection onto the normal πN v = v − hv, T iT
and on the tangent πT v = hv, T iT seen in slide 144 on page 47
247. Reparameterization to normal motion
Proposition 11.1. Given any smooth homotopy C of immersed curves, there
exists a reparameterization given by a parameterized family of diffeomorphisms
Φ : [0, 1] × S 1 → S 1 , so that setting
πT̃ ∂v C̃ = 0 , πÑ ∂v C̃ = πN ∂t C
C C’
76
The dotted line represent the trajectory of a point on the curve.
The two motions C and C̃ coincide in B but are represented differently in
M.
Which one is best? Both “translation” and “orthogonal motions” seem a
good idea at first glance.
12 Horizontality
250. Recall: parameterization invariance
Let M be the manifold of (freely)immersed curves.p Let hh1 , h2 ic be a Rie-
mannian metric G, for h1 , h2 ∈ Tc M ; let khkc = hh, hic be its associated
R1
norm. Let E(γ) := 0 kγ̇(v)k2γ(v) dv be the action. Let C : [0, 1] × S 1 → S 1 be
a smooth homotopy, and γ(t) = C(t, ·) be the associate path. We recall these
two definition that we saw in slide 106. The metric G is
• curve-wise p.i. when the metric does not depend on the parameteri-
zation of the curve, that is kh̃kc̃ = khkc when c̃(t) = c(ϕ(t)) and h̃(t) =
h(ϕ(t));
77
The orbit is Oc = [c] = {c ◦ φ | φ} = Π−1 ({c}).
(Orbits Oc are dotted, The vertical space Vc is the tangent to Oc :
spaces Wc and Vc are dashed.)
M Vc := Tc Oc ⊂ Tc M
Oc
that can be explicitly written as
M Oc
Wc
c
Vc
Π
B
[c]
78
Summarizing, the homotopy-wise invariance implies that, if h1 , h2 are tan-
gent to c at all points, then hh1 , h2 i = 0. p
So, the Riemannian metric is not a metric on M . ( hh1 , h2 i is a
seminorm).
1
for all choices of b : S → lR.
Proposition 12.2. G⊥ is homotopy-wise p.i.
Corollary 12.3. G homotopy-wise p.i. ⇔
khkG = kh + bc0 kG
for all b.
So we understand/study/design the metric G on B by:
1. choose a metric G on M that is curve-wise p.i
2. G generates the horizontal space W = V ⊥
3. project G on W to define G⊥ that is homotopy-wise p.i.
c0 Oc
γ’
Wc is orthogonal to the orbits, Vc is tan-
c1oφ ’ gent to the orbits.
The path γ (that moves in some tracts
Wc γ
along the orbits, with γ̇ ∈ Vγ ) is longer
c
than the path γ 0 .
c1o φ
Vc
Lemma 12.4. 1. Let C̃(t, θ) = C(t, ϕ(t, θ)) where ϕ(t, ·) is a diffeomor-
phism for all fixed t. Minimize
min EG (C̃)
ϕ
79
Then at the minimum C̃ ∗ , ∂t C̃ ∗ is horizontal at every point, and EG (C̃ ∗ ) =
EG⊥ (C̃ ∗ )
2. So the distance can be equivalently computed as the infimum of LenG⊥ or
of LenG ; and distances are equal, dG⊥ = dG .
(A more general Lemma can be proven for Finsler metrics). So the hori-
zontal G⊥ is inevitable.
The proof of the above lemma is based on the validity of “the lifting Lemma”:
Lemma 12.5 (lifting Lemma). Given any smooth homotopy C of immersed
curves, there exists a reparameterization given by a parameterized family of
diffeomorphisms Φ : [0, 1] × S 1 → S 1 , so that setting
C̃(t, θ) := C(t, Φ(t, θ))
we have that ∂t C̃ is in the horizontal space WC at all times t.
For the metric H 0 , this reduces to lemma 11.1 in this paper; for Sobolev-type
metrics, the proof is in §4.6 in [Michor and Mumford, 2007].
262. Horizontality according to H 0
The horizontal space Wc w.r.t. H 0 is
Wc := {h : h(s) ⊥ c0 (s)∀s}
The horizontal version of H 0 is
D E Z
h1 , h2 0,⊥ := πN h1 · πN h2 ds
H S1
in closed form, and this needs (pseudo)differential operators. See [Michor and
Mumford, 2007] for details.
But remember: in this case,
80
“a good movement of a curve is not necessarily
a movement that is orthogonal to the curve.”
13 Momenta
266. The method of momenta
What follows comes from §2.5 in [Michor and Mumford, 2007].
Consider again the action of a group G on a Riemannian manifold M . Most
groups that act on curves are smooth differentiable manifolds themselves, and
the group operations are smooth. So they are Lie groups.
In our cases of interest, the actions are smooth as well.
We recall that the action satisfies the properties: e · m = m (with e ∈ G the
identity), and for any g, h ∈ G, m ∈ M
h · (g · m) = (h · g) · m . (49)
81
• The translation is represented by lRn , that is a vector space, so the tangent
directions are ξ ∈ lRn ; the action is just v, c 7→ v + c; then ζ = ξ (and is
constant in θ).
ζ = ξ(θ)c0 (θ)
we get
Z
0
hζ, γ̇iH̃ 1 = avgc (ξγ ) · avgc (γ̇) + λL 2
Ds (ξγ 0 ) · Ds γ̇ = constant;
integrating by parts,
Z
avgc (ξγ 0 ) · avgc (γ̇) − λL2 (ξγ 0 ) · Dss γ̇ = constant;
82
272. Momenta for the metric H
Proposition 13.2 (Momenta, Prop.3.6 in [Sundaramoorthi et al., 2011]). Sup-
pose that C is a geodesic, then the following quantities are conserved:
• [translation] ∂t avgc (C) is constant;
• [scaling] ∂t (log L(C)) is constant;
• [rotation] the angular momentum (that may be expressed in two equal
ways) Z Z
(ADs C) · (∂t Ds C) ds = (ADs C) · (Ds ∂t C) ds (50)
C C
is constant, for any antisymmetric matrix A; when the curve is planar,
this means that Z
κ T · ∂t C ds
C
is constant;
14 Existence of Geodesics
We present some results on how the direct method in Calculus of Variations
may be applied to the problem of existence of geodesics.
∂C
∂C ∂v
∂θ
Len(C) ∼ sin(α)
83
We start with a zizzag homothopy C; now
t = 1/2
∂C
t = 1/4 ∂C ∂v
∂θ
t = 1/8
t=0 α
Note that, if the topology τ is not defined by a metric, then we can introduce a
different condition:
84
f is sequentially lower semicontinuous, iff Γf |Ω = f .
If X is a metric space, then “sequentially lower semicontinuous functions” are
“lower semicontinuous functions”, and viceversa; so we may drop the adjective
“sequentially”.
279. The direct method
Definition 14.4. Consider again a function f : X → lR: it is called coercive
in X if ∀M ∈ lR,
{x ∈ X : f (x) ≤ M }
is contained in a compact set.
Proposition 14.5. If f is coercive in X and sequentally lower semi continuous
then it admits a minimum on any closed set, that is, for all C ⊂ X closed there
exists x ∈ C s.t.
f (x) = min f (y) .
y∈C
This result is one of the pillars of the modern Calculus of Variations. We will
see that unfortunately this result may not be applied directly to the problem at
hand.
F
E The thick line is the curve C5 (u, 0). The
thick arrows represent the normal part of
the velocity, while the thin dashed arrows
A
represent the tangent part of the velocity.
B
In the part DE (respectively F A) of the curve, vertical segments move apart
(resp., together) as the thick arrows indicate, with horizontal velocity with norm
1; as the curve unravels, it is forced to move also parallel to itself.
The generic curve movement Cj (θ, t) has 2j wheels. Let l = l(t, j) be the
distance from feature point D to feature point E. Then ∂t l → ∞ if j → ∞.
85
14.5 Curvature(s) as measures
282. Curvature H as a measure
Definition 14.7 (H as a measure). In general, we will say that a curve c ∈
W 1,1 (S 1 ) admits mean curvature in the measure sense if there exists a
vector valued measure H on S 1 such that
Z Z
T (s)∂s φ(s) ds = − φ(s) dH(s) ∀φ ∈ C ∞ (S 1 )
I I
that is Z Z
T (θ)∂θ φ(θ) dθ = − φ(θ) |c0 | dH(θ) ∀φ ∈ C ∞ (S 1 )
I I
86
Definition 14.8. Consider a continuous function f : lRn×2 → lR such that
f (W, V ) = g(W, V, W × V )
is W 1,p -weakly-lower-semi-continuous. 7
|πW ⊥ V |2 |W |2 = |V |2 |W |2 − hV, W i2 = |V × W |2
that is easily checked. (In lR3 we have hV, W i = |V ||W | cos α and |V × W | =
|V ||W | sin α, where α is the angle between the two vectors).
So |πW ⊥ V |2 |W |2 is polyconvex; then
ZZ
|π∂θ C ⊥ ∂t C|2 |∂θ C|2 dθ dt
Given an energy and two continuous curves c0 and c1 , we will try to find a
homotopy that minimizes this energy, searching in the class C.
287. Classes C and F
Definition 14.10 (class C). Let I = S 1 × [0, 1]. Let C be the class of all
homotopies C : I → lRn , continuous on I and locally Lipschitz in S 1 × (0, 1),
such that c0 = C(·, 0) and c1 = C(·, 1).
Definition 14.11 (class F of prescribed-parameter homotopies). We also fix
a measurable positive function l : [0, 1] → lR+ and restrict our attention to a
family F of homotopie such that
for all θ, t.
In particular, len(C) = 2πl(t).
288. Semicontinuity in F
6 The more general definition and the properties may be found sec. 4.1 in [Buttazzo, 1989],
87
Proposition 14.12. Let p ≥ 2. The action E is W 1,p -weakly-lower-semi-
continuous in the class F.
More precisely: let
C ∈ F, (Ch )h ⊂ F
if Ch * C weakly in W 1,p , that is,
Ch → C, ∂v Ch * ∂v C ∂θ Ch * ∂θ C
in Lp , and
l(v) = |Ċ(θ, v)| = |Ċh (θ, v)|
then
lim inf E(Ch ) ≥ E(C)
h
is l.s.c.
• for any homotopy C and any continuous g : [0, 1] → lR+ , define eg (C)(v) :
[0, 1] → lR+ ,
g(v)β−α S 1 f (∂θ C, ∂v C)α dθ if g(v) > 0
R
eg (C) :=
0 if g(v) = 0
and let Z 1 m Z
X ai+1
Ê(C) := eg (C)dv = egi (C)dv
0 i=1 ai
then we apply the previous reasoning to all addends and conclude that
lim inf Ê(Ch ) ≥ Ê(C)
h
88
• Suppose l is continuous and l ≥ 0. Let τ be the class of piecewise functions
g ≥ 0 defined as in (51), such that on any interval [ai , ai+1 ), either gi = 0
or
gi ≥ sup l .
[ai ,ai+1 )
and
0 if v ∈ Aj,i and Ij,i = 0
gj (v) =
Sj,i if v ∈ Aj,i and Ij,i > 0
Then Z 1
Eα,β (C) = sup egj (C)dv
j 0
89
Afterword
291. Acknowledgments
I gratefully acknowledge that some part of the theory and all experiments ex-
posited were discussed and developed in joint work with Prof. Yezzi (GaTech)
and Prof. Sundaramoorthi (KAUST), and Dr. Duci.
References
C. J. Atkin. The Hopf-Rinow theorem is false in infinite dimensions. Bull.
London Math. Soc., 7(3):261–266, 1975. doi: 10.1112/blms/7.3.261.
A. Blake and M. Isard. Active Contours. Springer Verlag, 1998.
90
Bernard Dacorogna. Weak Continuity and Weak Lower Semicontinuity of Non-
Linear Functionals, volume 922 of Lecture Notes in Mathematics. Springer–
Verlag, 1982.
Alessandro Duci and Andrea Mennucci. Banach-like metrics and metrics of
compact sets. 2007.
Alessandro Duci, Anthony J. Yezzi, Sanjoy K. Mitter, and Stefano Soatto.
Shape representation via harmonic embedding. In International Confer-
ence on Computer Vision (ICCV03), volume 1, pages 656 – 662, Washing-
ton, DC, USA, 2003. IEEE Computer Society. ISBN 0-7695-1950-4. doi:
10.1109/ICCV.2003.1238410.
Alessandro Duci, Anthony J. Yezzi, Stefano Soatto, and Kelvin Rocha. Har-
monic embeddings for linear shape. J. Math Imaging Vis, 25:341–352, 2006.
doi: 10.1007/s10851-006-7249-8.
A. Edelman, T. Arias, and S. Smith. The geometry of algorithms with or-
thogonality constraints. SIAM J Matrix Analy Appl, 20:303–353, 1998. doi:
10.1137/S0895479895290954. arXiv:physics/9806030v1 (1998).
J. Eells and K. D. Elworthy. Open embeddings of certain Banach manifolds.
Ann. of Math. (2), 91(3):465–485, May 1970. URL http://www.jstor.org/
stable/1970634.
Ivar Ekeland. The Hopf-Rinow theorem in infinite dimension. J. Differential
Geom, 13(2):287–301, 1978.
A. T. Fomenko. The Plateau problem. Studies in the development of modern
mathematics. Gordon and Breach, 1990.
M. Gage and R. S. Hamilton. The heat equation shrinking convex plane curves.
Journal of Differential Geometry, (23):69–96, 1986.
Joan Glaunès, Alain Trouvé, and Laurent Younes. Modeling planar shape vari-
ation via Hamiltonian flows of curves. In Anthony Yezzi and Hamid Krim,
editors, Analysis and Statistics of Shapes, Modeling and Simulation in Sci-
ence, Engineering and Technology, chapter 14. Birkhäuser - Springer - Verlag,
2005.
M. Grayson. The heat equation shrinks embedded planes curves to round points.
J. Differential Geometry, (26):285–314, 1987.
Nathaniel Grossman. Hilbert manifolds without epiconjugate points. Proc.
Amer. Math. Soc., 16:1365–1371, 1965. ISSN 0002-9939. doi: 10.2307/
2035933.
Richard S. Hamilton. The inverse function theorem of Nash and Moser. Bull.
Amer. Math. Soc. (N.S.), 7(1):65–222, 1982. ISSN 0273-0979.
Philipp Harms and Andrea Mennucci. Geodesics in infinite dimensional Stiefel
and Grassmann manifolds. 2010. http://cvgmt.sns.it/papers/harmen10/.
J. Itoh and M. Tanaka. The Lipschitz continuity of the distance function to the
cut locus. Trans. A.M.S., 353(1):21–40, 2000.
91
H. Karcher. Riemannian center of mass and mollifier smoothing. Communica-
tions on Pure and Applied Mathematics, 30:509–541, 1977.
Eric Klassen, Anuj Srivastava, Washington Mio, and Shantanu H. Joshi. Analy-
sis of planar shapes using geodesic paths on shape spaces. IEEE Transactions
on Pattern Analysis and Machine Intelligence, 26:372–383, 2004. ISSN 0162-
8828. doi: 10.1109/TPAMI.2004.1262333.
Serge Lang. Fundamentals of differential geometry. Springer–Verlag, 1999.
92
Washington Mio and Anuj Srivastava. Elastic-string models for representation
and analysis of planar shapes. In Conference on Computer Vision and Pattern
Recognition (CVPR), June 2004b. URL http://stat.fsu.edu/~anuj/pdf/
papers/CVPR_Paper_04.pdf.
D. Mumford. Slides of the gibbs lectures. URL http://www.dam.brown.edu/
people/mumford/Papers/Gibbs.pdf.
D. Mumford and J. Shah. Boundary detection by minimizing functionals. In
Proc. IEEE Conf. Computer Vision Pattern Recognition, 1985.
D. Mumford and J. Shah. Optimal approximations by piecewise smooth func-
tions and associated variational problems. Comm. Pure Appl. Math., 42:
577–685, 1989.
S. Osher and J. Sethian. Fronts propagating with curvature-dependent speed:
algorithms based on the Hamilton-Jacobi equations. J. Comp. Physics, 79:
12–49, 1988.
N. Paragios and R. Deriche. Geodesic active regions: a new paradigm to deal
with frame partition problems in computer vision. International Journal of
Visual Communication and Image Representation, Special Issue on Partial
Differential Equations in Image Processing, Computer Vision and Computer
Graphics, 13(2):249–268, June 2002a.
N. Paragios and R. Deriche. Geodesic active regions and level set methods for
supervised texture segmentation. International Journal of Computer Vision,
46(3):223, 2002b.
T. R. Raviv, N. Kiryati, and N. Sochen. Unlevel-set: Geometry and prior-based
segmentation. In Proc. European Conf. on Computer Vision, 2004.
Remi Ronfard. Region based strategies for active contour models. International
Journal of Computer Vision, 13(2):229–251, October 1994. URL http://
perception.inrialpes.fr/Publications/1994/Ron94.
M. Rousson and N. Paragios. Shape priors for level set representations. In Proc.
European Conf. Computer Vision, volume 2, pages 78–93, 2002.
Walter Rudin. Functional analysis. McGraw–Hill, 1973.
K. Siddiqi, Y. B. Lauzière, A. Tannenbaum, and S. Zucker. Area and length
minimizing flows for shape segmentation. IEEE Transactions on Image Pro-
cessing, 3(7):433–443, 1998.
L. Simon. Lectures on Geometric Measure Theory, volume 3 of Proc. Center for
Mathematical Analysis. Australian National University, Canberra, 1983.
Stefano Soatto and Anthony J. Yezzi. DEFORMOTION: Deforming motion,
shape average and the joint registration and segmentation of images. In ECCV
(3), pages 32–57, 2002.
Anuj Srivastava, Shantanu H. Joshi, Washington Mio, and Xiuwen Liu. Sta-
tistical shape analysis: Clustering, learning, and testing. IEEE Transactions
on Pattern Analysis and Machine Intelligence, 27:590–602, 2005. ISSN 0162-
8828. doi: 10.1109/TPAMI.2005.86.
93
Ganesh Sundaramoorthi, Anthony Yezzi, and Andrea Mennucci. Sobolev active
contours. In Nikos Paragios, Olivier D. Faugeras, Tony Chan, and Christoph
Schnörr, editors, VLSM, volume 3752 of Lecture Notes in Computer Science,
pages 109–120. Springer, 2005. ISBN 3-540-29348-5. doi: 10.1007/11567646_
10.
Ganesh Sundaramoorthi, Jeremy D. Jackson, Anthony Yezzi, and Andrea Men-
nucci. Tracking with Sobolev active contours. In Conference on Computer
Vision and Pattern Recognition (CVPR06). IEEE Computer Society, 2006.
ISBN 0-7695-2372-2. doi: 10.1109/CVPR.2006.314.
Ganesh Sundaramoorthi, Anthony Yezzi, and Andrea Mennucci. Sobolev
active contours. Intn. Journ. Computer Vision, 2007a. doi: 10.1007/
s11263-006-0635-2.
Ganesh Sundaramoorthi, Anthony Yezzi, Andrea Mennucci, and Guillermo
Sapiro. New possibilities with Sobolev active contours. In Scale Space Vari-
ational Methods 07, 2007b. URL http://ssvm07.ciram.unibo.it/ssvm07_
public/index.html. "Best Numerical Paper-Project Award" ; also [Sun-
daramoorthi et al., 2008b].
Ganesh Sundaramoorthi, Anthony Yezzi, and Andrea Mennucci. Coarse-to-fine
segmentation and tracking using Sobolev Active Contours. IEEE Transac-
tions on Pattern Analysis and Machine Intelligence (TPAMI), 2008a. doi:
10.1109/TPAMI.2007.70751.
Ganesh Sundaramoorthi, Anthony Yezzi, Andrea Mennucci, and Guillermo
Sapiro. New possibilities with Sobolev active contours. Intn. Journ. Computer
Vision, 2008b. doi: 10.1007/s11263-008-0133-9.
Ganesh Sundaramoorthi, Andrea Mennucci, Stefano Soatto, and Anthony Yezzi.
A new geometric metric in the space of curves, and applications to tracking
deforming objects by prediction and filtering. SIAM Journal on Imaging
Sciences, 4:109–145, 2011. doi: 10.1137/090781139.
Alain Trouvé and Laurent Younes. Local geometry of deformable templates.
SIAM J. Math. Anal., 37(1):17–59 (electronic), 2005. ISSN 0036-1410.
A. Tsai, A. Yezzi, W. Wells, C. Tempany, D. Tucker, A. Fan, E. Grimson, and
A. Willsky. Model-based curve evolution technique for image segmentation. In
Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, volume 1,
pages 463–468, Dec. 2001a. doi: 10.1109/CVPR.2001.990511.
A. Tsai, A. Yezzi, and A. S. Willsky. Curve evolution implementation of the
mumford-shah functional for image segmentation, denoising, interpolation,
and magnification. IEEE Transactions on Image Processing, 10(8):1169–1186,
Aug 2001b.
L. A. Vese and T. F. Chan. A multiphase level set framework for image seg-
mentation using the mumford and shah model. Int. J. Computer Vision, 50
(3):271–293, 2002.
C. Xu and J. L. Prince. Snakes, shapes, and gradient vector flow. IEEE Trans-
actions on Image Processing, 7(3):359–369, March 1998.
94
A. Yezzi and A. Mennucci. Metrics in the space of curves. Preprint,
arXiv:math.DG /0412454, May 2005a.
95