Term Structure of Interest Rates Explained
Topics covered
Term Structure of Interest Rates Explained
Topics covered
i.c
CM2-18: The term structure of interest rates Page 1
ng
si
om
o m
as
.m
w
w
w
The term structure of
interest rates
Syllabus objectives
4.5 Models of the term structures of interest rates
1. Explain the principal concepts and terms underlying the theory of a term
structure of interest rates.
2. Describe the desirable characteristics of models for the term structure of
interest rates.
3. Apply the term structure of interest rates to modelling various cash flows,
including calculating the sensitivity of the value to changes in the term
structure.
4. Describe, as a computational tool, the risk-neutral approach to the pricing
of zero-coupon bonds and interest rate derivatives for a general
one-factor diffusion model for the risk-free rate of interest.
5. Describe, as a computational tool, the approach using state price deflators
to the pricing of zero-coupon bonds and interest rate derivatives for a
general one-factor diffusion model for the risk-free rate of interest.
6. Demonstrate an awareness of the Vasicek, Cox-Ingersoll-Ross and
Hull-White models for the term structure of interest rates.
7. Discuss the limitations of these one-factor models and show an awareness
of how these issues can be addressed.
ng
si
om
o m
0 Introduction
as
.m
w
In this chapter we will look at stochastic models for the term structure of interest rates. In
w
w
particular, we will focus on models that are framed in continuous time and which are
arbitrage-free.
Continuous-time models are generally used for their greater tractability compared to
discrete-time ones. An arbitrage-free model of the term structure is one that generates arbitrage-
free bond prices.
There are two main types of models used to describe interest rates mathematically:
1. The Heath-Jarrow-Morton approach uses an Ito process to model the forward rate for an
investment with a fixed maturity. We will not consider this approach here.
2. Short-rate models use an Ito process to model the short rate. We will look at three
specific models of this type: the Vasicek model, the Cox-Ingersoll-Ross model and the
Hull-White model.
Ito processes are a key feature of these models, so it might be helpful to review the topics of
Brownian motion, Ito’s Lemma and stochastic differential equations from earlier in the course.
Interest rate modelling is the most important topic in derivative pricing. Interest rate
derivatives account for around 80% of the value of derivative contracts outstanding, mainly
swaps and credit derivatives used to support the securitisation of debt portfolios. More
philosophically, is the fact that at any one time there will be a multitude of contracts (bonds)
written on the same underlying (an interest rate), and derivative pricing is principally
concerned with pricing derivatives coherently.
The Core Reading in this chapter is adapted from the course notes written by Timothy
Johnson.
ng
si
om
o m
1 Notation and preliminaries
as
.m
w
1.1 Zero-coupon bonds
w
w
The multitude of traded instruments leads to the first challenge in interest rate modelling:
the multitude of definitions of interest rates.
Modelling interest rates is more complicated than modelling share prices because interest rates
depend not only on the current time (which we will denote by t ), but also on the term of the
investment. For example, an investor with a 10-year bond will normally earn a different rate of
interest than an investor with a 5-year bond.
The basic debt instrument is the discount bond (or, equivalently, the zero-coupon bond).
This is an asset that will pay one unit of currency at time T and is traded at time t T . If the
interest rate, R, is constant between t and T then we can say that the price of the discount
bond purchased at t and maturing at T is given by P (t ,T ) where:
1
P (t ,T )
(1 R (t ,T ))(T t )
The spot rate R(t ,T ) is the effective rate of interest applicable over the period from time t to
time T that is implied by the market prices at time t .
The discrete bond yield calculated from discount bond prices is:
1
R (t , t ) 1
P (t , t )1/
1
R(t ,T ) 1
P(t ,T )1/(T t )
nm
1 R
1
P (0, n) m
1
e r (t ,T ) P (t ,T )
(1 R (t ,T ))
ln P (t ,T )
r (t ,T )
ng
si
om
m
The spot rate r(t ,T ) is the continuously compounded rate of interest applicable over the period
o
as
from time t to time T that is implied by the market prices at time t .
.m
w
w
1.2 Yield curves
w
Fixing t 0 and plotting yield, R(0,T ) or r (0,T ) , against maturity, T, gives the yield curve
which gives information on the term structure, how interest rates for different maturities are
related. Typically, the yield curve increases with maturity, reflecting uncertainty about
far-future rates. However, if current rates are unusually high, the yield curve can be
downward sloping, and is inverted.
There are various theories explaining the shape of the yield curve. The expectations theory
argues that the long-term rate is determined purely by current and future expected
short-term rates, so that the expected final value of investing in a sequence of short-term
bonds equals the final value of wealth from investing in long-term bonds.
The market segmentation theory argues that different agents in the market have different
objectives: pension funds determine longer-term rates, market makers determine short-term
rates, and businesses determine medium-term rates, which are all determined by the supply
and demand of debt for these different market segments.
The liquidity preference theory argues that lenders want to lend short term while borrowers
wish to borrow long term, and so forward rates are higher than expected future zero rates
(and yield curves are upward sloping).
The figure below shows the yield curve for the UK Government bond market on 31 December
2003. The ‘term’ plotted on the x -axis corresponds to T t .
5.5%
5.0%
4.5%
Yield
4.0%
3.5%
0 5 10 15 20 25 30 35
Term
ng
si
om
m
The circles show the remaining term and the gross redemption yield for each of the available
o
as
bonds. These approximate to the spot rates r(t ,T ) for zero-coupon bonds with corresponding
.m
terms. A mathematical curve has been fitted to these points. At this particular time the yield
w
w
curve had a humped shape.
w
The other curve shown is the forward rate curve f (0,t ,T ) , which is defined in the next section.
r (t ) r (t , t ) R (t , t )
where is a small positive quantity. So the short rate r (t) is the force of interest that applies in
the market at time t for an infinitesimally small period of time . Using the relationship
developed in the opening section we have:
r (t ) ln P (t , t )
The short rate is often the basis of some interest rate models; however, it will not generate,
on its own, discount bond prices.
1
P (0, t ) T t
F (0, t ,T ) 1
P (0,T )
At time t we can consider the market prices of two investments, one maturing at time t and one
at a later time T . These two prices will imply a certain rate of interest applicable between time t
and time T . This is F (0,t ,T ) . In an arbitrage-free market, it represents the rate of interest at
which we can agree at time 0 to borrow or lend over the period from t to T .
F(0,t,T)
0 t T
ng
si
om
o m
Question
as
.m
Suppose that the current time corresponds to t 5 and that the force of interest has been a
w
w
w
constant 4% pa over the last 5 years. Suppose also that the force of interest implied by current
market prices is a constant 4% pa for the next 2 years and a constant 6% pa thereafter.
If T 10 and S 15 , write down or calculate each of the four quantities P(t ,T ) , r (t) , f (t ,T , S)
and r (t ,T ) using the notation above.
Solution
4% 6%
r(5) 0.04
f (5,10,15) 0.06
1
r (5,10) (2 0.04 3 0.06) 0.052
5
r (0,T )T r (0, t )t
f (0, t ,T )
T t
(r (0,T ) r (0, t ))T
r (0, t )
T t
1 P(0 t)
f (0 tT ) log for t T
T t P(0T )
Question
ng
si
om
o m
Solution
as
.m
w
One way to see this is to say that the price for a bond purchased at time 0 and maturing at time T
w
should equal the price for a similar bond maturing at the earlier time t but discounted for the
w
extra period T t at the forward rate applicable over that period. In other words:
1 P(0 t)
f (0 tT ) log
T t P(0T )
r (0,T )T r (0,t)t
f (0,t ,T )
T t
log P(0,T ) T log P(0,t) t
T t
T t
log P(0,t) log P(0,T )
T t
1 P(0 t)
log
T t P(0T )
(r (0,T ) r (0,t))T
f (0,t ,T ) r (0,t)
T t
r (0,T )
f (0,T ) r (0,T ) T
T
ln(P (0,T ))
T
Question
1
Show that r (0T ) log P(0T ) r (0T ) .
T T T
ng
si
om
o m
Solution
as
.m
w
1
By applying the product rule to the equation r (0T ) log P(0T ) , we have:
w
w
T
1
r (0T ) log P(0T )
T T T
1
log P(0T ) log P(0T )
1
T T T T
1 1
log P(0T ) log P(0T ) 2
T T T
1
log P(0T ) r (0T )
T T
So f (0,T ) , the instantaneous forward rate, is the force of interest at future time T implied by the
current market prices at time 0. Then the short rate r(T ) is given by r (T ) f (T ,T ) .
We can generalise lim f (0,t ,T ) f (0,T ) ln(P(0,T )) to give:
t T T
f (t ,T ,T ) f (t ,T ) ln(P(t ,T ))
T
T
P (t ,T ) exp f (t , u, u )du
t
T
exp f (t ,u)du
t
We can also deduce the following relationship, which shows that the spot rate is an average of
the forward rates:
1 1 T
T t t
r (tT ) log P(tT ) f (t ,u u)du
T t
Question
ng
si
om
o m
Solution
as
.m
The factor e 0.1(T t ) equals 1 when T t , but then decreases exponentially to zero as T .
w
w
w
So f (t ,T ) is a weighted average of 0.03 and 0.06, and its graph will increase from 0.03 (ie a force
of interest of 3%) to 0.06.
7%
6%
5%
4%
3%
2%
1%
0%
We can find P(t ,T ) from the relationship P(tT ) exp f (t u)du :
T
t
t
T
exp 0.03e 0.1(ut ) 0.06(1 e 0.1(ut ) ) du
t
T
exp 0.06 0.03e 0.1(ut ) du
T
exp 0.06u 0.3e 0.1(ut)
t
exp 0.06(T t) 0.3e 0.1(T t ) 0.3
1
We can then find r(t ,T ) from the relationship r (tT ) log P(tT ) :
T t
1
r (tT ) log P(tT )
T t
1
T t
0.06(T t) 0.3e 0.1(T t) 0.3
1 e 0.1(T t )
0.06 0.3
T t
ng
si
om
o m
2 Desirable characteristics of a term structure model
as
.m
w
Question
w
w
What do you think term structure models are used for?
Solution
Equilibrium models start with a theory about the economy, such that interest rates revert to
some long-run average, are positive or their volatility is constant or geometric. Based on
the model for (typically) the short rate, the implications for the pricing of assets is worked
out. Examples of equilibrium models are Rendleman and Bartter, Vasicek and Cox-
Ingersoll-Ross.
No-arbitrage models use the term structure as an input and are formulated to adhere to the
no-arbitrage principle. An example of a no-arbitrage model is the Hull-White (one- and two-
factor).
The implication here is that the Vasicek and the Cox-Ingersoll-Ross models permit arbitrage
opportunities. This is not actually true; both of these models result in no-arbitrage bond price
formulae, even though their underlying construction principles are based in economic theory.
We will now discuss characteristics of a term structure model that are regarded as desirable
features.
ng
si
om
m
o
Interest rates should ideally be positive. Banks have to offer investors a positive return to
as
prevent them from withdrawing paper cash and putting it ‘under the bed’. This might be
.m
impractical for a large life office or pension fund but, nevertheless, it typically holds in
w
w
practice.
w
Some term structure models do allow interest rates to go negative.
One such example is the Vasicek model we will see later in this chapter.
Whether or not this is a problem depends on the probability of negative interest rates
within the timescale of the problem in hand and their likely magnitude if they can go
negative. It also depends on the economy being modelled, as negative interest rates have
been seen in some countries.
r (t) and other interest rates should exhibit some form of mean-reverting behaviour.
Again this is because the empirical evidence suggests that interest rates do tend to mean
revert in practice.
This might not be particularly strong mean reversion but it is essential for many actuarial
applications where the time horizon of a problem might be very long.
How easy is it to calculate the prices of bonds and certain derivative contracts?
This is a computational issue. It is no good in a modelling exercise to have a wonderful
model if it is impossible to perform pricing or hedging calculations within a reasonable
amount of time.
This is because we need to act quickly to identify any potential arbitrage opportunities or
to rebalance a hedged position.
Thus we aim for models that either give rise to simple formulae for bond and option
prices, or that make it straightforward to compute prices using numerical techniques.
Does the model, with appropriate parameter estimates, fit historical interest rate data
adequately?
Can the model be calibrated easily to current market data?
If so, is this calibration perfect or just a good approximation? This is an important point
when we are attempting to establish the fair value of liabilities. If the model cannot fit
observed yield curves accurately then it has no chance of providing us with a reliable fair
value for a set of liabilities.
Is the model flexible enough to cope properly with a range of derivative contracts?
ng
si
om
o m
3 Models for the term structure of interest rates
as
.m
w
3.1 The risk-neutral approach to pricing
w
w
We will assume that the short rate is driven by an Ito diffusion:
where:
In actuarial work we are used to assuming a fixed rate of interest in calculations. However, in this
chapter we are considering stochastic models where future interest rates behave randomly. This
means that we need to specify which probability measure we are using.
If we are to have a model that is arbitrage-free then we need to consider the prices of tradeable
assets, with the most natural of these being the zero-coupon bond prices P(t ,T ) .
Modelling the short rate r (t) does not tell us directly about the prices of the assets traded in the
market. To see whether arbitrage opportunities exist or not, we need to examine these prices.
We can use an argument similar to the derivation of the Black-Scholes model using the martingale
approach to demonstrate that:
P(tT ) EQ exp rudu rt
T
t
Since we are considering Markov models, any information about the value of the short rate
before time t is irrelevant, and so we can replace rt with its filtration Ft . We can then write this
equation in the equivalent form:
P(tT ) EQ exp rudu 1 Ft
T
t
Since the payoff from the bond at maturity is P(T ,T ) 1 , we can see that this equation is
analogous to the valuation formula Vt EQ e r (T t ) X Ft , which we met in an earlier chapter for
derivatives based on shares.
ng
si
om
m
If we had a derivative whose payoff X was dependent on the future value of the bond, then its
o
as
value at time t would be:
.m
w
w
Vt EQ exp rudu X Ft
T
w
t
This formula is interesting because it tells us that, if we assume that the market is arbitrage-free,
we can work out the value of the bond at the current time t based on the current short rate rt
and the possible future values of ru for t u T . We don’t need to know the entire term
structure, ie the values of f (t ,u) for t u T , at the current time.
dBt rt Bt dt , B0 1
and:
t
Bt exp ru du
0
According to the standard theory, all discounted assets must be martingales under the
martingale measure, or:
P (t ,T ) 1
EQ Ft
Bt B
T
B
P (t ,T ) EQ t Ft
B
T
The martingale measure (ie the risk-neutral probability measure) is chosen so that this
relationship holds by definition. It is the set of probabilities such that the expected future value of
the payout (which is guaranteed to be P(T ,T ) 1 ) discounted from time T to time t (ie Bt / BT ) is
equal to the value of the contract at time t, P(t ,T ) .
Then we have:
T
P (t ,T ) EQ exp ru du Ft
t
Compare with:
T
P (t ,T ) exp f (t , u )du
t
from earlier.
ng
si
om
m
The difference between P (t ,T ) and Bt is captured by noting:
o
as
.m
T t
w
P (t ,T ) exp f (t , u )du while Bt exp ru du
w
w
t 0
So the value of Bt , the cash account at time t, can be calculated using the known past short rates.
In contrast, the price of the bond is the expectation of unknown future values of the short rate.
However, given the known bond prices at time t, we can derive a set of consistent forward rates.
t t
0
P (0, t ) exp f (0, u )du
while
Bt exp ru du and f (u , u ) ru
0
We now assume that the discount bond price, P (t ,T ) , is some deterministic function of the
short-rate process, rt :
P (t ,T ) g (t , rt )
By using the stochastic product rule from the stochastic calculus chapter we have:
P (t ,T )
d
B t
1
d g (t , rt )Bt
d g(t ,rt ) Bt1 g(t ,rt )d Bt1
t
d g(t ,rt ) Bt1 g(t ,rt )d exp rudu
0
t
d g(t ,rt ) Bt1 g(t ,rt )rt exp rudu dt
0
P (t ,T )
1 g (t , rt ) g (t , rt ) 1 2 g (t , rt ) 2
d Bt (t , rt ) (t , rt ) rt g (t , rt ) dt
Bt t r 2 r 2
t t
g (t , rt )
Bt1 (t , rt )dW t
rt
Question
ng
si
om
o m
Solution
as
.m
w
By Taylor’s theorem in two variables we have:
w
w
P(t ,T ) 1 g(t , rt ) g(t ,rt ) 1 2g(t ,rt ) 2
d Bt dt drt dr 1
t Bt rt g(t , rt )dt
B
t t rt 2 rt
2
g (t , rt ) g (t , rt ) 1 2 g (t , rt ) 2
(t , rt ) (t , rt ) rt g (t , rt ) 0
t rt 2 rt2
Question
P(t ,T )
Why does the partial differential equation above need to equal zero in order for to be a
Bt
martingale?
Solution
P(t ,T )
Martingale processes have zero drift. In order for to have zero drift the coefficient of the
Bt
dt term in the stochastic differential equation must be zero.
ng
si
om
o m
3.2 The Vasicek model (1977)
as
.m
Vasicek assumes that:
w
w
w
drt ( rt )dt dW t
Here represents the ‘mean’ level of the short rate. If the short rate grows (driven by the
stochastic term) the drift becomes negative, pulling the rate back to . The speed of the
‘reversion’ is determined by . If is high, the reversion will be very quick.
The graph below show a simulation of this process based on the parameter values 0.1 ,
0.06 and 0.02 .
12%
10%
8%
6%
4%
2%
0%
-2%
g (t , rt ) g (t , rt ) 1 2 g (t , rt ) 2
( rt ) rt g (t , rt ) 0
t rt 2 rt2
t
rt (r0 )e t e (t u )dW u
0
T T
rsds T (r0 ) 1 e
1 T
1 e
(T u )
dW
u
0 0
ng
si
om
m
T
o
as
This implies that rsds is normally distributed with conditional mean:
.m
0
w
w
w
T
0
1
E rs ds T (r0 ) 1 e T
T T
VarQ ( X t , t )dWt E 2 ( X t , t ) dt
0 0
T 2 2 1
VarQ rs ds 2 T (1 e T )
(1 e 2T )
0 2
T
Since rsds is normally distributed, using the moment generating function of a normal,
0
N , 2
, random variable:
t 21 2t 2
E e tX e
T
P (0,T ) EQ exp rs ds
0
1
2
1
exp T (r0 ) 1 e T 2
2
T
2
(1 e T
)
1
2
(1 e 2T
)
In general, by setting:
b(t ,T )
1
1 e (T t )
and:
2 2 2
a(t ,T ) b(t ,T ) T t b (t ,T )
2 2 4
we have:
P (t ,T ) e a (t ,T ) b (t ,T )rt
ng
si
om
m
By defining T t , then equivalently we have:
o
as
.m
P(t ,T ) ea( )b( )rt
w
w
w
where:
b( )
1
1 e
and:
2 2 2
a( ) 2 b( ) b ( )
2 4
Question
Show that the instantaneous forward rate for the Vasicek model can be expressed as:
2 2
f (t ,T ) r(t)e 2 (1 e ) 2 (1 e )e
2 2
where T t .
Solution
We have a formula for P(t ,T ) under this model. So the instantaneous forward rate can be
derived from this using the relationship:
f (tT ) log P(tT )
T
By use of the chain rule, and noting that 1 , this gives:
T
f (tT ) [a( ) b( )r (t)]
T
[a( ) b( )r (t)]
T
[a( ) b( )r (t)] a( ) b( )r (t)
T
ng
si
om
m
From the definitions of b( ) and a( ) , we find that:
o
as
.m
d 1 e
w
b( ) e
w
d
w
and:
d 2 2
a( ) (b( ) ) 2 b( )2
d 2 4
2 2
(b( ) 1) 2 2b( )b( )
2 4
2 2 1 e
(e 1) 2 e
2 2
2 2
(1 e ) 2 2 (1 e )e
2 2
Substituting these expressions into the general formula for f (t ,T ) gives the required answer.
Question
Write down an expression in terms of the model parameters for the long rate, ie the
instantaneous forward rate corresponding to T t , according to the Vasicek model.
Solution
2 2 2
f (t , ) r(t) 0 2 (1 0) 2 (1 0) 0 2
2 2 2
The curves shown on the graph of gilt yields in Section 1 were fitted using a Vasicek model with
parameter values 0.131 , 0.083 and 0.037 .
Question
‘The particular model used for the graph implies that interest rates are mean-reverting to the
value 0.083 .’
True or false?
ng
si
om
o m
Solution
as
.m
w
The dynamics of r (t) for this particular Vasicek model are:
w
w
dr (t) 0.131 [r (t) 0.083]dt 0.037dW (t)
under the risk-neutral probability measure Q . Under this measure W (t) is standard Brownian
motion and therefore has zero drift and the process mean-reverts to the value 0.083.
However, under the real-world probability measure P , W (t) would have non-zero drift and the
process will mean-revert to a different value. In fact, although we will not prove it here, the long-
term rate in the real world can be found from the formula derived in the previous question,
namely:
2
0.0431 ie 4.31%
2 2
We can see that the form of the drift of rt is the same as for the Vasicek model. The critical
difference between the two models occurs in the volatility, which is increasing in line with the
square root of rt . Since this diminishes to zero as rt approaches zero, and provided 2 is not too
large ( rt will never hit zero provided 2 2 ), we can guarantee that rt will not hit zero.
Consequently all other interest rates will also remain strictly positive.
The graph below shows a simulation of this process based on the parameter values 0.1 ,
0.06 and 0.1 .
ng
si
om
o m
as
.m
20%
w
18%
w
w
16%
14%
12%
10%
8%
6%
4%
2%
0%
g (t , rt ) g (t , rt ) 1 2 g (t , rt ) 2
( rt ) rt rt g (t , rt ) 0
t rt 2 rt2
The only difference between this and the Vasicek PDE is the inclusion of an rt in the second
derivative term.
2(e 1)
b( )
( )(e 1) 2
and:
2 1 ( )
2 e 2
a( ) ln
2 ( )(e 1) 2
1 ( )
2 e2
2 ln b( )
(e 1)
where:
2 2 2
It turns out that these values for a and b are not that different from those in Vasicek’s
model.
ng
si
om
m
If X1 , X2 ,, Xn are independent random variables, each with a N(0,1) distribution, then
o
as
Y X12 X22 Xn2 has a chi-square distribution with n degrees of freedom.
.m
w
w
w
If X1 , X2 ,, Xn are independent random variables and Xi N(di ,1) , then Yd X12 X22 Xn2 is
said to have a non-central chi-squared distribution with n degrees of freedom and non-centrality
n
parameter d di2 .
i 1
So the non-central chi-squared distribution can be thought of as a lopsided version of the ordinary
chi-square distribution.
Question
What is the mean of the non-central chi-squared distribution with n degrees of freedom and
non-centrality parameter d ?
Solution
It follows that:
n
E[Yd ] E[ X12 X22 Xn2 ] (1 di2 ) n d
i 1
ln P (t ,T )
r (t ,T )
T t
and:
so we have:
Alternatively:
ng
si
om
m
Then the yield curves coming out of the Vasicek model are of three (related) types:
o
as
.m
w
w
w
These yield curves are generally described as being:
normal, with short-term yields being lower than long-term yields
inverted, with short-term yields being higher than long-term yields
humped, ie a yield curve with a turning point.
Recall that the duration, D, of an asset, whose interest rate dependent price is given by B, is
defined by:
B
D y
B
where y is the instrument’s yield. In the context of interest rate models, this is equivalent
to:
P (t ,T )
DP (t ,T ) b(t ,T )P (t ,T )
rt
B B
D y BD
B y
B
and is the rate at which the bond price changes with respect to a change in its yield, ie
y
P(t ,T )
.
rt
rt
rt
e
P(t ,T ) a(t ,T )b(t ,T )rt
b(t ,T )ea(t ,T )b(t ,T )rt b(t ,T )P(t ,T )
ng
si
om
o m
3.5 The Hull-White model (1990)
as
.m
The Hull-White model is an extension of Vasicek where the mean-reversion level, , is a
w
w
deterministic function of time:
w
drt ( (t ) rt )dt dW t
In some representations, the parameter is also allowed to be a function of time. This is known
as the extended Vasicek model.
This yields a PDE similar to Vasicek, and so we can start by ‘guessing’ that
P (t ,T ) e a(t ,T ) b(t ,T )rt g (t , rt ) and so:
g (t , rt ) g (t , rt ) 1 2 g (t , rt ) 2
( (t ) rt ) rt g (t , rt ) 0
t rt 2 rt2
By noting that:
g(t ,rt )
g(t ,rt )b(t ,T )
rt
2g(t ,rt )
g(t ,rt )b2 (t ,T )
rt2
a(t ,T ) b(t ,T ) 1
g(t ,rt ) rt g(t ,rt )b(t ,T )( (t) rt ) g(t , rt )b2 (t ,T ) 2 rt g(t , rt ) 0
t t 2
a(t ,T ) b(t ,T ) 1
g(t ,rt ) rt b(t ,T )( (t) rt ) b2 (t ,T ) 2 rt 0
t t 2
b(t ,T ) a(t ,T ) 1
g(t ,rt ) rt b(t ,T ) 1 b(t ,T ) (t) b2 (t ,T ) 2 0
t t 2
a(t ,T ) b(t ,T )
By letting a(t ,T ) and b(t ,T ) then we have:
t t
1
g (t , rt ) rt b(t ,T ) b(t ,T ) 1 a (t ,T ) b(t ,T ) (t ) b2 (t ,T ) 2 0
2
ng
si
om
m
This is essentially Vasicek but we have:
o
as
.m
T s
w
b(t ,T ) exp (u )du ds
w
t
w
t
T s
b(t ,T ) exp du ds
t t
T
exp s t ds
t
1
1 e T t
So b(t ,T ) is the same as for the Vasicek model.
T
a(t ,T ) (s)b(s,T ) 21
2 2
b (s,T ) ds
t
Question
1
g(t ,rt ) rt b(t ,T ) b(t ,T ) 1 a(t ,T ) b(t ,T ) (t) b2 (t ,T ) 2 0
2
Solution
a(t ,T )
a(t ,T ) (t)b(t ,T ) 12 2b2 (t ,T )
t
and
b(t ,T )
e
T t
b(t ,T )
t
ng
si
om
m
Substituting these into the given equation gives:
o
as
.m
g(t ,rt ) rt b(t ,T ) b(t ,T ) 1 a(t ,T ) b(t ,T ) (t) 12 b2 (t ,T ) 2
w
w
w
t
g(t ,rt )
r b(t ,T ) e T t 1 (t)b(t ,T )
1 2b2 (t ,T ) b(t ,T ) (t) 1 b2 (t ,T ) 2
2 2
g(t ,rt ) rt 0 0
0
as required.
The advantage of this model over Vasicek is that (t ) can be chosen to reproduce (as
closely as possible) the exact yield curve, rather than the restricted forms of the Vasicek
model.
ng
si
om
o m
4 Summary of short-rate modelling
as
.m
w
4.1 Summary of models
w
w
The properties of these models are summarised below:
Non-central chi-
CIR drt ( rt )dt rt dW t Yes
squared
Hull-White –
drt ( (t ) rt )dt dW t No Normal
Vasicek
Non-central chi-
Hull-White – CIR drt ( (t ) rt )dt rt dW t Yes
squared
The last row of the table introduces an alternative form of the Hull-White model that extends the
CIR model.
There are analytic solutions for P (t ,T ) and option prices for each of these four models.
4.2 Limitations
Question
Solution
A one-factor model is a model in which interest rates are assumed to be influenced by a single
source of randomness.
Bearing in mind that the purpose of interest rate models is to price interest rate derivatives,
there are some short-comings of short-rate models:
Single factor short-rate models mean that all maturities behave in the same way -
there is no independence. This essentially means they are useless for pricing
swaptions (but OK for caps/floors).
One-factor models, such as Vasicek and CIR, have certain limitations with which it is
important to be familiar.
ng
si
om
m
First, if we look at historical interest rate data we can see that changes in the prices of
o
as
bonds with different terms to maturity are not perfectly correlated as one would expect to
.m
see if a one-factor model was correct. Sometimes we even see, for example, that
w
short-dated bonds fall in price while long-dated bonds go up. Recent research has
w
w
suggested that around three factors, rather than one, are required to capture most of the
randomness in bonds of different durations.
Second, if we look at the long run of historical data we find that there have been sustained
periods of both high and low interest rates with periods of both high and low volatility.
Again, these are features which are difficult to capture without introducing more random
factors into a model.
This issue is especially important for two types of problem in insurance: the pricing and
hedging of long-dated insurance contracts with interest rate guarantees; and asset-liability
modelling and long-term risk management.
One-factor models do, nevertheless have their place as tools for the valuation of simple
liabilities with no option characteristics; or short-term, straightforward derivatives
contracts.
For other problems it is appropriate to make use of models which have more than one
source of randomness: so-called multi-factor models. Hull-White is not really a multi-factor
model, the and parameters are deterministic and aid fitting. A multi-factor version of
Vasicek would involve a multidimensional driving Wiener process and possibly
stochastic .
Question
ng
si
om
o m
Solution
as
.m
w
The following table summarises the characteristics of the Vasicek, Cox-Ingersoll-Ross (CIR) and
w
Hull-White models.
w
Vasicek CIR Hull-White
Notes:
(1) Although the CIR model is harder to use than the other two models, it is more tractable
than models with two or more factors.
(2) All three models produce perfectly correlated changes in bond prices, which is inconsistent
with the empirical evidence, and fail to model periods of high and low interest rates and
high and low volatility.
(3) Whilst one-factor models are generally difficult to calibrate, the Hull-White model is easier
than the other two because its time-varying mean-reversion function aids fitting.
(4) All three models can be used to price short-term, straightforward derivatives, but not
complex derivatives.
ng
si
om
o m
5 State-price deflator approach to pricing
as
.m
w
5.1 Short-rate models
w
w
Other modellers prefer to take a different approach to model-building using state price deflators.
With this approach we first specify a strictly positive diffusion process A(t) with SDE under P :
Note that, with this approach, the dynamics of the state price deflator A(t) are defined in terms
of the real-world probability measure.
E A(T ) Ft
P(tT ) P
A(t)
It can be shown, although it isn’t done here, that this method gives rise to an arbitrage-free model
for bond prices.
Question
Solution
When t T we have:
E A(T ) FT
P(T T ) P
A(T )
The expectation here is conditional on the history of the process A(t) up to time T .
Consequently, there is no additional randomness in A(T ) , which can be considered to be a known
value. So we have:
A(T )
P(T T ) 1
A(T )
This is exactly what we would expect, since the bond matures at time T and has a value of 1 at
that time.
ng
si
om
o m
Question
as
.m
Suppose that A(t) is a deterministic process with A (t) and A (t) 0 , where is a
w
w
w
constant. Show that A(t) ke t and evaluate P(t ,T ) .
Solution
Here the SDE for A(t) becomes dA(t) A(t) dt , which is an ordinary differential equation. This
can be written in the form:
1 dA(t) d
or log A(t)
A(t) dt dt
Integrating gives:
EP A(T ) Ft
P(t ,T ) can now be calculated from the formula P(tT ) , ie:
A(t)
EP ke T Ft
P(tT ) t
ke
ke T
P(tT ) t
e (T t )
ke
This question illustrates that the process A(t) acts like a discount factor that applies to payments
at time t . It is a generalisation of the familiar vt factor.
The formula for P(t ,T ) is a very simple looking formula, but any potential difficulty comes in
working out EP A(T ) Ft as we need ‘interesting’ models for A(t) in order to get interesting and
useful models for r (t) and P(t ,T ) .
Question
ng
si
om
o m
Solution
as
.m
w
The SDE for A(t) now becomes:
w
w
dA(t) A(t) dt dWt
This has the same form as the SDE for the lognormal model for share prices. So its solution is:
EP A(T ) Ft
P(t ,T ) can now be calculated from the formula P(tT ) :
A(t)
P(tT )
A(0)e( ½ )t Wt
2
EP e WT Ft
eWt
This expectation is conditional on the history up to time t . But WT Wt relates to the ‘future’
time interval (t ,T ) , and so is statistically independent. So we can write:
Since WT Wt N(0,T t) , the expectation is now just the MGF of a normal distribution, so that:
e (T t )
Using the state price deflator approach the risk-free rate of interest can be shown to be simply:
r(t) A (t)
It follows that our model will have positive interest rates if and only if A (t) is negative for all t
with probability 1. This means that A(t) must be a strictly positive supermartingale (that is,
EP A(T ) Ft A(t) for all t , T ).
Recall that martingales have the property that E Xt |Fs X s whenever t s , ie the process has
zero drift. A supermartingale has the property that E Xt |Fs X s , ie the drift is negative or zero.
ng
si
om
o m
Question
as
.m
w
How do we know that A(t) is a supermartingale?
w
w
Solution
A(t) takes positive values and A (t) is negative. Therefore the drift coefficient A(t) A (t) in the
SDE for A(t) is negative. So the expected value of dA(t) is negative, ie the process has negative
drift, which means that it is a supermartingale.
The state price deflator approach looks like it is quite different from the risk-neutral approach to
pricing. However, the two approaches are, in fact, exactly equivalent: any model developed in
one framework has an equivalent form under the other framework.
Define a traded asset, based on the (traded) discount bonds P (s, t ) and P (s,T ) :
1
X (s ) P (s, t ) P (s,T )
The price of a traded asset divided by another traded asset must be a martingale under the
measure associated with the numeraire.
A numeraire is a quantity whose values are used as the units for expressing the price of a security.
Commonly used numeraires are:
the accumulated value of a risk-free cash account
the price of a zero-coupon bond
the price of a foreign currency.
So, labelling QT as the measure associated with using P (s,T ) as the numeraire, we have:
X (s ) X (t )
F (s, t ,T )
P (s,T )
EQT P (t ,T ) EQT F (t , t ,T )
T
dF (s, t ,T ) (s, t )F (s , t ,T )dWsQ
Note that we use t as the parameter for the volatility, rather than T, because the forward
matures at t and does not exist in (t ,T ] .
ng
si
om
o m
as
.m
w
w
w
This page has been left blank so that you can keep the chapter
summaries together for revision purposes.
ng
si
om
o m
as
Chapter 18 Summary
.m
w
w
Notation
w
t current time
P(t ,T ) zero-coupon bond price
Relationships
1 1 T
T t t
r (tT ) log P(tT ) f (t u)du
T t
1 P(tT )
f (tT S) log
S T P(t S)
f (t ,T ) lim F (t ,T , S) log P(t ,T )
T S T
ng
si
om
o m
as
Examples of one-factor models
.m
w
Vasicek model
w
w
dr(t) r(t)dt dW (t) under Q
Cox-Ingersoll-Ross model
Hull-White model