Professional Documents
Culture Documents
Q
P )
p
r
(
E
)
r
(
E
Indifference curve
(preferred direction)
Northwest
II
IV
I
III
Figure 1: The trade-o between risk and return of a potential investment portfolio P
A risk lover (A < 0) is willing to engage in fair games and gambles: This investor adjusts the
expected return upward to take into account the fun of confronting the prospects risk
Portfolio P (expected return E(r
p
), standard deviation
p
) is preferred by risk-averse investors to
any portfolio in quadrant IV because it has an expected return any portfolio in that quadrant
and a standard deviation any portfolio in that quadrant
Conversely, any portfolio in quadrant I is preferable to portfolio P
Mean-variance (M-V) criterion: Portfolio A dominates B if:
E(r
A
) E(r
B
) and
A
B
In the E- plane in Fig. 1, the preferred direction is northwest, because we simultaneously increase
the expected return/decrease the variance of the rate of return
Indierence curve: Equally preferred portfolios will lie in the mean-standard deviation plane on
a curve called the indierence curve that connects all portfolio points with the same utility value
Estimating risk aversion
One way is to observe individuals decisions when confronted with risk
Consider an investor with risk aversion A whose entire wealth is in a piece of real estate
Suppose that in any given year there is a probability p of a disaster that will wipe out the
investors entire wealth. Such an event would amount to a rate of return of 100%
With probability 1 p, real estate remains intact, and rate of return is zero
The expected rate of return of this prospect is:
E(r) = p (1) + (1 p) 0 = p
The variance of the rate of return equals the expectation of the squared deviation:
2
(r) = p (p 1)
2
+ (1 p) p
2
= p(1 p)
Utility score:
U = E(r)
1
2
A
2
(r) = p
1
2
Ap(1 p) (2)
We can relate the risk-aversion parameter to the amount that an individual would be willing
to pay for insurance against the potential loss. Suppose an insurance company oers to cover
any loss over the year for a fee of dollars per dollar of insured property
Such a policy amounts to a sure negative rate of return of , with a utility score: U =
Maximum value of the investor is willing to pay? Equate the utility score of the uninsured
property to that of the insured property, and solve for
= p[1 +
1
2
A(1 p)] (3)
Square brackets in Eq. 3 = multiple of expected loss p the investor is willing to pay
Economists estimate that investors exhibit degrees of risk aversion in the range of 2 to 4
More support for the hypothesis that A is somewhere in the range of 2 to 4 can be obtained from
estimates of the expected rate of return and risk on a broad stock-index portfolio
4
Capital allocation across risky and risk-free portfolios
The most straightforward way to control the risk of the portfolio is through the fraction of the portfolio
invested in Treasury bills and other safe money market securities versus risky assets
This capital allocation decision is an example of an asset allocation choice - a choice among broad
investment classes, rather than among the specic securities within each asset class
Most investment professionals: Asset allocation = most important part of portfolio construction
Take composition of risky portfolio as given and focus on allocation between it/risk-free securities
When we shift wealth from the risky portfolio to the risk-free asset, we do not change the relative
proportions of the various risky assets within the risky portfolio
Rather, we reduce the relative weight of the risky portfolio as a whole in favor of risk-free assets
As long as we do not alter the weights of each security within the risky portfolio, the probability
distribution of the rate of return on the risky portfolio remains unchanged by the asset reallocation
What will change is the probability distribution of the rate of return on the complete portfolio that
consists of the risky asset and the risk-free asset
The risk-free asset
There are no true risk-free assets
Only the government can issue default-free bonds
Even the default-free guarantee by itself is not sucient to make the bonds risk-free in real terms
The only risk-free asset in real terms would be a perfectly price-indexed bond
Moreover, a default-free perfectly indexed bond oers a guaranteed real rate to an investor only if
the maturity of the bond is identical to the investors desired holding period
Even indexed bonds are subject to interest rate risk: Real interest rates change unpredictably
Nevertheless, it is common practice to view Treasury bills as the risk-free asset
Their short term nature makes their values insensitive to interest rate uctuations
An investor can lock in a short-term nominal return by buying a bill and holding it to maturity
Ination uncertainty over a few weeks/months uncertainty of stock market returns
In practice, most investors use a broader range of money market instruments as a risk-free asset
All the money market instruments are virtually free of interest rate risk because of their short
maturities and are fairly safe in terms of default or credit risk
Most money market funds hold three types of securities: (i) T-bills, (ii) Bank certicates of deposit
(CDs), and (iii) Commercial paper (CP), diering slightly in their default risk
Portfolios of one risky asset and a risk-free asset
The concern is with the proportion of the investment budget y to be allocated to the risky portfolio P.
The remaining proportion 1 y is to be invested in the risk-free asset F
Denote the risky rate of return of P by r
p
, its expected rate of return by E(r
p
) and its standard
deviation by
p
. The rate of return on the risk-free asset is denoted as r
f
The risk premium on the risky asset is: E(r
p
) r
f
The rate of return on the complete portfolio C is r
c
= yr
p
+ (1 y)r
f
E(r
c
) = r
f
+y[E(r
p
) r
f
]
Base rate of return = risk-free rate. The portfolio is also expected to earn a risk premium that
depends on risk premium of risky portfolio E(r
p
) r
f
and investors position y in P
When combining risky and risk-free assets, standard deviation
c
of complete portfolio = standard
deviation
p
of risky asset multiplied by weight y of risky asset:
c
= y
p
(4)
Investment opportunity set with risky/risk-free asset in the E- plane
Equation for the straight line between F and P:
E(r
c
) = r
f
+y[E(r
p
) r
f
] = r
f
+
c
p
[E(r
p
) r
f
] (5)
5
)
r
(
E
)
p
r
(
E
f
B
r
f
r
F
Capital Allocation Line (CAL)
P
f
r ){
p
r ( E 1) < y ( S
1) > y ( S
Capital Allocation Line (CAL)
f
B
r with borrowing rate
= S Slope
f
r ) {
p
r ( E
p
Figure 2: The investment opportunity set in the expected return-standard deviation plane
Investment opportunity set
The set of feasible expected return and standard deviation pairs of portfolios resulting from
dierent values of y
The Capital Allocation Line (CAL) and the Sharpe ratio
The CAL depicts all the risk-return combinations available to investors
The slope S of the CAL equals the increase in the expected return of the complete portfolio
per unit of additional standard deviation, i.e. incremental return per incremental risk
The slope is called the reward-to-volatility ratio or the Sharpe ratio
S =
E(r
p
) r
f
p
(6)
If investors can borrow at r
f
, they can construct portfolios to the right of P on the CAL
However, non-government investors cannot borrow at the risk-free rate
Then in the borrowing range, the reward-to-volatility ratio (i.e. the slope of the CAL) will be lower
The CAL will therefore be kinked at point P
Risk tolerance and asset allocation
Investor confronting the CAL must choose one optimal portfolio C from set of feasible choices
This choice entails a trade-o between risk and return
Dierences in risk aversion Dierent investors choose dierent positions in risky asset
Investors attempt to maximize utility by choosing the best allocation to the risky asset y
As allocation to risky asset increases (y ), expected return increases, but so does volatility
Solving the utility maximization problem:
max
y
U = E(r
c
)
1
2
A
2
c
= r
f
+y[E(r
p
) r
f
]
1
2
Ay
2
2
p
Setting the derivative of this expression to zero and solving for y yields the optimal position:
y
=
E(r
p
) r
f
A
2
p
(7)
The optimal position in the risky asset is inversely proportional to the level of risk aversion and
the level of risk (variance) and directly proportional to the risk premium oered by the risky asset
Indierence curve analysis
First calculate the utility value of a risk-free portfolio yielding r
f
Then, nd the expected return the investor would require to maintain the same level of utility
when holding a risky portfolio for a given
This yields all combinations of expected return/volatility with a given constant utility level
Any investor prefers a portfolio on higher indierence curve (higher certainty equivalent)
Portfolios on higher indierence curves oer a higher return for any given level of risk
Higher indierence curves correspond to higher levels of utility
6
CAL
P
C
f
r = U
U Complete portfolio maximizing
U Higher
)
r
(
E
)
p
r
(
E
)
c
r
(
E
f
r
the fraction of overall investment funds to place in the risky portfolio versus the safer
but lower expected-return risk-free asset, is in large part a matter of risk aversion
Passive strategies: The capital market line
Passive strategy
Describes a portfolio decision that avoids any direct or indirect security analysis
Natural candidate for passively held risky asset is a well-diversied portfolio of common stocks
Because a passive strategy requires that we devote no resources to acquiring information on any
individual stock or group of stocks, we must follow a neutral diversication strategy
Select diversied stock portfolio that mirrors the value of the US corporate sector
This results in a portfolio in which, e.g., the proportion invested in Microsoft stock will be the ratio
of Microsofts total market value to the market value of all listed stocks
The Capital Market Line (CML)
Dened as the CAL provided by l-month T-bills and a broad index of common stocks
A passive strategy generates an investment opportunity set that is represented by the CML
How reasonable is it for an investor to pursue a passive strategy?
1. The alternative active strategy is not free, Whether you choose to invest the time and cost to
acquire the information needed to generate an optimal active portfolio of risky assets, or whether
you delegate the task to a professional who will charge a fee
2. Free-rider benet: Another reason to pursue a passive strategy
If there are many active investors who quickly bid up prices of undervalued assets and force
down prices of overvalued assets, then at any time most assets will be fairly priced
Therefore, a well-diversied portfolio of common stock will be a reasonably fair buy, and the
passive strategy may not be inferior to that of the average active investor
To summarize, a passive strategy involves investment in two passive portfolios: (i) Virtually risk-free
short-term T-bills (or a money market fund) and (ii) A fund of common stocks that mimics a broad
market index. The capital allocation line representing such a strategy is called the capital market line
Criticisms of index funds dont hold up
They are undiversied: The same complaint could be leveled at actively managed funds
They are top-heavy: True, but S&P 500 not so narrow focused (77.2% of US stock-market value)
They are chasing performance: This is what all investors do
You can do better: As a group, investors in cant outperform market (collectively, they = market)
7
8
BKM - Ch. 7: Optimal risky portfolios
Introduction
The investment decision can be viewed as a top-down process:
1. Capital allocation between the risky portfolio and risk-free assets
2. Asset allocation across broad classes (US stocks, international stocks, long-term bonds)
3. Security selection of individual assets within each asset class
The optimal capital allocation is determined by risk aversion as well as expectations for the risk-return
trade-o of the optimal risky portfolio
In principle, asset allocation and security selection are technically identical:
Both aim at identifying that optimal risky portfolio, namely, the combination of risky assets that
provides the best risk-return trade-o
In practice, however, asset allocation and security selection are typically separated into two steps:
1. The broad outlines of the portfolio are established rst (asset allocation)
2. Details concerning specic securities are lled in later (security selection)
Diversication and portfolio risk
When all risk is rm-specic, diversication can reduce risk to arbitrarily low levels
With all risk sources independent, exposure to any specic source of risk reduced to negligible level
The insurance principle: Reduction of risk to very low levels for independent risk sources
Risk eliminated by diversication: Unique/rm-specic/nonsystematic/diversiable risk
When common sources of risk aect all rms, even extensive diversication cannot eliminate risk
The risk that remains even after extensive diversication is called market risk, risk that is at-
tributable to marketwide risk sources (aka systematic risk, or nondiversiable risk)
Portfolio of two risky assets
Ecient diversication: Constructing risky portfolios to provide the lowest possible risk for any given
level of expected return
Consider two mutual funds: A bond portfolio D, and a stock fund E
w
D
is invested in the bond fund, and the remainder 1 w
D
= w
E
is invested in the stock fund
Denoting r
D
, r
E
the rate of return on debt/equity funds, the rate of return r
p
on portfolio is:
r
p
= w
D
r
D
+w
E
r
E
(1)
The expected return on the portfolio is a weighted average of expected returns on the component
securities with portfolio proportions as weights:
E(r
p
) = w
D
E(r
D
) +w
E
E(r
E
) (2)
The variance of the two-asset portfolio is:
2
p
= w
2
D
2
D
+w
2
E
2
E
+ 2w
D
w
E
Cov(r
D
, r
E
) (3)
Variance is reduced if the covariance term is negative
Even if covariance term 0, the portfolio standard deviation is less than weighted average of
individual security s, unless the two securities are perfectly positively correlated
The covariance can be computed from the correlation coecient
DE
:
Cov(r
D
, r
E
) =
DE
E
2
p
= w
2
D
2
D
+w
2
E
2
E
+ 2w
D
w
E
DE
(4)
A hedge asset has negative correlation with the other assets in the portfolio
Such assets will be particularly eective in reducing total risk
Expected return is unaected by correlation between returns
Always prefer to add to portfolio assets with low or negative correlation with existing position
9
How low can portfolio standard deviation be?
With perfect negative correlation, = 1:
2
p
= (w
D
D
w
E
E
)
2
(5)
When = 1, a perfectly hedged position can be obtained by choosing the portfolio proportions
to solve w
D
D
w
E
E
= 0. Then:
w
D
=
E
D
+
E
and w
E
=
D
D
+
E
= 1 w
D
(6)
What happens when w
D
> 1 and w
E
< 0?
Strategy: Sell the equity fund short and invest proceeds of short sale in debt fund
This will decrease the expected return of the portfolio
The reverse happens when w
D
< 0 and w
E
> 1
Strategy: Sell bond fund short and use proceeds to buy more of equity fund
E
D
3 : =
= 0
= 1
= {1
10
5
1 .5 0 -.5
Weight in Stock Fund
P
o
r
t
f
o
l
i
o
S
t
a
n
d
a
r
d
D
e
v
i
a
t
i
o
n
Figure 1: Portfolio standard deviation as a function of investment proportions
As the portfolio weight in the equity fund increases from zero to 1, portfolio standard deviation
rst falls with the initial diversication from bonds into stocks, but then rises again as the portfolio
becomes heavily concentrated in stocks, and again is undiversied
Pattern generally holds as long as the correlation coecient between funds is not too high
If <
D
/
E
, volatility initially falls when we start with all bonds and move into stocks
For a pair of assets with a large positive correlation of returns, the portfolio standard deviation
will increase monotonically from the low-risk asset to the high-risk asset. Even in this case,
however, there is a positive (if small) value from diversication
What is the minimum level to which portfolio standard deviation can be held?
The portfolio weights that solve this minimization problem turn out to be:
w
min
(D) =
2
E
Cov(r
D
, r
E
)
2
D
+
2
E
2Cov(r
D
, r
E
)
The minimum-variance portfolio has a standard deviation smaller than that of either of the indi-
vidual component assets Eect of diversication
Portfolio opportunity set
Pair of investment weights (w
D
, w
E
) Resulting pair of expected return/standard deviation
These constitute the portfolio opportunity set that can be constructed from the two available assets
Fig. 2 shows the portfolio opportunity set for other values of the correlation coecient
The solid black line connecting the two funds shows that there is no benet from diversication
when the correlation between the two is perfectly positive ( = 1)
The dashed colored line demonstrates the greater benet from diversication when the corre-
lation coecient is lower than .30
10
16 14 12 10
8
9
Standard Deviation (%)
E
x
p
e
c
t
e
d
R
e
t
u
r
n
(
%
)
E
D
3 : =
= 0
= 1
= {1
10
Figure 2: Portfolio expected return as a function of standard deviation
For = 1, the portfolio opportunity set is linear, but now it oers a perfect hedging oppor-
tunity and the maximum advantage from diversication
The lower the correlation, the greater the potential benet from diversication
Suppose now an investor wishes to select the optimal portfolio from the opportunity set
The best portfolio will depend on risk aversion
Portfolios to the northeast in Fig. 2 provide higher rates of return but impose greater risk
The best trade-o among these choices is a matter or personal preference
Investors with greater risk aversion prefer southwest portfolios (lower expected return, lower risk)
Given a level of risk aversion, determine the portfolio that provides highest level of utility
Using U = E(r
p
)
1
2
A
2
p
and the portfolio mean/variance determined by the portfolio weights
in the two funds w
E
and w
D
, the optimal investment proportions in the two funds are:
w
D
=
E(r
D
) E(r
E
) +A(
2
E
DE
)
A(
2
D
+
2
E
2
D
DE
)
and w
E
= 1 w
D
Asset allocation with stocks, bonds, and bills
The optimal risky portfolio with two risky assets and a risk-free asset
Graphical solution
Ratchet the CAL upward until it reaches point of tangency with investment opportunity set
This must yield the CAL with the highest feasible reward-to-volatility ratio
Thus, the tangency portfolio (P in Fig. 3) is the optimal risky portfolio to mix with T-bills
We can read the expected return and standard deviation of portfolio P from the graph
Portfolio
Complete
Optimal
Portfolio
Global M-V
% = 5
f
r
CAL(E)
Minimum-Variance Frontier
Opportunity Set of Risky Assets
Efficient Frontier
Indifference Curve
C
E
G
P
CAL(P)
20 15 10 5 0
14
12
10
8
6
4
2
Standard Deviation (%)
E
x
p
e
c
t
e
d
R
e
t
u
r
n
(
%
)
Figure 3: The opportunity set of the debt and equity funds with the CAL
Portfolio construction with only two risky assets and a risk-free asset
The objective is to nd the weights w
D
and w
E
that result in the highest slope of the CAL
(i.e., the weights that result in the risky portfolio with the highest reward-to-volatility ratio)
11
Thus our objective function is the slope (equivalently, the Sharpe ratio) S
p
:
S
p
=
E(r
p
) r
f
p
For portfolio with two risky assets, expected return and standard deviation of portfolio P are:
E(r
p
) = w
D
E(r
D
) +w
E
E(r
E
)
p
= [w
2
D
2
D
+w
2
E
2
E
+ 2w
D
w
E
Cov(r
D
, r
E
)]
1/2
Therefore, we solve an optimization problem formally written as (subject to
w
i
= 1):
max
w
i
S
p
=
E(r
p
) r
f
p
In the case of two risky assets, the solution for the weights of the optimal risky portfolio P,
using excess rates of return R rather than total returns r, is:
w
D
=
E(R
D
)
2
E
E(R
E
)Cov(R
D
, R
E
)
E(R
D
)
2
E
+E(R
E
)
2
D
[E(R
D
) +E(R
E
)]Cov(R
D
, R
E
)
and w
E
= 1 w
D
(7)
The steps to arrive at the complete portfolio are:
1. Specify the return characteristics of all securities (expected returns, variances, covariances)
2. Establish the risky portfolio:
(a) Calculate the optimal risky portfolio P [Eq. (7)]
(b) Calculate properties of P using weights determined in step (a) and Eqs. (2) and (3)
3. Allocate funds between the risky portfolio and the risk-free asset:
(a) Calculate the fraction y of the complete portfolio allocated to portfolio P (the risky port-
folio) and to T -bills (the risk-free asset)
y =
E(r
p
) r
f
A
2
p
(8)
(b) Calculate the share of the complete portfolio invested in each asset and in T-bills
Our two risky assets, the bond and stock mutual funds, are already diversied portfolios. The
diversication within each of these portfolios must be credited for a good deal of the risk reduction
compared to undiversied single securities
Optimizing the asset allocation between bonds and stocks contributed incrementally to the im-
provement in the reward-to-volatility ratio of the complete portfolio
The CAL with stocks, bonds, and bills shows that the standard deviation of the complete portfolio
can be further reduced while maintaining the same expected return as the stock portfolio
The Markowitz portfolio selection model
Generalizing the portfolio construction problem to the case of many risky securities and a risk-free asset
1. Identify the risk-return combinations available from the set of risky assets
2. Identify optimal portfolio of risky assets by nding portfolio weights resulting in steepest CAL
3. Choose appropriate complete portfolio by mixing risk-free asset with optimal risky portfolio
Security selection
In the risk-return analysis, the portfolio manager needs as inputs a set of estimates for the expected
returns of each security and a set of estimates for the covariance matrix
Hence, we have n estimates of E(r
i
) and the n n estimates of the covariance matrix in which
the n diagonal elements are estimates of the variances
2
i
and the n
2
n = n(n 1) o-diagonal
elements are the estimates of the covariances between each pair of asset returns
12
The expected return/variance of any risky portfolio with weights in each security w
i
is:
E(r
p
) =
n
i=1
w
i
E(r
i
) (9)
2
p
=
n
i=1
n
j=1
w
i
w
j
Cov(r
i
, r
j
) (10)
Markowitz model is precisely step one of portfolio management: The identication of the ecient
set of portfolios, or the ecient frontier of risky assets
M-V frontier: Graph of the lowest possible variance for a given portfolio expected return
All individual assets lie to the right inside the frontier, at least when short sales are allowed.
When short sales are prohibited, single securities may lie on the frontier
The part of the frontier above the global M-V portfolio is the ecient frontier
The principal idea behind the frontier set of risky portfolios is that, for any risk level, we are
interested only in that portfolio with the highest expected return
The frontier is the set of portfolios that minimizes variance for any target expected return
Some clients may be subject to additional constraints. E.g., prohibited from taking short positions
For these clients the portfolio manager will add to the optimization program constraints that
rule out negative (short) positions in the search for ecient portfolios
In this special case, single assets may be, in and of themselves, ecient risky portfolios
E.g., asset with highest expected return is a frontier portfolio because, without short sales, the
only way to obtain that rate of return is to hold the asset as ones entire risky portfolio
Some may want to ensure minimal level of expected dividend yield from optimal portfolio
In this case the input list will be expanded to include a set of expected dividend yields d
1
, , d
n
and the optimization program will include an additional constraint that ensures that the ex-
pected dividend yield of the portfolio will equal or exceed the desired level d
Any constraint carries a price tag in the sense that an ecient frontier constructed subject to extra
constraints will oer a reward-to-volatility ratio inferior to that of a less constrained one
Another type of constraint is aimed at ruling out investments in industries or countries considered
ethically or politically undesirable. This is referred to as socially responsible investing
Capital allocation and the separation property
We ratchet up the CAL by selecting dierent portfolios until we reach portfolio P which is the
tangency point of a line from F to the ecient frontier
Portfolio P maximizes the reward-to-volatility ratio and is the optimal risky portfolio
The most striking conclusion is that a portfolio manager will oer the same risky portfolio P to all
clients regardless of their degree of risk aversion
The degree of risk aversion comes into play only in the selection of desired point along CAL
More risk-averse clients invest more in risk-free asset and less in optimal risky portfolio than
less risk-averse clients. However, both use portfolio P as their optimal risky investment vehicle
Separation property: Portfolio choice problem may be separated into 2 independent tasks:
1. Determination of the optimal risky portfolio (purely technical)
2. Allocation of the complete portfolio to T-bills vs. the risky portfolio (personal preference)
In practice, dierent managers estimate dierent input lists, thus deriving dierent ecient fron-
tiers, and oer dierent optimal portfolios Source of disparity lies in security analysis
This analysis suggests that a limited number of portfolios may be sucient to serve the demands
of a wide range of investors Theoretical basis of the mutual fund industry
The power of diversication
Consider the naive diversication strategy in which an equally weighted portfolio is constructed
w
i
= 1/n for each security. In this case Eq. (10) becomes:
2
p
=
1
n
n
i=1
1
n
2
i
+
n
j=1,j=i
n
i=1
1
n
2
Cov(r
i
, r
j
) (11)
13
Dene the average variance and average covariance of the securities as:
2
=
1
n
n
i=1
2
i
and Cov =
1
n(n 1)
n
j=1,j=i
n
i=1
Cov(r
i
, r
j
) (12)
Then, the portfolio variance is:
2
p
=
1
n
2
+
n 1
n
Cov (13)
Eect of diversication
When the average covariance among security returns is zero, as it is when all risk is rm-specic,
portfolio variance can be driven to zero
Hence when security returns are uncorrelated, the power of diversication is unlimited
However, usually, economy-wide risk factors impart positive correlation among stocks
The irreducible risk of a diversied portfolio depends on covariance of the returns of com-
ponent securities, which is a function of the importance of systematic economic factors
Suppose that all securities have a common and all security pairs have a common
The covariance between all pairs of securities is
2
and Eq. (13) becomes:
2
p
=
1
n
2
+
n 1
n
2
(14)
When = 0, we again obtain the insurance principle:
p
0 as n
For > 0, however, portfolio variance remains positive
For = 1, portfolio variance equals
2
regardless of n Diversication is of no benet
In the case of perfect correlation, all risk is systematic
More generally, as n becomes greater, Eq. (14) shows that systematic risk becomes
2
For diversied portfolios, the contribution to portfolio risk of a particular security depends on the
covariance of that securitys return with other securities, and not on the securitys variance
Asset allocation and security selection
Why distinguish between asset allocation and security selection? 3 reasons:
1. As a result of greater need and ability to save (for college, recreation, longer life, health care
needs, etc.), the demand for sophisticated investment management has increased enormously
2. The widening spectrum of nancial markets and nancial instruments has put sophisticated
investment beyond the capacity of many amateur investors
3. There are strong economies of scale in investment analysis
The end result is that the size of a competitive investment company has grown with the industry,
and eciency in organization has become an important issue
The practice is therefore to optimize the security selection of each asset-class portfolio independently
At the same time, top management continually updates the asset allocation of the organization,
adjusting the investment budget allotted to each asset-class portfolio
Risk pooling, risk sharing, and risk in the long run
Risk pooling and the insurance principle
The insurance principle
Suppose an insurer sells 10,000 uncorrelated policies, each with a standard deviation
Because the covariance between any two insurance policies is zero and is the same for each
policy, the standard deviation of the rate of return on the 10,000-policy portfolio is:
2
p
=
1
n
p
=
n
(15)
p
could be further decreased by selling even more policies This is the insurance principle
14
It seems that as the lm sells more policies, its risk continues to fall. Flaw in this argument:
Probability of loss = inadequate measure of risk: Does not account for the magnitude of loss
If 10,000 policies are sold, maximum possible loss is 10,000 bigger and comparison with a
one-policy portfolio cannot be made based on means/standard deviations of rates of return
Similar aw as the argument that investing in stocks for the long run reduces risk
Increasing the size of the bundle of policies does not make for diversication! Diversifying a portfolio
means dividing a xed investment budget across more assets
When we combine n uncorrelated insurance policies, each with an expected prot $, both expected
total prot and standard deviation (SD) grow in direct proportion to n:
E(n) = nE()
V ar(n) = n
2
V ar() = n
2
2
SD(n) = n
The ratio of mean to standard deviation does not change when n increases
The risk-return trade-o therefore does not improve with the assumption of additional policies
Risk sharing
If risk pooling (sale of additional independent policies) does not explain insurance, what does?
The answer is risk sharing, the distribution of a xed amount of risk among many investors
An underwriter will contact other underwriters who each will take a piece of the action: Each will
choose to insure a fraction of the project risk
Each underwriter has a xed amount of equity capital. Underwriters engage in risk sharing. They
limit their exposure to any single source of risk by sharing that risk with other underwriters
Underwriter diversies its risk by allocating its budget across many projects that are not perfectly
correlated One underwriter will decline to u/w too large a fraction of any single project
This is the proper use of risk pooling: Pooling many sources of risk in a portfolio of given size
The bottom line is that portfolio risk management is about the allocation of a xed investment budget
to assets that are not perfectly correlated
In this environment, rate of return statistics (expected returns, variances, and covariances) are
sucient to optimize the investment portfolio
Choices among alternative investments of a dierent magnitude require that we abandon rates of
return in favor of dollar prots
This applies as well to investments for the long run
15
16
BKM - Ch. 8: Index models
Introduction
The Markowitz procedure suers from two drawbacks:
1. The model requires a huge number of estimates to ll the covariance matrix
2. The model does not provide any guideline to the forecasting of the security risk premiums
Index models simplify estimation of covariance matrix and enhance analysis of risk premiums
By allowing to explicitly decompose risk into systematic and rm-specic components, these models
also shed considerable light on both the power and limits of diversication
Further, they allow to measure these components of risk for particular securities and portfolios
Despite simplication, index models remain true to concepts of ecient frontier/portfolio optimization
Empirically, index models are as valid as the assumption of normality of rates of return on securities
A single-factor security market
The input list of the Markowitz model
The success of a portfolio selection rule depends on the quality of the input list
The Markowitz model necessitates n expected returns, n variances, and n(n 1)/2 covariances
Markowitz model: Errors in estimation of correlation coecients can lead to nonsensical results
This can happen because some sets of correlation coecients are mutually inconsistent
Introducing a model that simplies the way we describe the sources of security risk allows us to
use a smaller, consistent set of estimates of risk parameters and risk premiums
The simplication emerges because positive covariances among security returns arise from
common economic forces that aect the fortunes of most rms
Examples of common economic factors: Business cycles, interest rates, cost of natural resources
The unexpected changes in these variables cause, simultaneously, unexpected changes in the
rates of return on the entire stock market
By decomposing uncertainty into these system-wide versus rm-specic sources, we vastly simplify
the problem of estimating covariance and correlation
Normality of returns and systematic risk
Decompose rate of return on security i into its expected plus unanticipated components:
r
i
= E(r
i
) +e
i
(1)
Where the unexpected return e
i
has a mean of zero and a standard deviation of
i
When security returns can be well approximated by normal distributions that are correlated across
securities, we say that they are joint normally distributed
At any time, security returns are driven by one or more common variables
Suppose the common factor m that drives innovations in security returns is some macroeconomic
variable that aects all rms Decompose sources of uncertainty into uncertainty about economy
as a whole (captured by m) and uncertainty about rm in particular (captured by e
i
)
r
i
= E(r
i
) +m+e
i
(2)
The macroeconomic factor m measures unanticipated macro surprises
m has a mean of zero (over time, surprises average out) with standard deviation of
m
m and e
i
are uncorrelated, because e
i
is rm-specic Independent of shocks to the common
factor that aect the entire economy
The variance of r
i
thus arises from two uncorrelated sources, systematic and rm specic
2
i
=
2
m
+
2
(e
i
) (3)
The common factor m generates correlation across securities
Because all securities will respond to the same macroeconomic news
17
Since m is uncorrelated with any rm-specic surprises, the covariance between any i and j is:
Cov(r
i
, r
j
) = Cov(m+e
i
, m+e
j
) =
2
m
(4)
We recognize that some securities are more sensitive than others to macroeconomic shocks:
Capture this renement by assigning each rm a sensitivity coecient to macro conditions
This leads to the single-factor model:
r
i
= E(r
i
) +
i
m+e
i
(5)
The systematic risk of security i is
2
i
2
m
, and its total risk is:
2
i
=
2
i
2
m
+
2
(e
i
) (6)
The covariance between any pair of securities also is determined by their betas:
Cov(r
i
, r
j
) = Cov(
i
m+e
i
,
j
m+e
j
) =
i
2
m
(7)
Normality of security returns alone guarantees that portfolio returns are also normal and that there
is a linear relationship between security returns and the common factor
Seek a variable that can proxy for common factor. To be useful, variable must be observable, so
we can estimate its volatility/sensitivity of individual securities returns to variation in its value
The single-index model
A reasonable approach is to assert that the rate of return on a broad index of securities such as the S&P
500 is a valid proxy for the common macroeconomic factor
Single-index model single-factor model because market index = proxy for common factor
The regression equation of the single-index model
Denote the market index by M, with excess return of R
M
= r
M
r
f
and standard deviation of
M
We regress the excess return of a security R
i
= r
i
r
f
on the excess return of the index R
M
We collect a historical sample of paired observations R
i
(t) and R
M
(t) where t denotes the date of
each pair of observations. The regression equation is:
R
i
(t) =
i
+
i
R
M
(t) +e
i
(t) (8)
The intercept
i
is the securitys expected excess return when the market excess return is zero
The slope coecient
i
is the securitys sensitivity to the index
e
i
is the zero-mean, rm-specic surprise in the security return in time t, aka the residual
The expected return-beta relationship
Taking expected values, we obtain the expected return-beta relationship of the single-index model:
E(R
i
) =
i
+
i
E(R
M
) (9)
Part of a securitys risk premium is due to the risk premium of the index
The market risk premium is multiplied by the relative sensitivity of the individual security
This is the systematic risk premium because it derives from the risk premium that characterizes
the entire market, which proxies for the condition of the full economy or economic system
is the non-market premium
Risk and covariance in the single-index model
Both variances/covariances are determined by security betas/properties of market index:
Total risk = Systematic risk + Firm-specic risk
2
i
=
2
i
2
M
+
2
(e
i
) (10)
Covariance = Product of betas Market index risk
Cov(r
i
, r
j
) =
i
2
M
(11)
Correlation = Product of correlations with the market index
Corr(r
i
, r
j
) =
i
2
M
j
= Corr(r
i
, r
M
) Corr(r
j
, r
M
) (12)
18
The set of parameter estimates needed for the single-index model consists of only , , and (e)
for the individual securities, plus the risk premium and variance of the market index
The set of estimates needed for the single-index model
1.
i
: Stocks expected return if the market is neutral, i.e. r
M
r
f
= 0
2.
i
(r
M
r
f
): The component of return due to movements in the overall market
3. e
i
: The unexpected component of return due to unexpected rm specic events
4.
2
i
2
M
: The variance attributable to the uncertainty of the common macroeconomic factor
5.
2
(e
i
): The variance attributable to rm-specic uncertainty
Advantages of the model
The index model is a very useful abstraction because for large universes of securities, the
number of estimates is only a small fraction of what otherwise would be needed
The index model abstraction is crucial for specialization of eort in security analysis: If a
covariance term had to be calculated for each security pair, then no specialization by industry
Disadvantages
Cost of the model: Restrictions it places on structure of asset return uncertainty
Classication of uncertainty into dichotomy - macro vs. micro risk - oversimplies sources of
real-world uncertainty and misses important sources of dependence in stock returns
E.g., dichotomy rules out industry events (aect an industry but not the broad macroeconomy)
The optimal portfolio derived from the single-index model therefore can be signicantly inferior
to that of the full-covariance (Markowitz) model when stocks with correlated residuals have large
alpha values and account for a large fraction of the Portfolio
If many pairs of the covered stocks exhibit residual correlation, it is possible that a multi-
index model, which includes additional factors to capture those extra sources of cross-security
correlation, would be better suited for portfolio analysis and construction
The index model and diversication
Suppose that we choose an equally weighted portfolio of n securities. The excess rate of return on
each security is given by: R
i
=
i
+
i
R
M
+e
i
Similarly, we can write the excess return on the portfolio of stocks as: R
p
=
p
+
p
R
M
+e
p
As the number of stocks included in this portfolio increases, the part of the portfolio risk attributable
to nonmarket factors becomes ever smaller: This part of the risk is diversied away
In contrast, market risk remains, regardless of the number of rms combined into the portfolio
R
p
=
n
i=1
w
i
R
i
=
1
n
n
i=1
(
i
+
i
R
M
+e
i
) =
1
n
n
i=1
i
+
_
1
n
n
i=1
i
_
R
M
+
1
n
n
i=1
e
i
(13)
Portfolios sensitivity to market, nonmarket return and avg. of rm-specic components:
p
=
1
n
n
i=1
i
p
=
1
n
n
i=1
i
e
p
=
1
n
n
i=1
e
i
(14)
Hence the portfolios variance is:
2
p
=
2
p
2
M
+
2
(e
p
) (15)
The systematic risk component of the portfolio variance, which depends on marketwide movements,
is
2
p
2
M
and depends on the sensitivity coecients of the individual securities. This part of the
risk will persist regardless of the extent of portfolio diversication
The nonsystematic component of the variance is
2
(e
p
) and is attributable to rm-specic e
i
Because e
i
s are independent and have zero expected value, as more stocks are added to the
portfolio, rm-specic components cancel out, resulting in ever-smaller nonmarket risk
Such risk is thus termed diversiable: Because the e
i
s are uncorrelated,
2
(e
p
) =
n
i=1
_
1
n
_
2
2
(e
i
) =
1
n
2
(e) (16)
19
Estimating the single-index model
The security characteristic line for stock i
Regression: Line with intercept
i
, slope
i
= security characteristic line (SCL) for stock i:
R
i
(t) =
i
+
i
R
S&P500
(t) +e
i
(t)
The explanatory power of the SCL for stock i
The R-square tells us the percentage of the variation in the stock i series that is explained by the
variation in the S&P 500 excess returns
The adjusted R
2
(slightly smaller) corrects for upward bias in R
2
that arises because we use tted
values of parameters (slope and intercept ) rather than true, but unobservable values
In general, the adjusted R-square (R
2
A
) is derived from the unadjusted by:
R
2
A
= 1 (1 R
2
)
(n1)
(nk1)
where k is the number of independent variables
An additional degree of freedom is lost to the estimate of the intercept
Analysis of variance
The sum of squares (SS) is the portion of the variance of the dependent variable (stock is return)
that is explained by the independent variable (S&P 500 return); it is equal to
2
i
2
S&P500
The MS column for the residual shows the variance of the unexplained portion of stock is return,
i.e. the portion of return that is independent of the market index. The square root of this value is
the standard error (SE) of the regression
Dividing the total SS of the regression by the number of dof (59 for 60 observations), we obtain
the estimate of the variance of the dependent variable (stock i), per month
R
2
(ratio of explained/total variance) = [explained (regression) SS]/[total SS]
R
2
=
2
i
2
S&P500
2
i
2
S&P500
+
2
(e
i
)
= 1
2
(e
i
)
2
i
2
S&P500
+
2
(e
i
)
The estimate of alpha
The intercept is the estimate of stock is alpha for the sample period (per month)
The standard error of the estimate is a measure of the imprecision of the estimate. If the standard
error is large, the range of likely estimation error is correspondingly large
We can relate the standard error of to the standard error of the residuals as:
SE(
i
) = (e
i
)
1
n
+
(Avg. S&P500)
2
V ar(S&P500) (n 1)
The t-statistic is the ratio of the regression parameter to its standard error
This statistic equals the number of standard errors by which our estimate exceeds zero, and
therefore can be used to assess the likelihood that the true but unobserved value might actually
equal zero rather than the estimate derived from the data
For , we are interested in avg. value of stock is return net of market movements
Dene the nonmarket component of return as actual return minus the return attributable to
market movements. Call this stock is rm-specic return R
fs
= R
i
i
R
S&P500
If R
fs
were normally distributed with a mean of zero, the ratio of its estimate to its standard
error would have a t-distribution
From a table of the t-distribution, we can nd the probability that the true is actually zero
This is called the level of signicance or the probability or p-value. Conventional cut-o for
statistical signicance is a probability of less than 5%, which requires t-statistic 2.0
Even if was both economically and statistically signicant within the sample, we still would not
use that as a forecast for a future period
Overwhelming empirical evidence shows that 5-year alpha values do not persist over time
Virtually no correlation between estimates from one sample period to the next
20
The estimate of beta
When the value of beta and its SE produce a large t-statistic and a p-value of practically zero, we
can condently reject the hypothesis that stock is true beta is zero
SE(
i
) =
(e
i
)
n 1
A more interesting t-statistic might test a null hypothesis that
i
is greater than the market-wide
average beta of 1. This t-statistic would measure how many standard errors separate the estimated
beta from a hypothesized value of 1:
t-statistics =
Estimated value Hypothesized value
Standard error
Firm-specic risk
The annual standard deviation of stock is residual is
12 times its monthly standard deviation
The standard deviation of systematic risk is (S&P500)
It is common for individual stocks to have a rm-specic risk as large as its systematic risk
Correlation and covariance matrix
The variance estimates for the individual stocks equal
2
i
2
M
+
2
(e
i
)
The o-diagonal terms are covariance values and equal
i
2
M
Portfolio construction and the single-index model
Alpha and security analysis
Most important advantage of single-index model: Framework it provides for macroeconomic/security
analysis in preparing input list critical to eciency of the optimal portfolio
The single-index model separates macroeconomic and individual-rm sources of return variation
and makes it easier to ensure consistency across analysts
Hierarchy of the preparation of the input list using the single-index model:
1. Macroeconomic analysis is used to estimate the risk premium and risk of the market index
2. Statistical analysis used to estimate s of all securities and their residual variances
2
(e
i
)
3. The portfolio manager uses the estimates for the market-index risk premium and the beta
coecient of a security to establish the expected return of that security absent any contribution
from security analysis. This market-driven expected return can be used as a benchmark
4. Security-specic expected return forecasts (s) are derived from security-valuation models
The alpha value distills the incremental risk premium attributable to private information
developed from security analysis
is more than just one of the components of expected return: It is the key variable that tells us
whether a security is a good or a bad buy
The index portfolio as an investment asset
To avoid inadequate diversication, include the S&P 500 portfolio as one of the assets
S&P 500 passive portfolio that the manager would select in the absence of security analysis
The single-index model input list
If the portfolio manager plans to compile a portfolio from a list of n actively researched rms and
a passive market index portfolio, the input list will include the following estimates:
1. Risk premium on the S&P 500 portfolio
2. Estimate of the standard deviation of the S&P 500 portfolio
3. n sets of estimates of (i) coecients, (ii) Stock residual variances, and (iii) values
The optimal risky portfolio of the single-index model
With the estimates of and coecients, plus the risk premium of the index portfolio, we can
generate the n + 1 expected returns using Eq. (9)
With the estimates of coecients and residual variances, together with the variance of the index
portfolio, we can construct the covariance matrix using Eq. (10)
Given risk premiums and the covariance matrix, we can conduct the optimization program
21
The , , and residual variance of a weighted portfolio are the simple averages of those parameters
across component securities (where the index n+1 corresponds to the market index:
n+1
=
M
=
0,
n+1
=
M
= 1, and (e
n+1
) = (e
M
) = 0):
p
=
n+1
i=1
w
i
i
p
=
n+1
i=1
w
i
i
2
(e
p
) =
n+1
i=1
w
2
i
2
(e
i
) (17)
The objective is to maximize the Sharpe ratio of the portfolio by using portfolio weights w
1
, , w
n+1
.
With this set of weights, the portfolio expected return, standard deviation, and Sharpe ratio are:
E(R
p
) =
p
+E(R
M
)
p
=
n+1
i=1
w
i
i
+E(R
M
)
n+1
i=1
w
i
i
(18)
p
= [
2
p
2
M
+
2
(e
p
)]
1/2
=
_
_
2
M
_
n+1
i=1
w
i
i
_
2
+
n+1
i=1
w
2
i
2
(e
i
)
_
_
1/2
(19)
S
p
=
E(R
p
)
p
(20)
The optimal risky portfolio trades o search for against departure from ecient diversication
The optimal risky portfolio turns out to be a combination of two component portfolios:
1. An active portfolio A comprised of the n analyzed securities
2. The market-index passive portfolio is the (n +l)-th asset included to aid in diversication
Assume rst that the active portfolio has a beta of 1:
In that case, the optimal weight in active portfolio is proportional to ratio
A
/
2
(e
A
)
This ratio balances the contribution of the active portfolio (its alpha) against its contribution
to the portfolio variance (residual variance)
The analogous ratio for the index portfolio is E(R
M
)/
2
M
and hence the initial position in the
active portfolio (i.e., if its beta were 1) is:
w
0
A
=
A
/
2
(e
A
)
E(R
M
)/
2
M
(21)
Next, we amend this position to account for the actual beta of the active portfolio
For any level of
2
A
, the correlation between the active and passive portfolios is greater when
the beta of the active portfolio is higher
This implies less diversication benet from the passive portfolio and a lower position in it.
Correspondingly, the position in the active portfolio increases:
w
A
=
w
0
A
1 + (1
A
)w
0
A
(22)
The information ratio
Sharpe ratio of optimally constructed risky portfolio > index portfolio (passive strategy):
S
2
p
= S
2
M
+
_
A
(e
A
)
_
2
(23)
The information ratio
The contribution of the active portfolio (when held in its optimal weight w
A
) to the Sharpe
ratio of the overall risky portfolio is determined by the ratio of its alpha to its residual standard
deviation, aka the information ratio
Ratio measures extra return we can obtain from security analysis compared to rm-specic
risk we incur when we over-/underweight securities relative to passive market index
22
To maximize the overall Sharpe ratio, maximize the information ratio of active portfolio
Information ratio maximized if we invest in each security its ratio of
i
/
2
(e
i
)
Rescaling so that total position in active portfolio = w
A
, weight in security i is:
w
i
= w
2
(e
i
)
_
n
i=1
2
(e
i
)
(24)
With this set of weights, the contribution of each security to the information ratio of the active
portfolio depends on its own information ratio:
_
A
(e
A
)
_
2
=
n
i=1
_
i
(e
i
)
_
2
(25)
In contrast to alpha, the market (systematic) component of the risk premium
i
E(R
M
) is oset by
the securitys nondiversiable (market) risk
2
i
2
M
and both are driven by the same beta
The beta of a security is neither vice nor virtue:
It is a property that simultaneously aects the risk and risk premium of a security
Only the aggregate of the active portfolio, rather than each individual securitys matters
The index portfolio is an ecient portfolio only if all alpha values are zero
Unless a security has ,= 0, including it in active portfolio makes portfolio less attractive
In addition to the securitys systematic risk, which is compensated for by the market risk
premium (through beta), the security would add its rm-specic risk to portfolio variance
If all securities have zero alphas, the optimal weight in the active portfolio will be zero, and
the weight in the index portfolio will be 1
However, when security analysis uncovers securities with nonmarket risk premiums (nonzero
alphas), the index portfolio is no longer ecient
Summary of the optimization procedure
1. Compute the initial position of each security in the active portfolio as w
0
i
=
i
/
2
(e
i
)
2. Scale initial positions to force
weights = 1 by dividing by their sum: w
i
= w
0
i
/
w
0
i
3. Compute the alpha of the active portfolio:
A
=
w
i
i
4. Compute the residual variance of the active portfolio:
2
(e
A
) =
w
2
i
2
(e
i
)
5. Compute the initial position in the active portfolio: w
0
A
= [
A
/
2
(e
A
)]/[E(R
M
)/
2
M
]
6. Compute the beta of the active portfolio:
A
=
w
i
i
7. Adjust the initial position in the active portfolio: w
A
= w
0
A
/[1 + (1
A
)w
0
A
]
8. Note: The optimal risky portfolio now has weights: w
M
= 1 w
A
and w
i
= w
A
w
i
9. Calculate the risk premium of the optimal risky portfolio from the risk premium of the index
portfolio and the alpha of the active portfolio: E(R
p
) = (w
M
+w
A
)E(R
M
) +w
A
10. Compute the variance of the optimal risky portfolio from the variance of the index portfolio and
the residual variance of the active portfolio:
2
p
= (w
M
+w
A
)
2
2
M
+ [w
A
(e
A
)]
2
Practical aspects of portfolio management with the index model
Is the index model inferior to the full-covariance model?
A parsimonious model that is stingy about inclusion of independent variables is often superior:
Predicting the value of the dependent variable depends on two factors: (i) The precision of
the coecient estimates and (ii) The precision of the forecasts of the independent variables
When we add variables, we introduce errors on both counts
Markowitz model more exible in for asset covariance structure compared to single-index model
Advantage illusory if we cant estimate covariances with any degree of condence
Using full-covariance matrix invokes estimation of thousands of terms Possible that cumu-
lative eect of so many estimation errors results in an inferior than the single-index model
Advantages of the single-index framework:
Clear practical advantage
Decentralizes macro and security analysis
23
The industry version of the index model
Merrill Lynch publishes a monthly Security Risk Evaluation book (aka beta book) which uses
the S&P 500 as the proxy for the market portfolio
It relies on the 60 most recent monthly observations to calculate regression parameters
Uses total returns, rather than excess returns, in the regressions Variant of index-model:
r = a +br
M
+e
instead of r r
f
= +(r
M
r
f
) +e (26)
If r
f
is constant, both equations have same independent variable r
m
and residual e
The slope coecient will be the same in the two regressions
However, the intercept that Merrill Lynch calls alpha is really an estimate of +r
f
(1 )
Merrill Lynch departs from the index model is in its use of percentage changes in price instead
of total rates of return Ignores the dividend component of stock returns
For most rms, R
2
0.5, indicating that stocks have far more rm-specic than systematic risk
Highlights the practical importance of diversication
Adjusted beta
Motivation for adjusting estimates: On average, s of stocks move toward 1 over time
As it grows, a rm diversies, expanding to similar products and later to more diverse opera-
tions. As rm becomes more conventional, it resembles rest of economy even more
Its beta coecient will tend to change in the direction of 1
Another explanation for this phenomenon is statistical:
When we estimate this beta coecient over a particular sample period, we sustain some
unknown sampling error of the estimated beta
The greater the dierence between our estimate and 1, the greater the chance that we
had a large estimation error and that in a later sample period will be closer to 1
Merrill Lynch adjusts beta estimates in a simple way:
Adjusted beta =
2
3
Sample beta +
1
3
estimates are ex post (after the fact) measures. They do not mean that anyone could have
forecast these alpha values ex ante (before the fact)
Given the R
2
of the regression and the residual standard deviation of the stock (e
i
), we can solve
for the total standard deviation
i
of stock i:
i
=
_
2
(e
i
)
1 R
2
i
= stock is monthly std. dev. for sample period Annualized std. dev. is
12
i
Predicting betas
Simple approach: Collect data on in dierent periods and then estimate a regression:
Current beta = a +b(Past beta) (27)
Given estimates of a and b, we would then forecast future betas using:
Forecast beta = a +b(Current beta) (28)
If belief that rm size/debt ratios are two determinants of , an expanded version of Eq. (27) is:
Current beta = a +b
1
(Past beta) +b
2
(Firm size) +b
3
(Debt ratio)
Variables to help predict betas:
1. Variance of earnings 3. Growth in earnings per share 5. Dividend yield
2. Variance of cash ow 4. Market capitalization (rm size) 6. Debt-to-asset ratio
Even after controlling for a rms nancial characteristics, industry group helps to predict
24
Index models and tracking portfolios
Suppose a portfolio manager believes she has identied an underpriced portfolio. The index model
equation for this portfolio is:
R
p
= .04 + 1.4R
S&P500
+e
p
(29)
Manager condent in quality of security analysis but wary about near term performance of broad
market. Wants a position that takes advantage of teams analysis but is independent of overall
market performance To this end, a tracking portfolio (T) can be constructed
Tracking portfolio
A tracking portfolio for P is designed to match the systematic component of Ps return
The idea is for the portfolio to track the market-sensitive component of Ps return
This means the tracking portfolio must have the same beta on the index portfolio as P and as
little nonsystematic risk as possible. This procedure is also called beta capture
A tracking portfolio for P will have a levered position in the S&P 500 to achieve a beta of 1.4
Therefore, T includes positions of 1.4 in the S&P 500 and -0.4 in T-bills
Because T is constructed from the index and bills, it has an alpha value of zero
Now buy P but oset systematic risk by assuming a short position in tracking portfolio
The short position in T cancels out the systematic exposure of the long position in P
The overall combined position is thus market neutral
Therefore, even if the market does poorly, the combined position should not be aected
But the alpha on portfolio P will remain intact
The combined portfolio C provides an excess return per dollar of:
R
C
= R
p
R
T
= (.04 + 1.4R
S&P500
+e
p
) 1.4R
S&P500
= .04 +e
p
(30)
While this portfolio is still risky (residual risk e
p
), the systematic risk has been eliminated, and if
P is reasonably well-diversied, the remaining nonsystematic risk will be small
Manager can take advantage of the 4% without inadvertently taking on market exposure
Process of separating the search for from the choice of market exposure is called transport
This long-short strategy is characteristic of the activity of many hedge funds
25
26
BKM - Ch. 9: The Capital Asset Pricing Model (CAPM)
Introduction
CAPM gives precise prediction of relationship between risk of an asset and its expected return
CAPM serves two vital functions:
1. It provides a benchmark rate of return for evaluating possible investments
2. Helps make a guess as to the expected return on assets not yet traded in the marketplace
Although the CAPM does not fully withstand empirical tests, it is widely used because of the insight it
oers and because its accuracy is deemed acceptable for important applications
The Capital Asset Pricing Model
Simplifying assumptions that lead to the basic version of the CAPM:
1. There are many investors, each with a wealth that is small compared to the total endowment of
all investors. Investors act as though security prices are unaected by their own trades
2. All investors plan for one identical holding period. This behavior is myopic: It ignores everything
that might happen after the end of single-period horizon. Myopic behavior is suboptimal
3. Investments limited to publicly traded nancial assets and to risk-free borrowing/lending
Rules out investment in nontraded assets such as human capital, private enterprises, and gov-
ernmentally funded assets. Assumes that investors may borrow/lend any amount at risk-free rate
4. Investors pay no taxes on returns and no transaction costs on trades in securities
5. All investors are rational M-V optimizers, they all use the Markowitz model
6. All investors analyze securities in the same way and share the same economic view of the world.
I.e., for any set of security prices, they all derive the same input list to feed into the Markowitz
model. This assumption is often referred to as homogeneous expectations
Summary of the equilibrium that will prevail in this hypothetical world of securities and investors:
1. All investors will choose to hold a portfolio of risky assets in proportions that duplicate represen-
tation of the assets in the market portfolio M, which includes all traded assets. The proportion of
each stock in the market portfolio equals the market value of the stock (price per share number
of shares outstanding) divided by the total market value of all stocks
2. Not only will the market portfolio be on the ecient frontier, but it also will be the tangency
portfolio to the optimal capital allocation line (CAL) derived by each and every investor
The capital market line (CML), the line from the risk-free rate through the market portfolio
M is also the best attainable capital allocation line. All investors hold M as their optimal risky
portfolio, diering only in the amount invested in it versus in the risk-free asset
3. The risk premium on the market portfolio will be proportional to its risk and the degree of risk
aversion of the representative investor:
E(r
M
) r
f
=
A
2
M
Where
2
M
= variance of market portfolio and
A = avg. degree of risk aversion across investors
4. The risk premium on individual assets will be proportional to the risk premium on the market
portfolio M and the beta coecient of the security relative to the market portfolio, where:
i
=
Cov(r
i
, r
M
)
2
M
And the risk premium on individual securities is:
E(r
i
) r
f
=
i
[E(r
M
) r
f
]
Why do all investors hold the market portfolio?
When we aggregate the portfolios of all individual investors, lending/borrowing cancel out, and the
value of the aggregate risky portfolio will equal the entire wealth of the economy
This is the market portfolio M
27
The CAPM implies that as individuals attempt to optimize their personal portfolios, they each
arrive at the same portfolio, with weights on each asset equal to those of the market portfolio
If all investors use identical Markowitz analysis (Assumption 5) applied to the same universe of
securities (Assumption 3) for the same time horizon (Assumption 2) and use the same input list
(Assumption 6), they all must arrive at the same composition of the optimal risky portfolio, the
portfolio on the ecient frontier identied by the tangency line from T-bills to that frontier
As a result, the optimal risky portfolio of all investors is simply a share of the market portfolio
All assets have to be included in the market portfolio
When all investors avoid a stock i, the demand is zero, and is price takes a free fall
Ultimately, stock i reaches a price attractive enough to be included in the optimal portfolio
Such price adjustment process guarantees that all stocks are included in optimal portfolio
The passive strategy is ecient
In the simple world of the CAPM, M is the optimal tangency portfolio on the ecient frontier
Thus the passive strategy of investing in a market index portfolio is ecient
We sometimes call this result a mutual fund theorem
Another incarnation of separation property: Separate portfolio selection into 2 components:
1. A technical problem: Creation of mutual funds by professional managers
2. A personal problem that depends on an investors risk aversion: Allocation of the complete
portfolio between the mutual fund and risk-free assets
The risk premium of the market portfolio
Each individual investor chooses a proportion y allocated to the optimal portfolio M such that:
y =
E(r
M
) r
f
A
2
M
(1)
Net borrowing and lending across all investors must be zero Substituting the representative
investors risk aversion
A for A, the average position in the risky portfolio is 100%, or y = 1
Risk premium on market portfolio related to its variance by avg. degree of risk aversion:
E(r
M
) r
f
=
A
2
M
(2)
Expected returns on individual securities
The contribution of one stock i to portfolio variance can be expressed as the sum of all the covariance
terms corresponding to the stock Contribution of stock i to variance of market portfolio:
w
i
[w
1
Cov(r
1
, r
i
) + +w
i
Cov(r
i
, r
i
) + +w
n
Cov(r
n
, r
i
)] (3)
When there are many stocks, there will be many more covariance terms than variance terms
Covariance of a stock with all others dominates that stocks contribution to total portfolio risk
Measure stock is contribution to market portfolio risk by its covariance with that portfolio:
Stock i contribution to variance = w
i
Cov(r
i
, r
M
)
Since r
M
=
w
k
r
k
, the covariance of return on stock i with market portfolio is:
Cov(r
i
, r
M
) = Cov
_
r
i
,
n
k=1
w
k
r
k
_
=
n
k=1
w
k
Cov(r
i
, r
k
) (4)
The contribution of our holding of stock i to the risk premium of the market portfolio is w
i
[E(r
i
)r
f
]
Therefore, the reward-to-risk ratio for investments in i is:
Stock i contribution to risk premium
Stock i contribution to variance
=
w
i
[E(r
i
) r
f
]
w
i
Cov(r
i
, r
M
)
=
E(r
i
) r
f
Cov(r
i
, r
M
)
28
The market portfolio is the tangency (ecient M-V) portfolio with reward-to-risk ratio:
Market risk premium
Market variance
=
E(r
M
) r
f
2
M
(5)
Market price of risk: Ratio in Eq. (5), quanties extra return demanded to bear portfolio risk
A basic principle of equilibrium is that all investments should oer the same reward-to-risk ratio
The reward-to-risk ratios of i and the market portfolio should be equal:
E(r
i
) r
f
Cov(r
i
, r
M
)
=
E(r
M
) r
f
2
M
(6)
Hence, the fair risk premium of stock i is:
E(r
i
) r
f
=
Cov(r
i
, r
M
)
2
M
[E(r
M
) r
f
] (7)
Expected return-beta relationship
Ratio Cov(r
i
, r
M
)/
2
M
measures contribution of stock i to variance of market portfolio as
fraction of total variance of market portfolio. This ratio is called and:
E(r
i
) = r
f
+[E(r
M
) r
f
] (8)
Assumptions that made individuals act similarly are very useful:
If everyone holds an identical risky portfolio, then everyone will nd that the beta of each asset
with the market portfolio equals the assets beta with his or her own risky portfolio
Hence everyone will agree on the appropriate risk premium for each asset
Even if one does not hold precise market portfolio, a well-diversied portfolio is so highly
correlated with market that a stocks relative to market is a useful risk measure
Modied CAPMs hold true even if dierences among individuals lead them to dierent portfolios
If CAPM relationship holds for any individual asset, it must hold for any combination of assets:
w
1
E(r
1
) = w
1
r
f
+w
1
1
[E(r
M
) r
f
]
+w
2
E(r
2
) = w
2
r
f
+w
2
2
[E(r
M
) r
f
]
+w
2
E(r
2
) = w
2
r
f
+w
2
2
[E(r
M
) r
f
]
=
+w
n
E(r
n
) = w
n
r
f
+w
n
n
[E(r
M
) r
f
]
E(r
p
) = r
f
+
p
[E(r
M
) r
f
] where E(r
p
) =
w
k
E(r
k
) and
p
=
w
k
k
This result has to be true for market portfolio itself: E(r
M
) = r
f
+
M
[E(r
M
) r
f
]
M
= 1
This also establishes 1 as the weighted-average value of beta across all assets
s greater than 1 are considered aggressive, s below 1 can be described as defensive
The security market line
CAPM relationship as a reward-risk equation: of a security = appropriate measure of its risk
because is proportional to the risk that security contributes to optimal risky portfolio
CAPM states that the securitys risk premium is directly proportional to both: (i) The beta and,
(ii) The risk premium of the market portfolio
The security market line (SML) graphs individual asset risk premiums as a function of asset risk
Relevant measure of risk for individual assets part of diversied portfolios is not assets std.
dev./variance. It is the asset contribution of the to portfolio variance (measured by assets )
The SML is valid for both ecient portfolios and individual assets
Fairly priced assets plot exactly on the SML
If stock perceived to be a good buy (underpriced) it will provide an expected return in excess
of fair return stipulated by SML Underpriced stocks plot above SML: Given their s, their
expected returns are greater than implied by CAPM. Overpriced stocks plot below the SML
Dierence between fair/actually expected rates of return on a stock is called the stocks
29
Market
0 1.5
Stock
Slope of SML =
f
r ) {
M
r ( E
Security Market Line (SML)
0 : = 1
M
)
M
r
(
E
)
r
(
E
f
r
Figure 1: The security market line
The security market line provides a benchmark for the evaluation of investment performance: Given
the risk of an investment, as measured by its beta, the SML provides the required rate of return
necessary to compensate investors for both risk as well as the time value of money
In contrast, the capital market line (CML) graphs the risk premiums of ecient portfolios (i.e.
portfolios composed of the market and the risk-free asset) as a function of portfolio
Analysis suggests: Starting point of portfolio management = passive market-index portfolio
The portfolio manager will then increase the weights of securities with positive alphas and
decrease the weights of securities with negative alphas
The CAPM is also useful in capital budgeting decisions
For a rm considering a new project, the CAPM can provide the required rate of return that
the project needs to yield, based on its beta, to be acceptable to investors
Yet another use of the CAPM is in utility ratemaking cases
In this case the issue is the rate of return that a regulated utility should be allowed to earn
on its investment in plant and equipment. The rm would be allowed to set prices at a level
expected to generate rate of returns indicated by the CAPM
The CAPM and the index model
Actual returns versus expected returns
One central prediction of CAPM: The market portfolio is a M-V ecient portfolio
However, testing its eciency has not been feasible
CAPM involves expected returns. We can only observe actual/realized HPRs
The second central set of CAPM predictions is the expected return-beta relationship
Problem of measuring expectations as well
We must make additional assumptions to make CAPM implementable and testable
The index model and realized returns
To go from expected to realized returns, use the index model in excess return form:
R
i
=
i
+
i
R
M
+e
i
(9)
Covariance between the returns on stock i and the market index
Firm-specic component independent of market wide component Cov(R
M
, e
i
) = 0
Hence, the covariance of the excess rate of return on security i with the market index is:
Cov(R
i
, R
M
) = Cov(
i
R
M
+e
i
, R
M
) =
i
2
M
Sensitivity coecient
i
in Eq. (9), which is the slope of the index model regression, equals:
i
=
Cov(R
i
, R
M
)
2
M
The index model = in CAPM relationship, except that we replace the CAPM (theoretical)
market portfolio with the well-specied/observable market index
30
The index model and the expected return-beta relationship
If the index M in Eq. (9) represents the true market portfolio, we can take the expectation of each
side of the equation to show that the index model specication is: E(r
i
) r
f
=
i
[E(r
m
) r
f
]
A comparison of the index model relationship to the CAPM expected return-beta relationship Eq.
(8) shows that the CAPM predicts that
i
should be zero for all assets
i
is the expected return in excess of (or below) fair expected return predicted by CAPM
If the stock is fairly priced, its alpha must be zero
Therefore, if we estimate the index model for several rms, using Eq. (9) as a regression, we should
nd that ex post/realized s (regression intercepts) center around zero
CAPM states that E[] = 0 for all securities, whereas index model representation of CAPM holds
that realized value of should average out to zero for sample of historical observed returns
Indirect evidence on the eciency of the market portfolio can be found in a study that estimates
values for a large sample of equity mutual funds: The distribution of s is roughly bell shaped,
with a mean that is slightly negative but statistically indistinguishable from zero
The market model
Other applicable variation on the intuition of the index model, which divides returns into rm-
specic and systematic components somewhat dierently from the index model
The market model states that the return surprise of any security is proportional to the return
surprise of the market, plus a rm-specic surprise:
r
i
E(r
i
) =
i
[r
M
E(r
M
)] +e
i
If CAPM is valid, substituting for E(r
i
) from Eq. (8), the market model equation is identical to
the index model Index model and market model are used interchangeably
Is the CAPM practical?
If CAPM was valid, single-index model in which index includes all traded securities also valid
All alpha values in security risk premiums would be identically zero
All
s = 0)
Central problem in testing predictions: The hypothesized market portfolio is unobservable
It is dicult to test the eciency of an observable portfolio, let alone an unobservable one
These problems alone make adequate testing of the model infeasible
31
The CAPM fails empirical tests
Tests use proxies such as the S&P 500 index to stand in for the true market portfolio. Assumption
that the market proxy is suciently close to the true, unobservable market portfolio
CAPM fails these tests: Data reject hypothesis that
2
P
Cov(r
P
, r
Q
)
(10)
3. Every portfolio on the ecient frontier, except for the global M-V portfolio, has a companion
portfolio on bottom (inecient) half of the frontier with which it is uncorrelated
Uncorrelated Companion portfolio called zero-beta portfolio of ecient portfolio
If we choose the market portfolio M and its zero-beta companion portfolio Z, then Eq.
(10) simplies to the CAPM-like equation:
E(r
i
) E(r
Z
) = [E(r
M
) E(r
Z
)]
Cov(r
i
, r
M
)
2
M
=
i
[E(r
M
) E(r
Z
)] (11)
Eq. (11) resembles the SML of the CAPM, except that the risk-free rate is replaced with the
expected return on the zero-beta companion of the market index portfolio
Eq. (11): CAPM equation for investors restricted when borrowing/investing in risk-free asset
Because average returns on the zero-beta portfolio are greater than observed T-bill rates, the zero-
beta model can explain why average estimates of alpha values are positive for low-beta securities
and negative for high-beta securities, contrary to the prediction of the CAPM
Labor income and nontraded assets
An important departure from realism is the CAPM assumption that all risky assets are traded.
Two important asset classes that are not traded are:
1. Human capital
2. Privately held businesses
Privately held business may be the lesser of the two sources of departures from the CAPM
Nontraded rms can be incorporated or sold at will, save for liquidity considerations
Owners of private business also can borrow against their value
Suppose that privately held business have similar risk characteristics as those of traded assets
Individuals can partially oset the diversication problems posed by their nontraded en-
trepreneurial assets by reducing their portfolio demand for securities of similar, traded assets
CAPM equation may not be greatly disrupted by presence of entrepreneurial income
To the extent that risk characteristics of private enterprises dier from those of traded securities,
a portfolio of traded assets that best hedges the risk of typical private business would enjoy excess
demand from the population of private business owners
The price of assets in this portfolio will be bid up relative to the CAPM considerations, and
the expected returns on these securities will be lower in relation to their systematic risk
Adding proprietary income to a standard asset-pricing model improves its predictive performance
The size of labor income and its special nature is of greater concern for validity of CAPM
Despite individuals borrowing against labor income (via a home mortgage) and reducing some
uncertainty about future labor income via life insurance, human capital is less portable across
time and more dicult to hedge using traded securities than nontraded business
This induces pressure on security prices and results in departures from CAPM equation
Equilibrium expected return-beta equation for an economy in which individuals are endowed with
labor income of varying size relative to their nonlabor capital: The resultant SML equation is:
E(R
i
) = E(R
M
)
Cov(R
i
, R
M
) +
P
H
P
M
Cov(R
i
, R
H
)
2
M
+
P
H
P
M
Cov(R
M
, R
H
)
(12)
Where P
H
= Value of aggregate human capital
P
M
= Market value of traded assets (market portfolio)
P
H
= Excess rate of return on aggregate human capital
33
The CAPM measure of systematic risk is replaced in the extended model by an adjusted beta
that also accounts for covariance with the portfolio of aggregate human capital
Because we expect Cov(R
i
, R
H
) to be positive for the average security, the risk premium in
this model will be greater, on average, than predicted by the CAPM for securities with beta
less than 1, and smaller for securities with beta greater than 1
Model predicts a security market line less steep than that of standard CAPM
This may help explain the average negative alpha of high-beta securities and positive alpha of
low-beta securities that lead to the statistical failure of the CAPM
A multiperiod model and hedge portfolios
Merton relaxes the single-period myopic assumptions about investors
He envisions individuals who optimize a lifetime consumption/investment plan, and who con-
tinually adapt consumption/investment decisions to current wealth/planned retirement age
When uncertainty about portfolio returns is the only source of risk and investment opportuni-
ties remain unchanged through time (no change in probability distribution of return on market
portfolio or individual securities) Mertons intertemporal capital asset pricing model
(ICAPM) predicts same expected return-beta relationship as single-period model
But the situation changes when we include additional sources of risk, of two general kinds:
1. One concerns changes in the parameters describing investment opportunities, such as future
risk-free rates, expected returns, or the risk of the market portfolio
Investors will sacrice some expected return if they can nd assets whose returns will be
higher when other parameters (e.g., the risk-free rate) change adversely
2. The other additional source of risk concerns the prices of the consumption goods that can be
purchased with any amount of wealth (e.g. ination risk)
Investors may be willing to sacrice some expected return to purchase securities whose
returns will be higher when the cost of living changes adversely
Suppose we identify K sources of extramarket risk and nd K associated hedge portfolios
Mertons ICAPM equation generalizes the SML to a multi-index version:
E(R
i
) =
iM
E(R
M
) +
K
k=1
iK
E(R
K
) where
iK
= beta on k-th hedge portfolio (13)
A consumption based CAPM
The logic of the CAPM together with the hedging demands noted in the previous subsection
suggests that it might be useful to center the model directly on consumption
In a lifetime consumption plan, investor must in each period balance allocation of current
wealth between todays consumption and savings/investment supporting future consumption
When optimized, utility value from additional dollar of consumption today = utility value of
expected future consumption nanced by that additional dollar of wealth
Future wealth will grow from: (i) Labor income, and (ii) Returns on that dollar when invested
in the optimal complete portfolio
As a general rule, investors will value additional income more highly during dicult economic times
An asset will therefore be viewed as riskier in terms of consumption if it has positive covariance
with consumption growth Equilibrium risk premiums will be greater for assets that exhibit
higher covariance with consumption growth
We can write the risk premium on an asset as a function of its consumption risk as:
E(R
i
) =
iC
RP
C
(14)
Portfolio C may be interpreted as a consumption-tracking portfolio,i.e. the portfolio with
the highest correlation with consumption growth
iC
= slope in regression of asset is excess returns on consumption-tracking portfolio
RP
C
= risk premium associated with consumption uncertainty: RP
C
= E(R
C
) = E(r
C
) r
f
34
Consumption-tracking portfolio in CCAPM plays role of market portfolio in CAPM
Focus on risk of consumption opportunities rather than risk/return of portfolios dollar value
Excess return on consumption-tracking portfolio Excess return on market portfolio
In contrast to CAPM,
M
on the market factor of the CCAPM is not necessarily 1
Advantage: Compactly incorporates consumption hedging/changes in investment opportunities
However, consumption growth gures are published infrequently (monthly at the most) com-
pared with nancial assets, and are measured with signicant error
Liquidity and the CAPM
Standard models of asset pricing assume frictionless markets (securities can be traded costlessly)
In reality, trading (and trading costs) will be of great importance to investors
Liquidity
The liquidity of an asset is the ease and speed with which it can be sold at fair market value
Part of liquidity is the cost of engaging in a transaction, particularly the bid-ask spread
Another part is price impact: Adverse movement in price encountered when attempting large trade
Another part is immediacy: Ability to sell quickly without reverting to re-sale prices
Conversely, illiquidity can be measured in part by the discount from fair market value seller must
accept if asset is to be sold quickly. A perfectly liquid asset would entail no illiquidity discount
Stock investors pay high price for liquidity
Overall, least-liquid stocks average 8.5%/year higher return than most-liquid stocks
On average, a 1% point increase in spread is associated with a 2.5% higher annual return (NYSE)
The relationship held after results were adjusted for size and other risk factors
Cost of spread incurred for each trade Illiquid stocks can quickly become prohibitively expensive
Liquidity (or lack of it) is an important characteristic that aects asset values
Legal cases: Courts apply steep discounts to values of businesses not publicly traded
Liquidity is increasingly viewed as an important determinant of prices and expected returns
Early models of liquidity focused on the inventory management problem faced by security dealers
Dealers in OTC markets post prices at which they buy (the bid price) or sell (the ask price)
The fee they earn for supplying this liquidity is the bid-ask spread
Part of the bid-ask spread is compensation for bearing price risk involved in holding an inventory
of securities and allowing inventory levels to absorb uctuations in overall security demand
Assuming the fair price of the stock is the average of the bid and ask prices, an investor pays half
the spread upon purchase and another half upon sale of the stock
The spread is one important component of liquidity: It is the cost of transacting in a security
In electronic markets, the limit-order book contains the inside spread: Dierence between highest
price at which some investor will purchase and lowest price at which another investor will sell
The eective bid-ask spread will also depend on the size of the desired transaction
Greater emphasis today on the component of the spread that is due to asymmetric information
Traders who post prices need to be worried about being picked o by better-informed traders who
hit their limit prices only when they are out of line with the intrinsic rm value
Investors trade securities for two reasons:
1. Some trades are driven by noninformational motives
E.g., selling assets to raise cash for a big purchase, or even just for portfolio rebalancing
These trades, not motivated by private info, are called noise trades
Security dealers will earn a prot from the bid-ask spread when transacting with noise traders
(also called liquidity traders because their trades may derive from needs for liquidity)
2. Other transactions are motivated by private information known only to the seller or buyer
These transactions are generated when traders believe they have come across information that
a security is mispriced, and try to prot from that analysis
Information traders impose a cost on both dealers and other investors who post limit orders
The greater the relative importance of information traders, the greater the required spread to
compensate for the potential losses from trading with them
35
Discount in a security price from illiquidity can be very large, far larger than bid-ask spread
The PV of all three future trading costs (spreads) are discounted into the current price
For any given spread, the price discount increases almost in proportion to frequency of trading
For 1% spread, if security is traded once a year forever, current illiquidity cost = immediate cost
plus PV of a .5% perpetuity. At 5% discount rate, .005 +.005/.05 = 10.5%
The reduction in the rate of return due to trading costs is lower the longer the security is held
In equilibrium, investors with long holding periods, on average, hold more of the illiquid securities
Short-horizon investors will more strongly prefer liquid securities
This clientele eect mitigates the eect of the bid-ask spread for illiquid securities
The end result is that liquidity premium should increase with bid-ask spread at a decreasing rate
What about unanticipated changes in liquidity?
Investors may also demand compensation for liquidity risk
The bid-ask spread of a security is not constant through time, nor is the ability to sell a security
at a fair price with little notice. Both depend on overall conditions in security markets
There may be a systematic component to liquidity risk that aects the equilibrium rate of return
and hence the expected return-beta relationship
Acharya and Pedersen consider the impacts of both the level and the risk of liquidity on security pricing
They include three components to liquidity risk: Each captures the extent to which liquidity varies
systematically with other market conditions
Therefore, expected return depends on expected liquidity, as well as the conventional CAPM
beta and three additional liquidity-related betas:
E(R
i
) = kE(C
i
) +( +
L1
L2
L3
) (15)
where E(C
i
) = Expected cost of illiquidity
k = Adjustment for average holding period over all securities
= Market risk premium net of average market illiquidity cost, E[R
M
C
M
]
= Measure of systematic market risk
Li
= Liquidity betas
Compared to the conventional CAPM, the expected return-beta equation now has a predicted
rm-specic component that accounts for the eect of security liquidity
Such an eect would appear to be an alpha in the conventional index model
The market risk premium itself is measured net of the average cost of illiquidity = E[R
M
C
M
],
where C
M
is the market-average cost of illiquidity
Overall risk of each security accounts for three elements of liquidity risk:
1.
L1
measures sensitivity of the securitys illiquidity to market illiquidity. Investors want extra
compensation for holding security that becomes illiquid when general liquidity is low
L1
=
Cov[C
i
, C
M
]
V ar[R
M
C
M
]
2.
L2
measures the sensitivity of the stocks return to market illiquidity. This coecient appears
with a negative sign in Eq. (15) because investors are willing to accept a lower average return
on stocks that will provide higher returns when market illiquidity is greater
L2
=
Cov[R
i
, C
M
]
V ar[R
M
C
M
]
3.
L3
measures sensitivity of security illiquidity to the market rate of return. This sensitivity
also appears with a negative sign, because investors accept a lower average return on securities
that can be sold more easily (have low illiquidity costs) when the market declines
L1
=
Cov[C
i
, R
M
]
V ar[R
M
C
M
]
All liquidity variants improve on the explanatory power of the CAPM
36
BKM - Ch. 10: Arbitrage pricing theory and multifactor models of risk and return
Introduction
Arbitrage: The exploitation of security mispricing so that risk-free prots can be earned
Involves simultaneous purchase/sale of equivalent securities to prot from price discrepancies
The most basic principle of capital market theory is that equilibrium market prices are rational in
that they rule out arbitrage opportunities
If actual security prices allow for arbitrage, the result will be strong pressure to restore equilibrium
Security markets ought to satisfy a no-arbitrage condition
Multifactor models of security returns can be used to measure and manage exposure to each of many
economy-wide factors such as business-cycle risk, interest or ination rate risk, energy price risk
Factor models combined with a no-arbitrage condition lead to simple relationship between expected
return and risk This approach to risk-return trade-o is called Arbitrage Pricing Theory (APT)
Multifactor models: An overview
Sometimes, rather than using market proxy, more useful to focus directly on ultimate sources of risk
Useful in risk assessment when measuring ones exposures to particular sources of uncertainty
Factor models allow to quantify the dierent factors aecting the rate of return on a security
Factor models of security returns
Uncertainty in asset returns has two sources:
1. A common or macroeconomic factor
2. Firm-specic events
The common factor is constructed to have zero expected value, because we use it to measure new
information concerning the macro-economy which, by denition, has zero expected value
If F is the deviation of the common factor from its EV,
i
the sensitivity of rm i to that factor, and
e
i
the rm-specic disturbance, factor models state that the actual return on rm i equals its ini-
tially expected return plus a (zero EV) random amount attributable to unanticipated economywide
events, plus another (zero EV) random amount attributable to rm-specic events:
r
i
= E(r
i
) +
i
F +e
i
(1)
Example: Factor models
Suppose that macro factor F is taken to be news about state of business cycle, measured by
unexpected % change in GDP, and that consensus is that GDP will increase by 4% this year
Suppose stocks value is 1.2. If GDP increases by only 3%, then F = 1%
This disappointment would translate into a return on the stock that is 1.2% lower than expected
When we estimate a single-index regression, we implicitly impose an (incorrect) assumption that each
stock has the same relative sensitivity to each risk factor
A more explicit representation of systematic risk, allowing for dierent stocks exhibiting dierent
sensitivities to its various components, is a useful renement of the single-factor model
multifactor models can provide better descriptions of security returns
Apart from their use in building models of equilibrium security pricing, multifactor models are
useful in risk management applications: These models give us a simple way to measure our exposure
to various macroeconomic risks, and construct portfolios to hedge those risks
Two-factor model
Macroeconomic sources of risk: Unanticipated growth in GDP and changes in interest rates:
r
i
= E(r
i
) +
iGDP
GDP +
iIR
IR +e
i
(2)
Both macro factors have zero EV: They represent unanticipated changes in these variables
The coecients of each factor in Eq. (2) measure the sensitivity of share returns to that factor
Coecients are called factor sensitivities, factor loadings, or factor betas
Increase in interest rates is bad news for most rms Interest rate betas generally negative
37
Factor betas can provide a framework for a hedging strategy:
The idea for investors who wish to hedge a source of risk is to establish an opposite factor
exposure to oset that source of risk
A multifactor Security Market Line (SML)
The multifactor model is no more than a description of the factors that aect security returns.
Where does E(r) comes from, i.e., what determines a securitys expected rate of return
A multifactor index model leads to a multifactor SML in which the risk premium is determined by
exposure to each systematic risk factor, and by risk premium associated with those factors
E.g., in a two-factor economy in which risk exposures can be measured by Eq. (2), we would
conclude that the expected rate of return on a security i would be the sum of:
1. The risk-free rate of return
2. [Sensitivity to GDP risk (
iGDP
)] [Risk premium for bearing GDP risk]
3. [Sensitivity to interest rate risk (
iIR
)] [Risk premium for bearing interest rate risk]
Hence, the two-factor security market line:
E(r) = r
f
+
GDP
RP
GDP
+
IR
RP
IR
(3)
One dierence between single/multiple factor economy: A factor risk premium can be negative
E.g., a security with a positive interest rate beta performs better when rates increase, and thus
would hedge the value of a portfolio against interest rate risk
Investors accept a lower rate of return (negative risk premium) as cost of hedging attribute
How to estimate the risk premium for each factor?
Like CAPM, the risk premium for each factor is the risk premium of a portfolio that has
= 1.0 on that particular factor and = 0 on all other factors
I.e., it is the risk premium one expects to earn by taking a pure play on that factor
Arbitrage pricing theory
Like CAPM, APT predicts a SML linking expected returns to risk. APT relies on 3 key propositions:
1. Security returns can be described by a factor model
2. There are sucient securities to diversify away idiosyncratic risk
3. Well-functioning security markets do not allow for the persistence of arbitrage opportunities
Arbitrage, risk arbitrage, and equilibrium
Arbitrage opportunity: When investor can earn riskless prots without making a net investment
E.g., if shares of a stock sold for dierent prices on two dierent exchanges
The Law of One Price
If two assets are equivalent in all economically relevant respects, then same market price
The Law of One Price is enforced by arbitrageurs: They will bid up the price where it is low
and force it down where it is high until the arbitrage opportunity is eliminated
Market prices move to rule out arbitrage opportunities: Fundamental concept in capital markets
The critical property of a risk-free arbitrage portfolio is that any investor, regardless of risk
aversion or wealth, will want to take an innite position in it
Because those large positions will quickly force prices up or down until the opportunity vanishes,
security prices should satisfy a no-arbitrage condition
Dierence between arbitrage/risk-return dominance arguments in support of equilibrium prices:
A dominance argument holds that when an equilibrium price relationship is violated, many
investors will make limited portfolio changes, depending on their degree of risk aversion
Aggregation of these limited portfolio changes is required to create a large volume of buying
and selling, which in turn restores equilibrium prices
By contrast, with arbitrage, each investor wants to take as large a position as possible
Few investors needed to bring about price pressures necessary to restore equilibrium
Therefore, implications for prices derived from no-arbitrage arguments are stronger than im-
plications derived from a risk-return dominance argument
CAPM: Example of dominance argument, implying that all investors hold M-V ecient portfolios
38
Arbitrageur often refers to a professional searching for mispriced securities in specic areas such
as merger-target stocks, rather than to one who seeks strict (risk-free) arbitrage opportunities
Such activity is sometimes called risk arbitrage to distinguish it from pure arbitrage
Well-diversied portfolios
If we construct an n-stock portfolio with weights w
i
, then the rate of return on this portfolio is:
r
p
= E(r
p
) +
p
F +e
p
where
p
=
w
i
i
and E(r
p
) =
w
i
E(r
i
) (4)
The portfolio nonsystematic component (which is uncorrelated with F) is e
p
=
w
i
e
i
The portfolio variance is:
2
p
=
2
p
2
F
+
2
(e
p
)
Where
2
F
is the variance of the factor F and
2
(e
p
) is the nonsystematic risk of the portfolio:
2
(e
p
) = V ar
_
w
i
e
i
_
=
w
2
i
2
(e
i
)
If the portfolio was equally weighted w
i
= 1/n, then the nonsystematic variance would be:
2
(e
p
) = V ar
_
w
i
e
i
_
=
_
1
n
_
2
2
(e
i
) =
1
n
2
(e
i
)
n
=
1
n
2
(e
i
)
Where the last term is the average value across securities of nonsystematic variance
When n is large, nonsystematic variance approaches zero Eect of diversication
Well-diversied portfolio
A portfolio that is diversied over a large enough number of securities, with each weight w
i
small enough that for practical purposes the nonsystematic variance
2
(e
p
) is negligible
Because the expected value of e
p
for any well-diversied portfolio is zero, and its variance also
is eectively zero, we can conclude that any realized value of e
p
will be virtually zero
Hence, for a well-diversied portfolio, for all practical purposes:
r
p
= E(r
p
) +
p
F
Betas and expected returns
Because nonfactor risk can be diversied away, only factor risk should command a risk premium
Portfolios with same beta
Well-diversied portfolio A with
A
= 1 for systematic factor F: E(r
A
)+
A
F = 10%+1.0F
Consider another well-diversied portfolio B with an expected return of 8% and
B
= 1
Could portfolios A and B coexist with the return pattern depicted? Clearly not
If one sells short $1 million of B and buy $1 million of A, a zero net investment strategy, the
riskless payo would be $20,000:
(.10 + 1.0 F) $1M from long position in A
(.08 + 1.0 F) $1M from short position in B
.02 $1M = $20, 000 net proceeds
The prot is risk-free because the factor risk cancels out across the long and short positions
Moreover, the strategy requires zero net investment
One should pursue it on an innitely large scale until the return discrepancy disappears
Well-diversied portfolios with equal s must have equal expected returns in equilibrium
Portfolios with dierent betas
Risk-free rate = 4%. Well-diversied portfolio C ( = .5) has expected return of 6%
Consider new portfolio D composed of half of portfolio A and half of the risk-free asset
Portfolio Ds = .5 0 +.5 1.0 = .5, and expected return = 5 4 +.5 10 = 7%
D has same beta but greater expected return than C Arbitrage opportunity
39
to macro factor
with respect
F
1.0 0.5
10
7
6
D
C
A
= 4
f
r
Risk Premium
0
)
r
(
E
Figure 1: An arbitrage opportunity
We conclude that, to preclude arbitrage opportunities, the expected return on all well-diversied
portfolios must lie on the straight line from the risk-free asset in Fig. 1
The equation of this line will dictate the expected return on all well-diversied portfolios
Risk premiums are indeed proportional to portfolio betas
Risk premium shown by vertical arrow (distance between risk-free rate/expected return)
The risk premium is zero for = 0 and rises in direct proportion to
The one-factor security market line
Consider the market index portfolio M as a well-diversied portfolio, and let us measure the
systematic factor as the unexpected return on that portfolio
Because index portfolio must be on the line in Fig. 1 and
M
= 1, the equation of the line is:
E(r
p
) = r
f
+ [E(r
M
) r
f
]
p
(5)
Arbitrage and the security market line
Suppose market index is well-diversied with expected return 10% and deviations of its return
from expectation (r
M
10%) serve as systematic factor. T-bill rate is 4%
Then the SML, Eq. (5) implies that the expected rate of return on well-diversied portfolio E
with a beta of 2/3 should be 4% + 2/3(10 4) = 8%
If its expected return actually is 9%, then there will be an arbitrage opportunity
Buy $1 worth of E and sell $1 of portfolio invested 1/3 in T-bills and 2/3 in market
Risk-free prot per dollar invested = Deviation of expected return on E from SML
Unlike CAPM, APT does not require benchmark SML portfolio = true market portfolio
Any well-diversied portfolio lying on the SML may serve as the benchmark portfolio
APT more exible than CAPM (unobservable market portfolio not a concern)
APT provides further justication for index model in practical SML implementation
Individual assets and the APT
If no arbitrage, each well-diversied portfolios expected excess return must be proportional to its
If satised by all well-diversied portfolios, it must be satised by almost all individual securities
Suppose that the expected return-beta relationship is violated for all single assets. Now create a
pair of well-diversied portfolios from these assets
What are the chances that in spite of the fact that for any pair of assets the relationship does not
hold, the relationship will hold for both well-diversied portfolios? The chances are small
Now construct yet a third well-diversied portfolio. What are the chances that the violations of the
relationships for single securities are such that the third portfolio also will fulll the no-arbitrage
expected return-beta relationship? Obviously, the chances are smaller still
Continue with a fourth well-diversied portfolio, and so on. If the no-arbitrage expected return-
beta relationship has to hold for innitely many dierent, well-diversied portfolios, it must be
virtually certain that the relationship holds for all but a small number of individual securities
40
Imposing no-arbitrage condition implies maintenance of the expected return-beta relationship for all
well-diversied portfolios and for all but possibly a small number of individual securities
The APT and the CAPM
The APT serves many of the same functions as the CAPM
Rate of return benchmark for capital budgeting/security valuation/performance evaluation
APT highlights the crucial distinction between nondiversiable risk (factor risk) that requires
a reward in the form of a risk premium and diversiable risk that does not
Advantages of the APT
Depends on assumption that rational equilibrium in markets precludes arbitrage opportunities
Uses diversied portfolio that can be constructed practically from large number of securities
In contrast, the CAPM is derived assuming an inherently unobservable market portfolio
The CAPM argument rests on mean-variance eciency
Disadvantages of the APT
The APT does not fully dominate the CAPM
CAPM: Unequivocal statement on expected return-beta relationship for all securities
APT: Implies that this relationship holds for all but perhaps a small number of securities
APT cannot rule out a violation of expected return-beta relationship for any particular asset
A multifactor APT
Multifactor version of the APT to accommodate multiple sources of risk. E.g., two-factor model:
r
i
= E(r
i
) +
i1
F
1
+
i2
F
2
+e
i
(6)
Factor portfolio
Well-diversied portfolio with = 1 on one of the factors and = 0 on all other factor
Think of factor portfolio as a tracking portfolio: Returns on such portfolio track evolution of
particular sources of macroeconomic risk, but are uncorrelated with other sources of risk
It is possible to form such factor portfolios because we have a large number of securities to choose
from, and a relatively small number of factors
Factor portfolios will serve as the benchmark portfolios for a multifactor security market line
The multifactor APT states that the overall risk premium on a portfolio must equal the sum of the risk
premiums required as compensation for each source of systematic risk
The factor exposures of any portfolio P are given by its betas
P1
and
P2
A competing portfolio Q can be formed by investing in factor portfolios with the following weights:
P1
in the rst factor portfolio
P2
in the second factor portfolio
1
P1
P2
in T-bills
By construction, portfolio Q will have betas equal to those of portfolio P and expected return of:
E(r
Q
) =
P1
E(r
1
) +
P2
E(r
2
) +(1
P1
P2
)r
f
= r
f
+
P1
[E(r
1
) r
f
] +
P2
[E(r
2
) r
f
] (7)
Any well-diversied portfolio with
P1
/
P2
must have Eq. (7) return if no arbitrage opportunities
Extension of multifactor SML to individual assets: Same as for one-factor APT
Eq. (7) cannot be satised by all well-diversied portfolio unless satised by virtually every security
individually Eq. (7) represents multifactor SML for economy with multiple sources of risk
Multifactor APT can be used to provide fair rates of return for regulated utilities, like CAPM
Where should we look for factors?
Multifactor APT gives no guidance to determine relevant risk factors/their risk premiums
Two principles guide us when we specify a reasonable list of factors:
1. Limit ourselves to systematic factors with considerable ability to explain security returns
2. Choose factors likely to be important risk factors, i.e. factors that concern investors suciently
that they will demand meaningful risk premiums to bear exposure to those sources of risk
41
One of many possible sets that might be considered:
IP = % change in industrial production
EI = % change in expected ination
UI = % change in unanticipated ination
CG = Excess return of long-term corporate bonds over long-term government bonds
GB = Excess return of long-term government bonds over T-bills
Thus, ve-factor model of security returns during holding period t:
r
it
=
i
+
iIP
IP
t
+
iEI
EI
t
+
iUI
UI
t
+
iGC
GC
t
+
iGB
GB
t
+e
it
(8)
The Fama-French (FF) three-factor model
An alternative approach to specifying macroeconomic factors as candidates for relevant sources of
systematic risk uses rm characteristics that seem to proxy for exposure to systematic risk
Factors are variables that seem to predict avg. returns well and thus may capture risk premiums
Fama and French three-factor model:
r
it
=
i
+
iM
R
Mt
+
iSMB
SMB
t
+
iHML
HML
t
+e
it
(9)
SMB: Small Minus Big = Return of small stocks portfolio in xs of large stocks portfolio
HML: High Minus Low = Return of a portfolio of stocks with a high book-to-market ratio in
excess of the return on a portfolio of stocks with a low book-to-market ratio
Market index expected to capture systematic risk originating from macroeconomic factors
These two rm-characteristic variables are chosen because of observations that corporate capital-
ization (rm size) and book-to-market ratio predict deviations of avg. stock returns from CAPM
While SMB and HML are not themselves obvious candidates for relevant risk factors, the hope
is that these variables proxy for yet-unknown more fundamental variables
Fama and French: Firms with high ratios of book-to-market value are more likely to be in
nancial distress and small stocks are more sensitive to changes in business conditions
These variables may capture sensitivity to risk factors in the macroeconomy
The problem with empirical approaches which use proxies for extramarket sources of risk, is that
factors in the models cannot be clearly identied as hedging signicant source of uncertainty
The rm-characteristic basis of Fama-French factors raises the question of whether they reect
APT model or approximation to a multi-index ICAPM based on extra-market hedging demands
Important distinction for the debate over the proper interpretation of the model
The validity of FF-style models may constitute either: (i) A deviation from rational equilibrium
(as there is no rational reason to prefer one or another of these rm characteristics per se),
or (ii) That rm characteristics identied as empirically associated with average returns are
correlated with other (yet unknown) risk factors
The multifactor CAPM and the APT
Multi-index CAPM inherits risk factors from risks that investors deem important enough to hedge
If hedging demands are common to many investors, prices of securities with desirable hedging
characteristics will be bid up and their expected return reduced. This process requires a multifactor
model to explain expected returns, where each factor arises from particular hedging motive
Risk sources that are priced in market equilibrium presumably will be systematic sources of
uncertainty that aect investors broadly
In contrast, the APT is largely silent on where to look for priced sources of risk
Lack of guidance problematic, but accommodates less structured search for risk factors
42
BKM - Ch. 11: The ecient market hypothesis
Random walks and the ecient market hypothesis
Attempts to nd recurrent patterns in stock price movements are likely to fail
A forecast about favorable future performance leads instead to favorable current performance, as
market participants all try to get in on the action before the price jump
Any info that could be used to predict stock performance should already be reected in stock prices
New information and random walk
If prices are bid immediately to fair levels, they increase/decrease only in response to new info
New info, by denition, must be unpredictable: Stock prices that change in response to new
(unpredictable) information also must move unpredictably
This is the essence of the argument that stock prices should follow a random walk
Far from market irrationality, randomly evolving stock prices are the necessary consequence of
intelligent investors competing to discover relevant information on which to buy/sell
If stock price movements were predictable, then evidence of stock market ineciency, because
ability to predict prices indicates that all available info is not already reected in prices
Ecient market hypothesis (EMH)
EMH: The notion that stocks prices already reect all available information
Dramatic evidence of rapid response to new information found in intraday prices: Most of price
response to corporate dividend/earnings announcements occurs within 10 minutes
Competition as the source of eciency
Why should we expect stock prices to reect all available information?
If willing to spend time/money on gathering info, it might seem reasonable that you could turn
up something that has been overlooked by the rest of the investment community
When information is costly to uncover and analyze, one would expect investment analysis
calling for such expenditures to result in an increased expected return
The degree of eciency diers across various markets
E.g., emerging markets that are less intensively analyzed than US markets and in which ac-
counting disclosure requirements are less rigorous may be less ecient than US markets
Although it may not literally be true that all relevant information will be uncovered, competition
among many well-backed, highly paid, aggressive analysts ensures that, as a general rule, stock
prices ought to reect available information regarding their proper levels
Versions of the ecient market hypothesis
Three versions of the EMH: (i) The weak, (ii) The semistrong, and (iii) The strong form
Versions dier by their notions of what is meant by the term all available information
The weak form hypothesis
Asserts that stock prices already reect all information that can be derived by examining
market trading data such as the history of past prices, trading volume, or short interest
Implies that trend analysis is fruitless since past stock price data are publicly available/costless
The weak-form hypothesis holds that if such data ever conveyed reliable signals about future
performance, all investors already would have learned to exploit the signals
The semistrong form hypothesis
States that all publicly available info must be reected already in the stock price
Such info includes, in addition to past prices, fundamental data on rms product line, quality
of management, balance sheet, patents held, earning forecasts, accounting practices
The strong form hypothesis
States that stock prices reect all info, even info available only to company insiders
This version of the hypothesis is quite extreme
Few would argue with the proposition that corporate ocers have access to pertinent info long
enough before public release to enable them to prot from trading on that info
43
Implications of the EMH
Technical analysis
Technical analysis is essentially the search for recurrent and predictable patterns in stock prices
Although technicians recognize the value of info regarding future economic prospects of the
rm, they believe that such info is not necessary for a successful trading strategy
If stock price responds slowly enough, can identify/exploit trend during adjustment period
Key to technical analysis: Slow response of prices to fundamental supply-and-demand factors
This prerequisite, of course, is diametrically opposed to the notion of an ecient market
Example of technical analysis - The relative strength approach
Chartist compares stock performance over recent period to market/stocks in same industry
Strength of a stock presumably may continue for long enough to oer prot opportunities
Resistance levels or support levels
Price levels above which it is dicult for stock prices to rise, or below which it is unlikely for
them to fall, and they are believed to be levels determined by market psychology
The ecient market hypothesis implies that technical analysis is without merit
Will a technical rule that seems to work continue to work in the future once widely recognized
Once discovered, a useful tech rule ought to fail invalidated when mass of traders exploit it
In this sense, price patterns ought to be self-destructing
Thus market dynamic: Continual search for protable trading rules, followed by destruction
by overuse of successful rules, followed by more search for yet-undiscovered rules
Fundamental analysis
Uses earnings/dividend prospects, expectations of future interest rates, and risk evaluation of the
rm to determine proper stock prices. Represents an attempt to determine the PV of all the
payments a stockholder will receive from each share of stock
If value exceeds stock price, fundamental analyst recommends purchasing the stock
Fundamental analysts usually start with a study of past earnings/company balance sheets
Supplement analysis with detailed economic analysis, including evaluation of the rms man-
agement quality, rms standing within industry, and prospects for industry
The EMH predicts that most fundamental analysis also is doomed to failure
Only analysts with a unique insight will be rewarded
Fundamental analysis much more dicult than identifying well-run rms with good prospects
Trick is not to identify good rms, but rms better than everyone elses estimate
Similarly, poorly run rms can be great bargains if not as bad as stock prices suggest
Active versus passive portfolio management
Limitations of active portfolio management
Casual eorts to pick stocks are not likely to pay o
Only serious/uncommon analyses generate dierential insight needed to yield prots
Moreover, these techniques are economically feasible only for managers of large portfolios
The small investor probably is better o investing in mutual funds. By pooling resources in this
way, small investors can gain from economies of scale
EMH proponents believe active management is wasted eort/unlikely to justify expenses
Therefore, they advocate a passive investment strategy (no attempt to outsmart market)
Passive strategy aims only at building well-diversied portfolio without attempting to nd
under-/overvalued stocks, usually characterized by a buy-and-hold strategy
One common strategy for passive management is to create an index fund, which is a fund
designed to replicate the performance of a broad-based index of stocks
Exchange-traded funds (ETFs) are a (lower-expense) alternative to indexed mutual funds
These are shares in diversied portfolios that can be bought/sold like shares of individual stock
ETFs matching several broad stock market indexes are available to investors who want to hold
a diversied sector of a market without attempting active security selection
Hybrid strategy common: Fund maintains passive core, plus actively managed portfolio(s)
44
The role of portfolio management in an ecient market
There is a role for rational portfolio management, even in perfectly ecient markets
Even if all stocks are priced fairly, rm-specic still need to be diversied away
Therefore, rational security selection, even in an ecient market, calls for the selection of a
well-diversied portfolio providing the systematic risk level that the investor wants
Rational investment policy also requires that tax considerations be reected in security choice
Another argument for rational portfolio management relates to investors particular risk prole
Investors of varying ages need dierent portfolio policies with regard to risk bearing
Investors optimal positions vary with age, tax bracket, risk aversion, and employment
Role of portfolio manager is to tailor portfolio to these needs, rather than to beat the market
Resource allocation
Deviations from eciency allow better-informed traders to prot from less-informed traders
However, deviations from informational eciency would also result in a large cost that will be
borne by all citizens, namely, inecient resource allocation
Capital market prices guide resource allocation because security mispricing could entail severe
social costs by fostering inappropriate investments on the real side of the economy
Corporations with overpriced stocks would obtain capital too cheaply, and corporations with
undervalued stocks might forgo investment opportunities because raising capital too expensive
Inecient capital markets would diminish one of the most potent benets of a market economy
Event studies
The notion of informationally ecient markets leads to a powerful research methodology
If security prices reect all currently available info, then price changes must reect new info
Measure the importance of an event by examining price changes when the event occurs
Event study
Technique of empirical nancial research to assess impact of an event on rms stock price
Isolating the part of a stock price movement that is attributable to a specic event is not trivial
Starts with a proxy for what the stocks return would have been in absence of the event
Abnormal return due to event estimated as [stocks actual return] [this benchmark]
Many researchers use a market model to estimate abnormal returns (based on index models):
The stock return r
t
during a given period t would be expressed as:
r
t
= a +br
Mt
+e
t
e
t
= r
t
(a +br
Mt
) (1)
Firm-specic/abnormal return e
t
interpreted as unexpected return resulting from event
Market model is highly exible, it can be generalized to richer models of benchmark returns
E.g., by including industry as well as broad market returns on the right-hand side of Eq. (1),
or returns on indexes constructed to match characteristic such as rm size
However, intercept a/slope b must be estimated using data suciently separated in time from
the event so that they are not aected by event-period abnormal stock performance
We measure the impact of an event by estimating the abnormal return on a stock (or group of stocks)
at the moment the information about the event becomes known to the market
One concern that complicates event studies arises from leakage of information
Leakage occurs when info is released to a small group of investors before ocial public release
Any abnormal return on announcement date is then poor indicator of total impact of info release
Better indicator: Cumulative abnormal return (
2
(e
i
) = Estimates of the variance of the residuals for each of the 100 stocks
Estimating the SML
Now view Eq. (1) as security market line (SML) with 100 observations for stocks in sample
Estimate
0
/
1
in 2nd-pass regression with estimates b
i
from 1st-pass as independent variable:
r
i
r
f
=
0
+
1
b
i
i = 1, , 100 (2)
If the CAPM is valid, then
0
= 0 and
1
= r
M
r
f
57
The key property of the expected return-beta relationship described by SML is that expected
excess return on securities is determined only by systematic risk (measured by ) and should
be independent of nonsystematic risk, measured by the variance of residuals
2
(e
i
)
These estimates can be added as a variable in Eq. (2) of an expanded SML:
r
i
r
f
=
0
+
1
b
i
+
2
2
(e
i
) (3)
This 2nd-pass regression is estimated with hypotheses:
0
= 0,
1
= r
M
r
f
, and
2
= 0
The hypothesis that
2
= 0 is consistent with the notion that nonsystematic risk should not
be priced, i.e. there is no risk premium earned for bearing nonsystematic risk
More generally, according to the CAPM, the risk premium depends only on beta
Any additional RHS variable in Eq. (3) beyond beta should have a coecient that is
insignicantly dierent from zero in the second-pass regression
Tests of the CAPM
Early tests of CAPM by Lintner, and replicated by Miller/Scholes used annual data on 631 NYSE
stocks for 1954-63. Results inconsistent with CAPM:
The estimated SML is too at: The
1
coecient is too small
SML intercept
0
(should be zero) is 20 its standard error
Two-stage procedure is straightforward, and rejection of CAPM using this approach is disappointing
However, there are several diculties with this approach:
1. Stock returns are extremely volatile, which lessens the precision of any tests of average return
2. The market index used in the tests is surely not the market portfolio of the CAPM
3. In light of asset volatility, security s from 1st-stage regressions are estimated with substantial
sampling error and therefore cannot readily be used as inputs to 2nd-stage regression
4. Investors cannot borrow at the risk-free rate, as assumed by the simple version of CAPM
The market index
In what has become known as Rolls critique, Richard Rolls pointed out that:
1. Single testable CAPM hypothesis: Market portfolio is mean-variance ecient
2. All other implications follow from market portfolios eciency Not independently testable
3. In any sample of individual returns, there are an innite number of ex post M-V ecient
portfolios using sample-period returns and covariances. Sample s calculated between each
such portfolio/individual assets are exactly linearly related to sample avg. returns
If s are calculated against such portfolios, they will satisfy the SML relation exactly
whether or not the true market portfolio is mean-variance ecient in an ex ante sense
4. CAPM not testable unless we know exact composition of true market portfolio
The theory is not testable unless all individual assets are included in the sample
5. Using a proxy such as the S&P 500 for the market portfolio is subject to two diculties:
(a) Proxy itself might be M-V ecient even when true market portfolio is not. Conversely,
proxy may be inecient, but true market portfolio could still be ecient
(b) Most reasonable market proxies will be very highly correlated with each other and with
the true market portfolio whether or not they are mean-variance ecient
This problem is referred to as benchmark error, because it refers to the use of an incorrect
benchmark (market proxy) portfolio in the tests of the theory
Roll and Ross expanded Rolls critique, arguing that tests that reject positive relationship between
avg. return/ point to ineciency of market proxy, rather than refuting CAPM relationship
Plausible that even highly diversied portfolios, such as value-/equally weighted portfolios of
all stocks in the sample, fail to produce signicant avg. return- relationship
Kandel and Stambaugh considered the properties of the usual two-pass test of the CAPM in an
environment in which borrowing is restricted but the zero-beta version of the CAPM holds
The expected return-beta relationship describes the expected returns on a stock i, a portfolio
E on the ecient frontier, and that portfolios zero-beta companion Z:
E(r
i
) E(r
Z
) =
i
[E(r
E
) E(r
Z
)] (4)
58
Kandel and Stambaugh asked what would happen if we followed the common procedure of
using market proxy portfolio M in place of E, and used the more ecient generalized least
squares regression in estimating 2nd-pass regression for zero-beta version of CAPM:
r
i
r
Z
=
0
+
1
(Estimated
i
)
Results: Estimates of
0
/
1
biased by term relative eciency of market proxy
2nd-pass regression provides a poor test of CAPM if market proxy is not ecient
Unfortunately, it is dicult to tell how ecient our market index is relative to the theoretical
true market portfolio, so we cannot tell how good our tests are
Measurement error in beta
Suppose that we could get past Rolls problem by obtaining data on the returns of the true market
portfolio. We still would have to deal with the statistical problems caused by measurement error
in the estimates of beta from the rst-stage regressions
Well known in statistics that if RHS variable of a regression is measured with error (here ), then
the slope coecient of the regression is biased downward and the intercept biased upward
Consistent with
0
higher than predicted by CAPM and
1
lower than predicted
Miller and Scholes conrmed these arguments in a well-controlled simulation:
They constructed stock returns obeying SML/agreeing with CAPM using a random generator
They used those simulated rates of returns in a test of CAPM as if they were from real data
They obtained the same test results as with real data, i.e. a rejection of CAPM
Next wave of tests designed to overcome the measurement error problem
Innovation (Black, Jensen, and Scholes): Use portfolios rather than individual securities
Combining securities into portfolios diversies away most of the rm-specic part of returns, thereby
enhancing precision of estimates and expected rate of return of the portfolio
This mitigates the statistical problems that arise from measurement error in the beta estimates
However, combining stocks into portfolios reduces the number of observations left for 2nd-pass
To get best of trade-o: Construct portfolios with largest possible dispersion of coecients
Rather than allocate 20 stocks to each portfolio randomly, we can rank portfolios by betas
Widely spaced betas will yield reasonably powerful tests of the SML
Fama and MacBeth used this to verify that the observed relationship between avg. excess returns and
is indeed linear and that nonsystematic risk does not explain avg. excess returns
Using 20 portfolios constructed according to the Black, Jensen, and Scholes methodology, Fama
and MacBeth expanded the estimation of the SML equation to include:
The square of coecient (test for linearity of relationship between returns and s)
The estimated std. dev. of the residual (test for explanatory power of nonsystematic risk)
For a sequence of many subperiods, they estimated for each subperiod the equation:
r
i
=
0
+
1
i
+
2
2
i
+
3
(e
i
) (5)
According to CAPM, both
2
/
3
should have zero coecients in 2nd-pass regression (conrmed)
With respect to the expected return-beta relationship, however, the picture is mixed:
The estimated SML is too at, consistent with previous studies
1
is, on average, less than r
M
r
f
On the + side, dierence does not appear to be signicant CAPM not clearly rejected
In conclusion, these tests of the CAPM provide mixed evidence on the validity of the theory:
1. The insights that are supported by the single-factor CAPM and APT are:
(a) Expected rates of return are linear and increase with beta, the measure of systematic risk
(b) Expected rates of return are not aected by nonsystematic risk
2. The single-variable expected return-beta relationship predicted by either the risk-free rate or the
zero-beta version of the CAPM is not fully consistent with empirical observation
CAPM qualitatively correct ( matters, (e
i
) does not), but tests invalidate quantitative predictions
59
The EMH and the CAPM
Roll: CAPM and the expected return-beta relationship follow directly from eciency of the market
portfolio If we establish that market portfolio is ecient, we prove CAPM equation
Proxies for market portfolio (S&P 500, NYSE index) have proven hard to beat by investors
Strongest evidence for the empirical content of the CAPM and APT
Accounting for human capital and cyclical variations in asset betas
Two important deciencies of the tests of the single-index models:
1. Not all assets traded in capital markets. Most important nontraded asset is human capital
2. There is ample evidence that asset betas are cyclical
One of the CAPM assumptions is that all assets are traded and accessible to all investors
Mayer proposed a version of the CAPM that accounts for a violation of this assumption
This requires an additional term in the expected return-beta relationship
An important nontraded asset is human capital
The value of future wages and compensation for expert services is a signicant component of
the wealth of investors who expect years of productive careers prior to retirement
Moreover, it is reasonable to expect that changes in human capital are far less than perfectly
correlated with asset returns, and hence they diversify the risk of investor portfolios
Jagannathan and Wang used a proxy for changes in the value of human capital based on the rate
of change in aggregate labor income
In addition to the standard security betas estimated using the value-weighted stock market
index
vw
, they also estimated s of assets with respect to labor income growth
labor
They considered the possibility that business cycles aect asset betas
They used the dierence between the yields on low-/high-grade corporate bonds as proxy for
state of business cycle and estimate asset s relative to business cycle variable
prem
With estimates of these three s for several stock portfolios, they estimated a 2nd-pass re-
gression which includes rm size (market value of equity, denoted ME):
E(R
i
) = c
0
+c
size
log(ME) +c
vw
vw
+c
prem
prem
+c
labor
labor
(6)
Jagannathan/Wang chose indirect way to add cyclical component to expected return-beta
relationship, as default premium can be very dierent from expected return on market index
Eq. (6) combines the labor factor that would be a natural ICAPM factor with two other factors
(size and the default premium) that might be justied on APT grounds
Jagannathan and Wang test their model with 100 portfolios that are designed to spread se-
curities on the basis of size and beta. Stocks are sorted to 10 size portfolios, and the stocks
within each size decile are further sorted by beta into 10 subportfolios
Size explains variations in average returns quite well while beta does not
Substituting the default premium and labor income for size results in a similar increase in
explanatory power, but the CAPM expected return-beta relationship is not redeemed
Default premium is signicant, while labor income is borderline signicant. When we add size
as well, we nd it is no longer signicant and only marginally increases explanatory power
Conclusions from the Jagannathan and Wang study
1. Conventional 1st-pass estimates of security betas are greatly decient. They do not fully
capture cyclicality of returns and thus do not accurately measure systematic risk of stock
2. Human capital important in any version of CAPM that better explains systematic risk
Accounting for nontraded business
Heaton and Lucas estimate the importance of proprietary business in portfolio choice
We expect that private-business owners will reduce demand for traded securities that are
positively correlated with their specic entrepreneurial income
If this eect is suciently important, aggregate demand for traded securities will be determined
in part by the covariance with aggregate noncorporate business income
Higher risk premium on securities with high covariance with noncorporate business income
60
Consistent with theory, Heaton and Lucas nd that households with higher investments in private
business do in fact reduce the fraction of total wealth invested in equity
In their regression analysis, allocation of overall portfolio to stocks is the dependent variable
The share of private business in total wealth (aka relative business) receives negative and
statistically signicant coecients
Risk-attitude (self-reported risk aversion) also receives a negative/signicant coecient
Heaton/Lucas extend Jagannathan/Wangs equation to include rate of change in proprietary-
business wealth This variable also is signicant and improves explanatory power of regression
Tests of multifactor CAPM and APT
Multifactor CAPM/APT are elegant theories, but provide little guidance concerning which factors
(sources of risk) ought to result in risk premiums. A test would require three stages:
1. Specication of risk factors
2. Identication of portfolios that hedge these fundamental risk factors
3. Test of the explanatory power and risk premiums of the hedge portfolios
A macro factor model
Chen, Roll, and Ross identify several possible variables that might proxy for systematic factors:
IP = Growth rate in industrial production
EI = Changes in expected ination measured by changes in short-term (T-bill) rates
UI = Unexpected ination = Actual expected ination
CG = Unexpected changes in RP = Returns on corporate Baa bonds long-term gov. bonds
GB = Unexpected changes in the term premium = Returns on long- short-term gov. bonds
By using these factors, they implicitly assumed that factor portfolios exist that proxy for factors
A critical part of the methodology is the grouping of stocks into portfolios
In single-factor tests, portfolios constructed to span wide range of s to enhance power of test
In a multifactor framework the ecient criterion for grouping is less obvious
Chen, Roll, and Ross chose to group sample stocks into 20 portfolios by size (market value)
They rst used 5 years of monthly data to estimate the factor betas of the 20 portfolios in a
rst-pass regression: For each portfolio, estimate:
r = a +
M
r
M
+
IP
IP +
EI
EI +
UI
UI +
CG
CG+
GB
GB +e (7)
Used as market index both value-weighted NYSE (VWNY) and equal weight NYSE (EWNY)
Using the 20 sets of rst-pass estimates of factor betas as the independent variables, they now
estimated the second-pass regression (with 20 observations, one for each portfolio):
r =
0
+
M
M
+
IP
IP
+
EI
EI
+
UI
UI
+
CG
CG
+
GB
GB
+e (8)
Where the gammas become estimates of the risk premiums on the factors
Chen, Roll, and Ross ran this second-pass regression for every month of their sample period,
reestimating the rst-pass factor betas once every 12 months. The estimated risk premiums (the
values for the parameters ) were averaged over all the second-pass regressions
Industrial production (IP), the risk premium on corporate bonds (CG), and unanticipated ination
(UI) are the factors that appear to have signicant explanatory power
The Fama-French three-factor model
Systematic factors in Fama-French model: Firm size, book-to-market ratio and market index
These additional factors are empirically motivated by the observations that historical average re-
turns on stocks of small rms and on stocks with high ratios of book equity to market equity (B/M)
are higher than predicted by the security market line of the CAPM
This suggests that size or book-to-market ratio may be proxies for exposures to sources of systematic
risk not captured by CAPM and results in return premiums associated with these factors
They measure the size factor in each period as dierential return on small vs. large rms
61
Similarly, the other extra-market factor is typically measured as the return on rms with high book-to-
market ratios minus that on rms with low ratios (HML)
E(r
i
) r
f
= a
i
+b
i
[E(r
M
) r
f
] +s
i
E[SMB] +h
i
E[HML] (9)
b
i
, s
i
and h
i
are the betas of the stock on each of the three factors
If these factors fully explain asset returns, the intercept a
i
of the equation should be zero
Tracking portfolios
To create portfolios that track the size and book-to-market factors, Davis, Fama, and French sort
industrial rms by size (market cap) and by book-to-market (B/M) ratio
Size premium SMB constructed as dierence in returns between smallest/largest third of rms
HML in each period is the dierence in returns between high and low book-to-market rms
They use a broad market index, the value-weighted return on all stocks traded on US national
exchanges (NYSE, AMEX, and NASDAQ) to compute the excess return on the market portfolio
relative to the risk-free rate, taken to be the return on 1-month T-bills
Test: Davis, Fama, and French form nine portfolios with a range of sensitivities to each factor
They construct the portfolios by sorting rms into three size groups (small S, medium M, and big
B) and three book-to-market groups (high H, medium M, and low L)
For each of these nine portfolios, they estimate Eq. (9) as a rst-pass regression over the 816
months between 1929 and 1997 by using the regression model:
r
i
r
f
= a
i
+b
i
(r
M
r
f
) +s
i
SMB +h
i
HML +e
i
(10)
Intercepts (a
i
) are small and statistically insignicant, with t-statistics 2
Large R
2
statistics Returns well explained by three-factor portfolios, and large t-statistics on
size/value loadings show that these factors contribute signicantly to explanatory power
How should we interpret these tests?
Size and relative value (B/M ratio) proxy for risks not fully captured by CAPM beta
Consistent with the APT in that it implies that size and value are priced risk factors
Another explanation attributes these premiums to investor irrationality/behavioral biases
Risk-based interpretations
Liew and Vassalou show that returns on style portfolios (HML or SMB) seem to predict GDP
growth, and thus may in fact capture some aspects of business cycle risk
Returns on HML/SMB portfolios are positively related to future growth in macroeconomy
Petkova and Zhang also try to tie avg. return premium on value portfolios to risk premiums
Conditional CAPM, allowing both market RP and rm s to vary over time, and to co-vary
What might lead to such an association between beta and the market risk premium?
Irreversible investments: Value rms (high B/M ratios) have more tangible capital
Investment irreversibility puts such rms more at risk for economic downturns because
they will suer from excess capacity from assets already in place
In contrast, growth rms better able to deal with downturn by deferring investment plans
Moreover, evidence suggests that market risk premium also is higher in down markets
These factors imparts a positive correlation between of high B/M rms and market RP
Petkova/Zhang t both beta/market risk premium to a set of state variables (variable that
summarizes the state of the economy):
DIV = Market dividend yield
DEFLT = Default spread on corporate bonds (Baa - Aaa rates)
TERM = Term structure spread (10-yr - 1-yr Treasury rates)
TB = 1-month T-bill rate
They estimate a rst-pass regression, but rst substitute these state variables for beta:
r
HML
= +r
Mt
+e
i
= + [b
0
+b
1
DIV
t
+b
2
DEFLT
t
+b
3
TERM
t
+b
4
TB
t
]r
Mt
+e
i
Where
t
= b
0
+b
1
DIV
t
+b
2
DEFLT
t
+b
3
TERM
t
+b
4
TB
t
is a time-varying beta
62
Similarly, the determinants of a time-varying market risk premium, using same state variables:
r
Mkt,t
r
ft
= c
0
+c
1
DIV
t
+c
2
DEFLT
t
+c
3
TERM
t
+c
4
TB
t
+e
t
Petkova and Zhang examine the relationship between beta and the market risk premium:
They dene the state of economy by the size of the premium
A peak is dened as the periods with the 10% lowest risk premiums
A trough has the 10% highest risk premiums
The results support the notion of a counter-cyclical value beta: The beta of the HML
portfolio is negative in good economies, meaning that the beta of value stocks (high book-
to-market) is less than that of growth stocks (low B/M). Reverse true in recessions
Behavioral explanations
Several authors make the case that the value premium is a manifestation of market irrationality
The essence of the argument is that analysts tend to extrapolate recent performance too far out
into the future, and thus tend to overestimate the value of rms with good recent performance
Chan, Karceski, and Lakonishok makes the case for overreaction:
Firms are sorted into deciles based on income growth in the past 5 years
Book-to-market ratio for each decile at end of 5-year period tracks recent growth very well
B/M falls steadily with growth over past 5 years
This is evidence that past growth is extrapolated and then impounded in price
But B/M at beginning of 5-year period shows little or even a positive association with subse-
quent growth Market capitalization today is inversely related to growth prospects
Implication: Market ignores evidence that past growth cannot be extrapolated into future
B/M reects past growth better than future growth, consistent with extrapolation error
La Porta et al. examine stock performance when actual earnings are released to public
Firms are classied as growth versus value stocks
Growth stocks underperform value stocks surrounding these announcements
When news of earnings is released, market is disappointed in stocks priced as growth rms
Liquidity and asset pricing
Measuring liquidity is hard. Eect of liquidity on expected return is composed of two factors:
1. Transaction costs that are dominated by the bid-ask spread
2. Liquidity risk resulting from covariance between changes in asset liquidity cost with both changes
in market-index liquidity cost and with market-index rates of return
Both factors are unobservable and their eect on equilibrium rates of return is hard to estimate
Observed (inside) bid-ask spreads apply only to small trades and therefore may be highly unreliable
indicators of trading costs for larger transactions
Most studies of liquidity instead use a proxy variable that can distinguish liquidity costs across rms
and then calibrate the distribution of such costs to average observed spreads
One widely used measure of illiquidity cost was proposed by Amihud:
ILLIQ = Monthly average of daily
_
Absolute value(Stock return)
Dollar volume
_
This measure of illiquidity is based on the price impact per dollar of transactions in the stock and can
be used to estimate both liquidity cost and liquidity risk
Acharya and Pedersen calculate this statistic C
it
for stock i in month t, using daily data for 1963-99
The market measure of illiquidity C
Mt
is the average value of C
it
over all stocks in month t
Using C
it
, C
Mt
, the excess returns R
it
(net of C
it
) on each stock, as well as the market excess
return R
Mt
(net of C
Mt
), they calculate the market beta
M
and three illiquidity s:
1. The sensitivity of asset illiquidity to market illiquidity:
L1
2. The sensitivity of stock returns to market illiquidity:
L2
3. The sensitivity of stock illiquidity to the market return:
L3
63
To reduce errors in measurement, they form 25 portfolios sorted from low to high illiquidity
Results:
Liquidity is correlated with all three Fama-French factors, suggesting that some of the predic-
tive power of the FF factors for average returns may in fact be liquidity related
Average excess return increases with portfolio illiquidity, although part of that increase is
attributable to the higher systematic risk associated with higher illiquidity
Liquidity cost (eective bid-ask spread) increases a lot with illiquidity, up to 8.83%/month
After adjustment for the more typical holding periods, as reected by monthly turnover, this
cost is economically signicant for illiquid portfolios, on the order of 2.5% per year
Liquidity s are small relative to market and highly collinear, but still signicantly improve
explanatory power of a CAPM augmented with liquidity considerations
Acharya and Pedersens work establishes important point that liquidity is a priced factor
Time-varying volatility
We may associate the variance of the rate of return on the stock with the rate of arrival of new information
because new information may lead investors to revise their assessment of intrinsic value
The rate of arrival of new information is time varying
Expect variances of rates of return on stocks (and covariances among them) to be time varying
When we consider a time-varying return distribution, we must refer to the conditional mean, variance,
and covariance, that is, the mean, variance, or covariance conditional on currently available info
Conditions that vary over time are values of variables that determine level of these parameters
In contrast, the usual estimate of return variance, the average of squared deviations over the sample
period, provides an unconditional estimate (treats variance as constant over time)
Engle ARCH model - Autoregressive conditional heteroskedasticity
Based on the idea that a natural way to update a variance forecast is to average it with the most
recent squared surprise (i.e., the squared deviation of the rate of return from its mean)
Engle GARCH model - Generalized autoregressive conditional heteroskedasticity
Most widely used model to estimate conditional variance of stocks/stock-index returns
Allows greater exibility in the specication of how volatility evolves over time
Uses rate-of-return history as the information set used to form our estimates of variance
The model posits that the forecast of market volatility evolves relatively smoothly each period in
response to new observations on market returns
The updated estimate of market-return variance in each period depends on both the previous
estimate and the most recent squared residual return on the market
This technique essentially mixes in a statistically ecient manner the previous volatility estimate
with an unbiased estimate based on new observation of market return. Updating formula:
2
t
= a
0
+a
1
2
t1
+a
2
2
t1
(11)
Updated forecast = function of most recent variance forecast
2
t1
and most recent squared predic-
tion error in market return
2
t1
. Parameters a
0
, a
1
, and a
2
estimated from past data
ARCH-type models clearly capture much of the variation in stock market volatility
Consumption-based asset pricing and the equity premium puzzle
Equity premium puzzle
Historical excess returns on risky assets in the US are too large to be consistent with economic
theory and reasonable levels of risk aversion
This observation has come to be known as the equity premium puzzle
Suggests that forecasts of the market risk premium should be lower than historical averages
Consumption growth and market rates of return
ICAPM derived from lifetime consumption/investment plan of representative investor
Each individuals plan is set to maximize a utility function of lifetime consumption, and con-
sumption/investment in each period is based on age and current wealth, as well as the risk-free
rate and the market portfolios risk and risk premium
64
What matters to investors is not their wealth per se, but their lifetime ow of consumption
There can be slippage between wealth and consumption due to variation in factors such as the
risk-free rate, the market portfolio risk premium, or prices of major consumption items
Better measure of consumer well-being than wealth is consumption ow that wealth can support
Given this framework, the generalization of the basic CAPM is that instead of measuring security
risk based on the covariance of returns with the market return (focuses only on wealth), we are
better o using the covariance of returns with aggregate consumption
Hence, we expect the risk premium of the market index to be related to that covariance as:
E(r
M
) r
f
= ACov(r
M
, r
C
) (12)
Where A depends on avg. coe of risk aversion and r
C
is the rate of return on a consumption-
tracking portfolio (highest possible correlation with growth in aggregate consumption)
The rst wave of attempts to estimate consumption-based asset pricing models used consumption
data directly rather than returns on consumption-tracking portfolios
By and large, these tests found the CCAPM no better than the conventional CAPM
Diculty: Consumption data are collected far less frequently and with substantial error
Recent research improves the quality of estimation in several ways:
1. Rather than using consumption growth directly, it uses consumption-tracking portfolios:
Data on aggregate consumption used only to construct consumption-tracking portfolio
Frequent/accurate data on the return on these portfolios then used to test model
2. Investors adjust consumption levels most substantially in 4-th quarter of calendar year
Data from other quarters obscure the reaction of annual consumption to annual returns
3. To improve the models power to explain returns, some newer studies allow for several classes
of investors with dierences in wealth and consumption behavior
They also may separate expenditures on consumer nondurables versus durable goods
Standard CCAPM focuses on representative consumer/investor, thereby ignoring info about
heterogeneous investors with dierent levels of wealth/consumption habits
Jagannathan and Wang study
Focus on year-over-year 4-th quarter consumption, employs consumption-mimicking portfolio
Find that annual consumption growth measured by comparing 4-th quarter data in successive
years is substantially better than other intervals in explaining portfolio returns
Show that FF factors are associated with consumption s as well as excess returns
High book-to-market ratio is associated with higher consumption beta
Larger rm size is associated with lower consumption beta
The suggestion is that the explanatory power of the Fama-French factors for average returns
may in fact reect the diering consumption risk of those portfolios
Other tests reported by Jagannathan/Wang show that CCAPM explains returns even better
than FF three-factor model, which in turn is superior to single-factor CAPM
The equity premium puzzle refers to the fact that using reasonable estimates of A, the covariance
of consumption growth with the market-index return Cov(r
M
, r
C
) is far too low to justify observed
historical average excess returns on the market-index portfolio
The equity premium puzzle can be interpreted in several ways:
Focus on observed historical returns: Does Eq. (12) fail empirical tests because those historical
returns were not representative of investors expectations at the time?
Conicting interpretation: Puzzle is real, and is yet another nail in CAPM con
Third approach: Extensions of CAPM may resolve the puzzle
4-th interpretation from behavioral nance: Pins the puzzle on irrational behavior
Expected versus realized returns
Fama and French oer another interpretation of the equity premium puzzle:
Using stock index returns from 1872-1999, they report avg. risk-free rate, avg. ROE (repre-
sented by S&P 500), and resultant risk premium for overall period and subperiods
65
The big increase in the average excess return on equity after 1949 suggests that the equity
premium puzzle is largely a creature of modern times
FF suspect that estimating the risk premium from avg. realized returns may be the problem
Using constant-growth DDM to estimate expected returns, they nd that for 1872-1949, DDM
yields similar estimates of expected risk premium as avg. realized excess return
But for 1950-99, DDM yields much smaller risk premium High avg. excess return in this
period exceeded returns investors expected to earn at the time
In constant-growth DDM, expected capital gains rate = growth rate of dividends
The expected total return on the rms stock will be the sum of dividend yield (divi-
dend/price) plus the expected dividend growth rate g:
E(r) =
D
1
P
0
+g where D
1
= end-of-year dividends and P
0
= current stock price (13)
For any sample period t = 1, , T, FF estimate expected return from arithmetic avg. of
dividend yield (D
t
/P
t1
) plus dividend growth rate (g
t
= D
t
/D
t1
1)
In contrast, realized return = dividend yield plus rate of capital gains (P
t
/P
t1
1)
Because dividend yield is common to both estimates, the dierence between expected/realized
return equals the dierence between the dividend growth and capital gains rates
Capital gains signicantly exceeded the dividend growth rate in modern times
FF conclusion: Equity premium puzzle due to unanticipated capital gains in latter period
FF argue that dividend growth rates produce more reliable estimates of capital gains investors
actually expected to earn than the avg. of their realized capital gains. Three reasons:
1. Avg. realized returns over 1950-99 exceeded IRR on corp. investments. If those avg. returns
were representative of expectations, then rms were engaging in negative-NPV investments
2. Statistical precision of DDM estimates far higher than using avg. historical returns
3. The reward-to-volatility (Sharpe) ratio derived from DDM far more stable than that from
realized returns. If risk aversion constant, then Sharpe ratio should be stable
Fama and Frenchs study provides a simple explanation for the equity premium puzzle:
Observed rates of return in the recent half-century were unexpectedly high
Implies that forecasts of future excess returns will be lower than past averages
Goetzmann and Ibbotson lends support to Fama and Frenchs argument
They extend data on rates of return on stocks/long-term corporate bonds back to 1792
These statistics suggest a risk premium that is much lower than the historical average for
1926-2005, which is the period that produces the equity premium puzzle
Survivorship bias
The equity premium puzzle emerged from long-term averages of US stock returns. There are
reasons to suspect that these estimates of the risk premium are subject to survivorship bias
Jurion and Goetzmann: Database of capital appreciation indexes for 39 stock markets over 1921-96
US equities had highest real return of all countries (4.3% annually vs. 0.8% median)
Unlike US, many countries had equity markets that closed (permanently/extended time)
Using avg. US data leads to survivorship bias for estimate of expected returns
Estimating risk premiums from experience of most successful country and ignoring evidence from
stock markets that did not survive for full period impart an upward bias in expected returns
The high realized equity premium obtained for US may not be indicative of required returns
Extensions to the CAPM may resolve the equity premium puzzle
Constantinides argues that the standard CAPM can be extended to account for observed excess
returns by relaxing some of its assumptions, in particular, by recognizing that consumers face
uninsurable and idiosyncratic income shocks, e.g., the loss of employment
In addition, life-cycle considerations are important and often overlooked
Borrowing constraints become important when placed in the context of the life cycle
The imaginary representative consumer does not face borrowing constraints
Young consumers, however, do face meaningful borrowing constraints
66
Constantinides traces their impact on the equity premium, the demand for bonds, and on the
limited participation of many consumers in the capital markets
Adding habit formation to conventional utility function helps explain higher risk premiums than
those obtained by covariance of stock returns with aggregate consumption growth
Constantinides argues for integrating notions of habit formation, incomplete markets, life cycle,
borrowing constraints, and sources of limited stock market participation
Behavioral explanations of the equity premium puzzle
Barberis and Huang explain the puzzle as an outcome of irrational investor behavior
Key elements: Loss aversion and narrow framing
Narrow framing is the idea that investors evaluate every risk they face in isolation
Investors ignore low correlation of the risk of a stock portfolio with other components of
wealth, and therefore require a higher risk premium than rational models would predict
Combined with loss aversion, investor behavior will generate large risk premiums despite the
fact that traditionally measured risk aversion is plausibly low
Incorporating these eects generates large equilibrium equity risk premium and low/stable risk-free
rate, even when consumption growth is smooth/only weakly correlated with stock market
Analysis for the equity premium also has implications for the stock market participation puzzle
When accounting for heterogeneity of preferences, behavioral approach explains why segment
of population that should participate in stock market despite frictions/other rational explana-
tions, still avoids it
67
68
B Fixed Income Securities
BKM - Ch. 14: Bond prices and yields
Introduction
Debt securities
A debt security is a claim on a specied periodic stream of income
Often called xed-income securities because they promise either a xed stream of income or a
stream of income that is determined according to a specied formula
Risk considerations are minimal as long as the issuer of the security is suciently creditworthy
The bond is the basic debt security
Includes Treasury, corporate, and international bonds
Bond prices are set in accordance with market interest rates
Measures of bond returns: (i) Yield to maturity (YTM), (ii) Yield to call, (iii) Holding-period
return (HPR), or (iv) Realized compound rate of return
Important to consider the impact of default or credit risk on bond pricing and look at the deter-
minants of credit risk and the default premium built into bond yields
Bond characteristics
A bond is a security that is issued in connection with a borrowing arrangement
Borrower issues (sells) bond to lender for some cash. Bond is the IOU of the borrower
The arrangement obligates issuer to make specied payments to bondholder on specied dates
Typical coupon bond obligates issuer to make semiannual payments of interest for life of bond
When bond matures, issuer repays debt by paying bondholder the bond s par value (face value)
The coupon rate of the bond serves to determine the interest payment
The annual payment is the coupon rate times the bonds par value
The coupon rate, maturity date, and par value of the bond are part of the bond indenture, which
is the contract between the issuer and the bondholder
Sometimes, zero-coupon bonds are issued that make no coupon payments
In this case, investors receive par value at maturity date but no interest payments until then
The bond has a coupon rate of zero
These bonds are issued at prices considerably below par value, and the investors return comes
solely from the dierence between issue price and the payment of par value at maturity
Treasury bonds and notes
Generalities
Treasury note maturities up to 10 years, while Treasury bonds have 10-30 years maturities
Bonds/notes issued in denominations of $1,000 or more (minimum reduced to $100 in 2008)
Both make semiannual coupon payments
Bonds prices are quoted as a percentage of par value
The yield to maturity (YTM) is a measure of the average rate of return to an investor who
purchases the bond for the asked price and holds it until its maturity date
Accrued interest and quoted bond prices
Bond prices quoted in nancial pages are not actually prices that investors pay for the bond
because quoted price does not include interest that accrues between coupon payment dates
If a bond is purchased between coupon payments, the buyer must pay the seller for accrued
interest, the prorated share of the upcoming semiannual coupon
The sale, or invoice, price of the bond would equal the stated price plus the accrued interest:
Accrued interest =
Annual coupon payment
2
Days since last coupon payment
Days separating coupon payments
Corporate bonds
Although some bonds trade on a formal exchange operated by NYSE, most bonds are traded
over-the-counter in a network of bond dealers linked by a computer quotation system
Safer bonds with higher ratings promise lower YTM than other bonds with similar maturities
71
Call provisions on corporate bonds
Although Treasury no longer issues callable bonds, some corporate bonds are issued with call
provisions allowing issuer to repurchase bond at specied call price before maturity
E.g., if a company issues a bond with a high coupon rate when market interest rates are high,
and interest rates later fall, the rm might like to retire the high-coupon debt and issue new
bonds at a lower coupon rate to reduce interest payments (aka refunding)
Callable bonds typically come with a period of call protection, an initial time during which
the bonds are not callable (aka deferred callable bonds)
To compensate investors, callable bonds issued with higher coupons/promised YTM
Convertible bonds
Convertible bonds give bondholders an option to exchange each bond for a specied number
of shares of common stock of the rm
The conversion ratio is the number of shares for which each bond may be exchanged
Market conversion value = current value of shares for which bonds may be exchanged
The conversion premium is the excess of the bond value over its conversion value
Convertible bondholders benet from price appreciation of the companys stock
Convertible bonds oer lower coupon rates/promised YTM
Puttable bonds
Extendable/put bond gives bondholder the option to extend/retire the bond
If bonds coupon rate exceeds current market yields, bondholder will extend bonds life
Floating-rate bonds
Floating-rate bonds make interest payments tied to some measure of current market rates
The major risk involved in oaters has to do with changes in the rms nancial strength
The yield spread is xed over the life of the security. If nancial health of rm deteriorates,
then investors will demand a greater yield premium than oered by the security
Coupon rate adjusts to changes in level of market rates, but not to rm nancial condition
Preferred stock
Although preferred stock is (strictly) equity, it is often included in xed-income universe
Like bonds, preferred stock promises to pay a specied stream of dividends
However, unlike bonds, the failure to pay the promised dividend does not result in corporate
bankruptcy. Instead, the dividends owed simply cumulate
In the event of bankruptcy, preferred stockholders claims to the rms assets have lower priority
than those of bondholders, but higher priority than those of common stockholders
Preferred stock commonly pays a xed dividend Perpetuity, providing level CF indenitely
Floating-rate preferred stock is much like oating-rate bonds: The dividend rate is linked to a
measure of current market interest rates and is adjusted at regular intervals
Unlike interest on bonds, dividends on preferred stock are not tax-deductible expenses to the rm
This reduces their attractiveness as a source of capital to issuing rms
On the other hand, there is an osetting tax advantage to preferred stock
When rm buys preferred stock of another rm, tax on only 30% of dividends received
Preferred stock rarely gives its holders full voting privileges in the rm. However, if the preferred
dividend is skipped, the preferred stockholders may then be provided some voting power
Other issuers
State/local gov. issue municipal bonds. Outstanding feature: Interest payments are tax-free
Gov. agencies such as Federal Home Loan Bank Board, Farm Credit agencies, and mortgage
pass-through agencies Ginnie Mae/Fannie Mae/Freddie Mac also issue bonds
International bonds
International bonds are commonly divided into: (i) Foreign bonds and (ii) Eurobonds
Foreign bonds are issued by borrower from country other than the one in which bond is sold
The bond is denominated in the currency of the country in which it is marketed
E.g., German rm sells dollar-denominated bond in the US Foreign bond
72
Eurobonds are bonds issued in the currency of one country but sold in other national markets
E.g., the Eurodollar market refers to dollar-denominated bonds sold outside the US
Because Eurodollar market falls outside US jurisdiction, such bonds not regulated by US Feds
Similarly, Euroyen bonds are yen-denominated bonds selling outside Japan
Innovations in the bond market
Inverse oaters
The coupon rate on these bonds falls when the general level of interest rates rises
Investors in these bonds suer doubly when rates rise: Not only does the PV of each dollar of
CF from the bond fall as the discount rate rises, but the level of those CFs falls as well
Of course, investors in these bonds benet doubly when rates fall
Asset-backed bonds
The income from a specied group of assets is used to service the debt
E.g., mortgage-backed securities or securities backed by auto or credit card loans
Walt Disney issued bonds with coupon rates tied to nancial performance of its lms
Catastrophe bonds
These bonds are a way to transfer catastrophe risk from the rm to the capital markets
Investors in these bonds receive compensation in the form of higher coupon rates
But in the event of a catastrophe, the bondholders will give up all/part of their investments
Indexed bonds
Indexed bonds make payments tied to a general price index/price of a particular commodity
E.g., Treasury Ination Protected Securities (TIPS): By tying par value to general level of
prices, coupon payments/nal repayment of par value on these bonds increase CPI
Bond with 3yr maturity, par value of $1,000, and coupon rate of 4%. Assume annual
coupon payments and ination turns out to be 2%, 3%, and 1% in next 3 years
The nominal rate of return on the bond in the rst year is:
Nominal return =
Interest + Price Appreciation
Initial Price
=
40.80 + 20
1000
= 6.08%
The real rate of return is precisely the 4% real yield on the bond:
Real return =
1 + Nominal Return
1 + Ination
1 =
1.0608
1.02
1 = 4%
Bond pricing
Because bonds coupon/principal repayments occur in the future, the price an investor pays for a claim
to those payments depends on value of dollars to be received in future vs. dollars in hand today
This present value calculation depends in turn on market interest rates
In addition, because most bonds are not riskless, discount rate embodies an additional premium
that reects bond-specic characteristics such as default risk/liquidity/tax attributes/call risk
CFs from a bond consist of coupon payments until maturity date plus nal payment of par value
Bond value = Present value of coupons + Present value of par value
If we call the maturity date T and call the interest rate r, the bond value is:
Bond value =
T
t=1
Coupon
(1 +r)
t
+
Par value
(1 +r)
T
= Coupon
1
r
_
1
1
(1 +r)
T
_
+ Par value
1
(1 +r)
T
(1)
= Coupon Annuity factor(r, T) + Par value PV factor(r, T) (2)
At higher interest rate, PV of payments to be received is lower Bond price falls as interest rates rise
Convex shape of the bond price curve
Progressive increases in interest rate result in progressively smaller reductions in bond price
An increase in the interest rate results in a price decline that is smaller than the price gain resulting
from a decrease of equal magnitude in the interest rate
73
Corporate bonds typically are issued at par value
Underwriters of bond issue must choose a coupon rate that closely approximates market yields
If the coupon rate is inadequate, investors will not pay par value for the bonds
After bonds are issued, bondholders may buy/sell bonds in secondary markets (NYSE or OTC)
The inverse relationship between price and yield is a central feature of xed-income securities
Interest rate uctuations represent the main source of risk in the xed-income market
One key factor that determines that sensitivity is the maturity of the bond
The longer the maturity, the greater the sensitivity to uctuations in interest rates
Bond pricing between coupon dates
In principle, the fact that bond is between coupon dates does not aect pricing problem
The procedure is always the same: Compute the PV of each remaining payment and sum up
Bond prices are typically quoted net of accrued interest
These prices, which appear in the nancial press, are called at prices
Invoice price = Flat price + Accrued interest
Bond yields
The current yield of a bond measures only the cash income provided by the bond as a percentage of
bond price and ignores any prospective capital gains or losses
The yield to maturity is the standard measure of the total rate of return which accounts for both current
income and the price increase or decrease over the bonds life
Yield to maturity (YTM)
Dened as the interest rate that makes the PV of a bonds payments equal to its price
Measure of avg. rate of return earned if bond is bought now and held until maturity
To calculate the YTM, solve the bond price equation for the interest rate given the bonds price
The nancial press reports yields on an annualized basis, and annualizes the bonds semiannual
yield using simple interest techniques, resulting in an annual percentage rate, or APR
Yields annualized using simple interest are also called bond equivalent yields
The eective annual yield of the bond, however, accounts for compound interest
The bonds yield to maturity is the IRR on an investment in the bond
The yield to maturity can be interpreted as the compound rate of return over the life of the bond
under the assumption that all bond coupons can be reinvested at that yield
Yield to maturity is widely accepted as a proxy for average return
YTM diers from current yield = [bonds annual coupon payment] [bond price]
For premium bonds (bonds selling above par value), coupon rate is greater than current yield,
which in turn is greater than YTM because the YTM accounts for the built-in capital loss
For discount bonds (bonds selling below par value), these relationships are reversed
Yield to call
When interest rates fall, the PV of the bonds scheduled payments rises, but the call provision
allows the issuer to repurchase the bond at the call price
Call Price
10 9 8 7 6 5 4
1200
1500
900
600
300
Straight Bond
Callable Bond
P
r
i
c
e
s
(
$
)
Interest Rate (%)
Figure 1: Bond prices: Callable and straight debt
74
If call price < PV of scheduled payments, the issuer may call the bond back
At high interest rates, risk of call negligible because PV of scheduled payments < call price
The values of the straight and callable bonds converge
At lower rates, however, the values of the bonds begin to diverge, with the dierence reecting
the value of the rms option to reclaim the callable bond at the call price
At very low rates, PV of scheduled payments exceeds the call price, so the bond is called
Its value at this point is simply the call price
The yield to call is calculated just like the yield to maturity except that the time until call replaces
time until maturity, and the call price replaces the par value
Premium bonds selling near their call prices are especially apt to be called if rates fall further
Investors in premium bonds often more interested in bonds yield to call rather than YTM
Realized compound return versus yield to maturity
The yield to maturity will equal the rate of return realized over the life of the bond if all coupons
are reinvested at an interest rate equal to the bonds yield to maturity
Denoting V
f
the nal value of the investment (par value plus coupons payment bearing interest),
the realized compound return r is determined by: V
i
(1 +r)
n
= V
f
Problem with conventional yield to maturity when reinvestment rates can change over time: Con-
ventional yield to maturity will not equal realized compound return
Disadvantages of the realized compound return:
It cannot be computed in advance without a forecast of future reinvestment rates
Reduces much of the attraction of the realized return measure
Horizon analysis
Forecasting the realized compound yield over various holding periods/investment horizons
The forecast of total return depends on forecasts of both the price of the bond when sold at
the end of your horizon and the rate at which coupon income is reinvested
The sales price depends in turn on the yield to maturity at the horizon date
Longer investment horizon Reinvested coupons are larger component of nal proceeds
As interest rates change, bond investors are actually subject to two sources of osetting risk:
On the one hand, when rates rise, bond prices fall, which reduces the value of the portfolio
On the other hand, reinvested coupon income compounds faster at higher rates
This reinvestment rate risk will oset the impact of price risk
Bond prices over time
A bond will sell at par value when its coupon rate equals the market interest rate
The investor receives fair compensation for TVM with the recurring coupon payments
No further capital gain is necessary to provide fair compensation
When the coupon rate is lower than the market interest rate, the coupon payments alone will not provide
investors as high a return as they could earn elsewhere in the market
To receive a fair return, investors also need to earn price appreciation on their bonds
The bonds have to sell below par value to provide built-in capital gain on the investment
When bond prices are set according to PV formula, any discount from par value provides an anticipated
capital gain that augments a below-market coupon rate to provide a fair total rate of return
Conversely, if coupon rate exceeds market interest rate, the interest income by itself is greater than
elsewhere in market Investors bid up price of these bonds above their par values
As bonds approach maturity, they fall in value because fewer above-market coupon payments
remain Resulting capital losses oset large coupon payments Fair rate of return
Although capital gain vs. income components dier, bond prices are set to provide competitive rates
Yield to maturity versus holding-period return
When the YTM is unchanged over the period, the rate of return on the bond will equal that yield
However, unanticipated changes in market rates result in unanticipated changes in bond returns
Increase in bonds yield acts to reduce its price HPR will be less than initial yield
Conversely, a decline in yield will result in a holding-period return greater than the initial yield
75
Yield to maturity vs. holding-period return
YTM depends only on bonds coupon, current price, and par value at maturity
All of these values are observable today, so yield to maturity can be easily calculated
YTM = Measure of avg. rate of return if bond held until maturity
In contrast, HPR is the rate of return over a particular investment period and depends on the
market price of the bond at the end of that holding period
Bond prices respond to unanticipated changes in interest rates HPR can at most be forecast
Zero-coupon bonds and Treasury strips
Original-issue discount bonds
Less common than coupon bonds issued at par
Bonds issued intentionally with low coupon rates that cause bond to sell at a discount
An extreme example of this type of bond is the zero-coupon bond
Treasury bills are examples of short-term zero-coupon instruments
If face value = $10,000, Treasury issues it for less, agreeing to repay $10,000 at maturity
All of the investors return comes in the form of price appreciation
Longer-term zero-coupon bonds created from coupon-bearing notes/bonds with help of US Treasury
Dealer who purchases Treasury coupon bond may ask Treasury to break down CFs to be paid
into independent securities, where each security is one of the original bond payments
E.g., a 10-year coupon bond would be stripped of its 20 semiannual coupons, and each
coupon payment would be treated as a stand-alone zero-coupon bond
The maturities of these bonds would thus range from 6 months to 10 years
The nal payment of principal would be treated as another stand-alone zero-coupon security
Treasury program which perform coupon stripping: STRIPS (Separate Trading of Registered
Interest and Principal of Securities). Such zero-coupon securities Treasury strips
Prices of zeros as time passes
At maturity, zeros must sell for par value As time passes, price should approach par value
If the interest rate is constant, a zeros price will increase at exactly the rate of interest
Consider a zero with 30 years until maturity, and suppose the market interest rate is 10% per
year. The price of the bond today will be 1, 000/(1.10)
30
= 57.31
Next year, the price is 1, 000/(1.10)
29
= 63.04, a 10% increase over previous-year value
The bond prices rise exponentially, not linearly, until its maturity
After-tax returns
Tax authorities recognize that built-in price appreciation on original-issue discount (OID) bonds
such as zero-coupon bonds represents an implicit interest payment to security holder
Therefore, IRS calculates price appreciation schedule to impute taxable interest income for built-in
appreciation during tax year, even if asset is not sold/does not mature until future year
Interest income based on constant yield method (ignores changes in market interest rates)
Any additional gains or losses that arise from changes in market interest rates are treated as capital
gains or losses if the OID bond is sold during the tax year
Default risk and bond pricing
Introduction
US government bonds may be treated as free of default risk. Not true of corporate bonds
Bond default risk, aka credit risk, is measured by Moodys, S&P, and Fitch, which provide nancial
info on rms and quality ratings of large corporate/municipal bond issues
The top rating is AAA or Aaa, a designation awarded to only about a dozen rms
Moodys modies each rating class with a 1, 2, or 3 sux (e.g., Aaal, Aaa2, Aaa3) to provide
a ner gradation of ratings. The other agencies use a + or modication
Bonds rated BBB or above (S&P, Fitch) or Baa and above (Moodys) are considered investment-
grade bonds, whereas lower-rated bonds are classied as speculative-grade or junk bonds
Defaults on low-grade issues are not uncommon
E.g., almost half of bonds rated CCC by S&Ps at issue have defaulted within 10 years
76
Junk bonds
Junk bonds, aka high-yield bonds, are speculative-grade (low-rated/unrated) bonds
Before 1977, almost all junk bonds were fallen angels, i.e. bonds issued by rms that
originally had investment-grade ratings but that had since been downgraded
1977: Original-issue junk started: Lower-cost nancing alternative than bank borrowing
High-yield bonds notorious in 80s as nancing in leveraged buyouts/hostile takeover attempts
Shortly thereafter, the market suered. Legal diculties of Drexel/Mike Milken with Wall
Streets insider trading scandals of late 80s tainted junk bond market which nearly dried up
Since then, the market has rebounded dramatically
Determinants of bond safety
Bond rating agencies base their quality ratings largely on an analysis of the level and trend of some
of the issuers nancial ratios. The key ratios used to evaluate safety are:
1. Coverage ratios - Ratios of company earnings to xed costs
Times-interest-earned ratio: Ratio of EBIT to interest obligations
Fixed-charge coverage ratio: Includes lease payments and sinking fund payments with
interest obligations to arrive at the ratio of earnings to all xed cash obligations
Low or falling coverage ratios signal possible cash ow diculties
2. Leverage ratio / Debt-to-equity ratio
A too-high leverage ratio indicates excessive indebtedness, signaling the possibility the rm
will be unable to earn enough to satisfy the obligations on its bonds
3. Liquidity ratios
The two most common liquidity ratios are the current ratio (current assets/current liabilities)
and the quick ratio (current assets excluding inventories/current liabilities)
These ratios measure the rms ability to pay bills coming due with its most liquid assets
4. Protability ratios
Measures of rates of return on assets or equity. Indicators of a rms overall nancial health
The return on assets (earnings before interest and taxes divided by total assets) or return on
equity (net income/equity) are the most popular of these measures
Firms with higher returns on assets or equity should be better able to raise money in security
markets because they oer prospects for better returns on the rms investments
5. Cash ow-to-debt ratio
This is the ratio of total cash ow to outstanding debt
S&Ps periodically computes median values of selected ratios for rms in several rating classes
Ratios evaluated in context of industry standards. Analysts dier in ratios weights
Default rates vary dramatically with bond rating
Historically, 1% of bonds originally rated AA/better at issuance had defaulted after 15 years
That ratio is around 7.5% for BBB-rated bonds, and 40% for B-rated bonds
Credit risk clearly varies dramatically across rating classes
Studies have tested whether nancial ratios can in fact be used to predict default risk
Altman used discriminant analysis to predict bankruptcy
With this technique a rm is assigned a score based on its nancial characteristics
If its score exceeds a cut-o value, the rm is deemed creditworthy. A score below the cut-o
value indicates signicant bankruptcy risk in the near future
Altman found the following equation to best separate failing and nonfailing rms:
Z = 3.3
EBIT
Total assets
+ 99.9
Sales
Assets
+.6
Market value of equity
Book value of debt
+ 1.4
Retained earnings
Total assets
+ 1.2
Working capital
Total assets
Bond indentures
A bond is issued with an indenture, which is the contract between issuer/bondholder
Part of the indenture is a set of restrictions that protect rights of the bondholders
Include provisions relating to collateral/sinking funds/dividend policy/further borrowing
Issuer agrees to protective covenants to market its bonds to investors concerned about safety
77
Sinking funds
Payment of par value at end of bonds life constitutes large cash commitment for issuer
To help ensure the commitment does not create a CF crisis, the rm agrees to establish a
sinking fund to spread the payment burden over several years
The fund may operate in one of two ways:
1. Firm may repurchase fraction of outstanding bonds in the open market each year
2. Firm may purchase outstanding bonds at special call price from SF provision
Firm has option to purchase bonds at either market price/sinking fund price, whichever
is lower (bonds chosen for the call are selected at random based on serial number)
Although less common, sinking fund provision may call for periodic payments to trustee, with
payments invested so that accumulated sum is used for retirement of entire issue at maturity
The sinking fund call diers from a conventional bond call in two important ways:
1. Firm can repurchase only a limited fraction of bond issue at sinking fund call price
2. Usually: Callable bonds call prices > par value, sinking fund call price = par value
Although sinking funds protect bondholders (principal repayment more likely), they can hurt
investors: Firm will buy back discount bonds (selling below par) at market price, and buy back
premium bonds (selling above par) at par
One bond issue that does not require a sinking fund is a serial bond issue
In a serial bond issue, the rm sells bonds with staggered maturity dates
As bonds mature sequentially, principal repayment is spread over time ( sinking fund)
One advantage of serial bonds over sinking fund issues is that there is no uncertainty
introduced by the possibility that a particular bond will be called for the sinking fund
The disadvantage of serial bonds, however, is that bonds of dierent maturity dates are
not interchangeable, which reduces the liquidity of the issue
Subordination of further debt
One of the factors determining bond safety is total outstanding debt of the issuer
Subordination clauses restrict the amount of additional borrowing: Additional debt might be
required to be subordinated in priority to existing debt
Subordination is sometimes called a me-rst rule, meaning the senior (earlier) bondholders
are to be paid rst in the event of bankruptcy
Dividend restrictions
Covenants also limit the dividends rms may pay. These limitations protect bondholders
because they force the rm to retain assets rather than paying them out to stockholders
A typical restriction disallows payments of dividends if cumulative dividends paid since the
rms inception exceed cumulative retained earnings plus proceeds from sales of stock
Collateral
Some bonds are issued with specic collateral behind them
Collateral = particular asset of rm that bondholders receive if rm defaults on bond
If the collateral is property, the bond is called a mortgage bond
If collateral is another security held by rm Collateral trust bond
In the case of equipment, the bond is known as an equipment obligation bond
Collateralized bonds generally are considered the safest variety of corporate bonds
General debenture bonds by contrast do not provide collateral: They are unsecured bonds
Because they are safer, collateralized bonds generally oer lower yields than general debentures
Yield to maturity and default risk
Bonds subject to default risk Distinguish between bonds promised YTM and its expected yield
The stated yield is the maximum possible yield to maturity of the bond
The expected yield to maturity must take into account the possibility of a default
When bond becomes more subject to default risk, its price falls Its promised YTM rises
Similarly, default premium (spread between stated YTM/comparable Treasury bonds) rises
However, its expected YTM (ultimately tied to systematic risk of bond) will be far less aected
78
Default premium
To compensate for the possibility of default, corporate bonds must oer a default premium
The default premium is the dierence between the promised yield on a corporate bond and
the yield of an otherwise-identical government bond that is riskless in terms of default
Risk structure of interest rates Pattern of default premiums oered on risky bonds
The greater the default risk, the higher the default premium
One particular manner in which yield spreads vary over time is related to business cycle
Yield spreads tend to be wider when the economy is in a recession
Investors perceive a higher probability of bankruptcy when the economy is faltering, even
holding bond rating constant: They require a commensurately higher default premium
Aka ight to quality, meaning that investors move their funds into safer bonds unless they
can obtain larger premiums on lower-rated securities
Credit risk and collateralized debt obligations (CDOs)
Collateralized debt obligations: Major mechanism to reallocate credit risk in xed-income markets
To create a CDO, a nancial institution, commonly a bank, rst establishes a legally distinct entity
to buy and later resell a portfolio of bonds or other loans
A common vehicle for this purpose is the Structured Investment Vehicle (SIV)
Legal separation of bank/SIV allows ownership of the loans to be conducted o the banks
balance sheet, and thus avoids capital requirements the bank would otherwise encounter
The SIV raises funds, often by issuing short-term commercial paper, and uses the proceeds to
buy corporate bonds or other forms of debt such as mortgage loans or credit card debt
These loans are rst pooled together and then split into a series of classes known as tranches
Each tranche is given a dierent level of seniority in terms of its claims on the underlying loan
pool, and each can be sold as a stand-alone security
As the loans in the underlying pool make their interest payments, the proceeds are distributed
to pay interest to each tranche in order of seniority
This priority structure implies that each tranche has a dierent exposure to credit risk
The bottom tranche - aka the equity, rst-loss, or residual tranche - has last call on payments
from the pool of loans (head of the line in terms of absorbing default/delinquency risk)
Using junior tranches to insulate senior tranches from credit risk, one can create Aaa-rated
bonds even from a junk-bond portfolio
While Aaa-rated bonds are rare, Aaa-rated CDO tranches are common
Investors in tranches with the greatest exposure to credit risk demand the highest coupon rates
Investors with greater expertise in credit risk are natural investors in these securities
Often, the originating bank holds the residual tranche: Provides incentives to originator to
perform careful credit analysis of bonds included in structure
Mortgage-backed CDOs were an investment disaster in 2007
Highly rated tranches suered extreme losses as default rates far higher than anticipated
Rating agencies that certied these tranches as investment-grade came under re
Questions were raised concerning conicts of interest: Because the rating agencies are paid by
bond issuers, the agencies were accused of responding to pressure to ease their standards
79
80
Hull - Ch. 4: Interest rates
Types of rates
The higher the credit risk, the higher the interest rate that is promised by the borrower
Treasury rates
Treasury rates are the rates an investor earns on Treasury bills and Treasury bonds
These are the instruments used by a government to borrow in its own currency
Usually assumed that no chance a government will default on an obligation in its own currency
Traders argue that Treasury rates are too low to be used as risk-free rates because:
1. T-bills/Treasuries must be purchased by institutions to fulll many regulatory requirements
This increases demand for these Treasury instruments driving price up and yield down
2. Amount of capital a bank is required to hold for investment in T-bills/bonds is substantially smaller
than capital required to support similar investment in other instruments with very low risk
3. In the US, Treasuries are given a favorable tax treatment (no tax at the state level)
LIBOR
LIBOR is short for London Interbank Oered Rate
A LIBOR quote by a particular bank is the rate of interest at which the bank is prepared to
make a large wholesale deposit with other banks
Large banks/nancials quote LIBOR in all major currencies for maturities up to 12 months
AA-rated nancial institutions regard LIBOR as their short-term opportunity cost of capital: They
can borrow short-term funds at the LIBOR quotes of other nancial institutions
LIBOR rates are not totally free of credit risk
However, they are close to risk-free
Traders regard LIBOR rates as a better indication of the true risk-free rate than Treasury
rates, because many tax/regulatory issues cause Treasury rates to be articially low
LIBID rates (London Interbank Bid Rate)
Large banks also quote LIBID rates
This is the rate at which they will accept deposits from other banks
At any time, usually small spread between quoted LIBID/LIBOR rates (LIBOR > LIBID)
The rates themselves are determined by active trading between banks
LIBOR and LIBID trade in what is known as the Eurocurrency market
This market is outside the control of any one government
Repo rates
Sometimes trading activities are funded with a repo or repurchase agreement
This is a contract where an investment dealer who owns securities agrees to sell them to another
company now and buy them back later at a slightly higher price
The other company is providing a loan to the investment dealer
Dierence between prices at which securities are sold/repurchased is the interest it earns
The interest rate is referred to as the repo rate
If structured carefully, the loan involves very little credit risk
Measuring interest rates
Suppose that an amount A is invested for n years at an interest rate of R per annum
If the rate is compounded once per annum, the terminal value of the investment is: A(1 +R)
n
If rate compounded m times per annum, terminal value = A
_
1 +
R
m
_
mn
When m = 1, the rate is sometimes referred to as the equivalent annual interest rate
Continuous compounding
With continuous compounding (m ), A invested for n years at rate R grows to: Ae
Rn
Discounting at a continuously compounded rate R for n years involves multiplying by e
Rn
81
Suppose that R
c
is a rate of interest with continuous compounding and R
m
is the equivalent rate
with compounding m times per annum:
R
c
= mln
_
1 +
R
m
m
_
R
m
= m(e
Rc/m
1) (1)
Zero rates
n-year zero-coupon interest rate = rate of interest on investment starting today and lasting for n years
All the interest and principal is realized at the end of n years, there are no intermediate payments
n-year zero-coupon interest rate: Aka n-year spot rate, n-year zero rate, or n-year zero
Bond pricing
Price of a bond
Most bonds pay coupons to the holder periodically
The bonds principal (aka par value or face value) is paid at the end of its life
Theoretical price of bond calculated as PV of all CFs that will be received by bondholder
Sometimes bond traders use the same discount rate for all the CFs underlying a bond, but a more
accurate approach is to use a dierent zero rate for each CF
E.g., the theoretical price of a 2-year Treasury bond with a principal of $100 provides coupons at
the rate of 6% per annum semiannually is:
3e
0.050.5
+ 3e
0.0581.0
+ 3e
0.0641.5
+ 103e
0.0682.0
= 98.39
Bond yield
Bonds yield = discount rate that, when applied to all CFs, gives bond price = market price
If y is the yield on the bond, expressed with continuous compounding, then:
3e
y0.5
+ 3e
y1.0
+ 3e
y1.5
+ 103e
y2.0
= 98.39 y = 6.76%
Par yield
Par yield for a bond maturity = coupon rate that causes bond price = par value
Suppose that coupon on 2-year bond is c per annum (c/2 per 6m). Bond price = par value when:
c
2
e
0.050.5
+
c
2
e
0.0581.0
+
c
2
e
0.0641.5
+
_
100 +
c
2
_
e
0.0682.0
= 100 c = 6.87
The 2-year par yield is therefore 6.87% per annum with semiannual compounding
If d = PV of $1 received at maturity, A = value of annuity that pays $1 on each coupon date, and
m = # of coupon payments per year, then the par yield c is:
A
c
m
+ 100d = 100 c =
(100 100d)m
A
Determining treasury zero rates
One way of determining Treasury zero rates is to observe the yields on strips
Bootstrap method
Another way to determine Treasury zero rates is from Treasury bills and coupon-bearing bonds
Start with short-term instruments and move progressively to longer-term instruments, making sure
that zero rates calculated at each stage are consistent with prices of instruments
Bond principal ($) Time to maturity (years) Annual coupon ($) Bond price ($)
100 0.25 0 97.5
100 0.50 0 94.9
100 1.00 0 90.0
100 1.50 8 96.0
100 2.00 12 101.6
82
1st three bonds pay no coupons Zero rates corresponding to their maturities easily calculated
3-month bond provides return of 2.5 in 3 months on an initial investment of 97.5. With
quarterly compounding, 3-month zero rate is (4 2.5)/97.5 = 10.256% per annum. With
continuous compounding, rate becomes: 4 ln(1 + 0.10256/4) = 10.127% per annum
The 6-month bond provides a return of 5.1 in 6 months on an initial investment of 94.9. With
semiannual compounding the 6-month rate is (2 5.1)/94.9 = 10.748% per annum. With
continuous compounding, it becomes: 2 ln(1 + 0.10748/2) = 10.469% per annum
Similarly, the 1-year rate with continuous compounding is: ln(1+10/90) = 10.536% per annum
4th bond lasts 1.5 years. Payments: $4 at 6 months, $4 at one year, and $104 at 1.5 years
From earlier calculations, the discount rate for the payment at the end of 6 months is 10.469%
and that the discount rate for the payment at the end of 1 year is 10.536%
The bonds price $96 must equal the PV of all the payments received by bondholder
Denoting the 1.5-year zero rate by R, we have:
4e
0.104690.5
+ 4e
0.105361.0
+ 104e
R1.5
= 96 R = 10.681%
This is the only zero rate consistent with 6-month rate, 1-year rate, and data in table
2-year zero rate calculated similarly from 6m, 1yr, and 1.5yr zero rates, and info on last bond
If R is the 2-year zero rate, then:
6e
0.104690.5
+ 6e
0.105361.0
+ 6e
0.106811.5
+ 106e
R2.0
= 101.6 R = 10.808%
A chart showing the zero rate as a function of maturity is known as the zero curve
Common assumption: Zero curve is linear between points determined using bootstrap method
The 1.25-year zero rate is 0.5 10.536 + 0.5 10.681 = 10.608%
Usually assumed that zero curve is horizontal prior to rst point/beyond last point
Using longer maturity bonds, zero curve would be more accurately determined beyond 2 years
In practice, we do not usually have bonds with maturities equal to exactly 1.5 years, 2 years, etc.
Approach: Interpolate between bond price data before it is used to calculate zero curve
E.g., if a 2.3-year bond with a coupon of 6% sells for 98 and a 2.7-year bond with a coupon of 6.5%
sells for 99, assume that a 2.5-year bond with a coupon of 6.25% sells for 98.5
Forward rates
Forward interest rates are rates of interest implied by current zero rates for future periods of time
Year Zero rate for n-yr investment Forward rate for n-th year
1 3.0
2 4.0 5.0
3 4.6 5.8
4 5.0 6.2
5 5.3 6.5
The forward interest rate in the above table for year 2 is 5% per annum
Rate of interest implied by zero rates for period between end of 1st year/end of 2nd year
Calculated from 1-year zero rate of 3% per annum and 2-year zero rate of 4% per annum
Equals rate for year 2 that, when combined with 3% for year 1, gives 4% overall for the 2 years
When interest rates are continuously compounded and rates in successive time periods are combined,
the overall equivalent rate is simply the average rate during the whole period
If R
1
/R
2
are zero rates for maturities T
1
/T
2
, and R
F
= forward rate for period [T
1
, T
2
], then:
R
F
=
R
2
T
2
R
1
T
1
T
2
T
1
R
F
= R
2
+ (R
2
R
1
)
T
1
T
2
T
1
(2)
If the zero curve is upward sloping between T
1
and T
2
, so that R
2
> R
1
, then R
F
> R
2
(i.e., the
forward rate for a period of time ending at T
2
is greater than the T
2
zero rate)
If zero curve is downward sloping (R
2
< R
1
), then R
F
< R
2
(forward rate < T
2
zero rate)
83
Instantaneous forward rate for a maturity of T
Taking limits as T
2
approaches T
1
:
R
F
= R +T
R
T
where R is the zero rate for a maturity of T
Value of R
F
obtained in this way is known as instantaneous forward rate for a maturity of T
This is the forward rate applicable to a very short future time period that begins at time T
Dening P(0, T) as the price of a zero-coupon bond maturing at time T, since P(0, T) = e
RT
,
the instantaneous forward rate can also be written as:
R
F
=
T
lnP(0, T)
Assuming that the zero rates for borrowing and investing are the same (which is close to the truth for
a large nancial institution), an investor can lock in the forward rate for a future time period
Lending at the locked forward rate
E.g., investor borrows $100 at 3% for 1 year and invests the money at 4% for 2 years
Result: Cash outow of 100e
0.031
= 103.05 at EoY
1
, inow of 100e
0.042
= 108.33 at EoY
2
108.33 = 103.05e
0.05
Return equal to forward rate (5%) earned on $103.05 during 2nd year
Borrowing at the locked forward rate
E.g., investor borrows $100 for four years at 5% and invests it for three years at 4.6%
Result: Cash inow of 100e
0.0463
= 114.80 at EoY
3
, outow of 100e
0.054
= 122.14 at EoY
4
Since 122.14 = 114.80e
0.062
money is being borrowed for the 4th year at forward rate of 6.2%
If an investor thinks that rates in the future will be dierent from todays forward rates there are many
trading strategies that the investor will nd attractive
Forward rate agreements
A forward rate agreement (FRA) is an over-the-counter agreement that a certain interest rate will apply
to either borrowing or lending a certain principal during a specied future period of time
Assumption underlying the contract: Borrowing/lending would normally be done at LIBOR
Consider FRA where X agrees to lend money to Y for period [T
l
, T
2
]. Dene:
R
K
: The rate of interest agreed to in the FRA
R
F
: The forward LIBOR interest rate for the period between T
1
and T
2
, calculated today
R
M
: The actual LIBOR interest rate observed in the market at time T
1
for period [T
1
, T
2
]
L: The principal underlying the contract
Assume that the rates R
K
, R
F
, and R
M
are all measured with a compounding frequency reecting
the length of the period to which they apply
Normally X would earn R
M
from the LIBOR loan. FRA means that it will earn R
K
Extra interest rate (may be negative) earned as result of entering into FRA is R
K
R
M
The interest rate is set at time T
1
and paid at time T
2
The extra interest rate therefore leads to a cash ow to X at time T
2
of:
L(R
K
R
M
)(T
2
T
1
) (3)
Similarly there is a cash ow to Y at time T
2
of:
L(R
M
R
K
)(T
2
T
1
) (4)
Interpretation of FRA
It is an agreement where X will receive interest on the principal between T
1
and T
2
at the xed
rate of R
K
and pay interest at the realized market rate of R
M
Y will pay interest on principal between T
1
and T
2
at xed rate of R
K
and receive interest at R
M
Usually FRAs are settled at time T
1
rather than T
2
The payo must then be discounted from time T
2
to T
1
84
The payo at time T
1
for each company is then:
X:
L(R
K
R
M
)(T
2
T
1
)
1 +R
M
(T
2
T
1
)
and Y:
L(R
M
R
K
)(T
2
T
1
)
1 +R
M
(T
2
T
1
)
Valuation of FRAs
An FRA is always worth zero when R
K
= R
F
. This is because, a large nancial institution can at
no cost lock in the forward rate for a future time period
E.g., it can ensure that it earns the forward rate for the time period between years 2 and 3 by
borrowing a certain amount of money for 2 years and investing it for 3 years
Similarly, it can ensure that it pays the forward rate for the time period between years 2 and
3 by borrowing for a certain amount of money for 3 years and investing it for 2 years
Compare two FRAs:
1st promises that LIBOR forward rate R
F
will be earned on principal of L between T
1
and T
2
2nd promises that R
K
will be earned on the same principal between same two dates
The two contracts are the same except for the interest payments received at time T
2
Excess value of 2nd contract over 1st = PV of dierence between interest payments:
L(R
K
R
F
)(T
2
T
1
)e
R
2
T
2
where R
2
= riskless zero rate for maturity T
2
Hence, the value of the FRA where R
K
is earned is:
V
FRA
= L(R
K
R
F
)(T
2
T
1
)e
R
2
T
2
(5)
Similarly, for company receiving interest at oating rate/paying interest at R
K
, value of FRA is:
V
FRA
= L(R
F
R
K
)(T
2
T
1
)e
R
2
T
2
(6)
An FRA can be valued if we:
1. Calculate the payo on the assumption that forward rates are realized (i.e., R
M
= R
F
)
2. Discount this payo at the risk-free rate
Duration
Duration: Measure of how long on avg. the bondholder has to wait before receiving cash payments
A zero-coupon bond that lasts n years has a duration of n years
However, a coupon-bearing bond lasting n years has a duration of less than n years
Suppose that a bond provides the holder with cash ows c
i
at time t
i
(1 i n). The bond price B
and bond yield y (continuously compounded) are related by:
B =
c
i
e
yt
i
(7)
The duration D of the bond is dened as:
D =
t
i
c
i
e
yt
i
B
=
n
i=1
t
i
_
c
i
e
yt
i
B
_
(8)
The term in square brackets is the ratio of the PV of the CF at time t
i
to the bond price
The bond price is the PV of all payments
For purposes of duration, all discounting is done at the bond yield rate of interest y
The duration is a weighted average of the times when payments are made, with the weight applied
to time t
i
being equal to the proportion of the bonds total PV provided by the CF at time t
i
When a small change y in the yield is considered, it is approximately true that:
B =
dB
dy
y = y
n
i=1
c
i
t
i
e
yt
i
(9)
85
Key duration relationship
B = BDy
B
B
= Dy (10)
Modied duration
If y is expressed with annual compounding, the approximate relationship in Eq. (10) becomes:
B =
BDy
1 +y
More generally, if y is expressed with a compounding frequency of m times per year, then:
B =
BDy
1 +y/m
Modied duration D
=
D
1 +y/m
When y is expressed with a compounding frequency of m times per year, the bonds modied
duration D
y (11)
Dollar duration
This is the product of modied duration and bond price
B = D
y, where D
= B D
c
i
t
2
i
e
yt
i
B
From Taylor expansion, we obtain a more accurate expression than Eq. (10):
B =
dB
dy
y +
1
2
d
2
B
dy
2
y
2
B
B
= Dy +
1
2
C(y)
2
Convexity of bond portfolio greatest when portfolio provides payments evenly over long period of time
It is least when the payments are concentrated around one particular point in time
By choosing a portfolio of assets and liabilities with a net duration of zero and a net convexity of zero,
a nancial institution can make itself immune to relatively large parallel shifts in the zero curve
However, it is still exposed to nonparallel shifts
Theories of the term structure of interest rates
What determines the shape of the zero curve?
86
Expectations theory
Conjectures that long-term interest rates should reect expected future short-term interest rates
More precisely, it argues that a forward interest rate corresponding to a certain future period is
equal to the expected future zero interest rate for that period
Market segmentation theory
Conjectures that no relationship between short-, medium-, and long-term interest rates
Investors choose bonds of certain maturity and do not readily switch from one maturity to another
The short-term interest rate is determined by supply/demand in short-term bond market
Medium-term interest rate is determined by supply/demand in medium-term bond market . . .
Liquidity preference theory
Most appealing theory: Argues that forward rates always higher than expected future zero rates
Basic assumption: Investors prefer to preserve liquidity and invest funds for short periods of time
Borrowers, on the other hand, usually prefer to borrow at xed rates for long periods of time
Liquidity preference theory leads to forward rates > expected future zero rates
Consistent with empirical result that yield curves tend to be upward sloping
The management of net interest income
Banks net interest income: Excess of interest received over interest paid. Needs to be managed
Suppose you have money to deposit and agree with the prevailing view that interest rate increases
are just as likely as interest rate decreases
Would you deposit your money for 1 year at 3% per annum or for 5 years at 3% per annum?
Choose one year because more nancial exibility: Funds tied up for shorter period of time
Now suppose that you want a mortgage
Choose 5-year mortgage because it xes borrowing rate Less renancing risk
When banks have same rates for various maturities, customers opt for 1-yr deposits/5-yr mortgages
This creates an asset/liability mismatch for the bank and subjects it to risks
If rates rise, deposits nancing 6% loans will cost more and net interest income declines
Asset/liability management group must ensure that the maturities of the assets on which interest
is earned and the maturities of the liabilities on which interest is paid are matched
One way it can do this is by increasing the ve-year rate on both deposits and mortgages
The net result of all banks behaving in such way is liquidity preference theory:
Long-term rates tend to be higher than predicted by expected future short-term rates
The yield curve is upward sloping most of the time
It is downward sloping only when market expects a really steep decline in short-term rates
Sometimes derivatives such as interest rate swaps are also used to manage exposure
The result of all this is that net interest income is very stable
87
88
BKM - Ch. 15: The term structure of interest rates
Introduction
Relationship between time to maturity and YTM can vary dramatically from one period to another
Term structure of interest rates: Structure of interest rates to discount CFs of dierent maturities
Reveals market-consensus forecasts of future interest rates. Interest rate risk may aect those inferences
Traders use term structure to compute forward rates on forward or deferred loans
The yield curve
Yield curve
Summarizes the relationship between yield and maturity
Plot of yield to maturity as a function of time to maturity
Key concerns of xed-income investors and central to bond valuation
Allows investors to gauge their expectations for future interest rates against those of the market
Bond pricing
If yields on dierent-maturity bonds are not all equal, how to value coupon bonds that make
payments at many dierent times? Consider each bond cash ow - either coupon or principal
payment - as at least potentially sold o separately as a stand-alone zero-coupon bond
Treasury stripping suggests exactly how to value a coupon bond
If each cash ow can be sold o as a separate security, then the value of the whole bond should
be the same as the value of its cash ows bought piece by piece in the STRIPS market
If it werent, then easy prots to be made: If bond sold for less than the amount at which the
sum of its parts could be sold, buy the bond, strip it into stand-alone zero-coupon securities,
sell o the stripped CFs, and prot by the price dierence
Both bond stripping and bond reconstitution oer opportunities for arbitrage
Because each coupon payment matures at a dierent time, discount by using yield appropriate to
its particular maturity, i.e. yield on a Treasury strip maturing at the time of that CF
Example - Valuing coupon bonds
Value a 10% coupon bond with a maturity of 3 years when the yield to maturity for zero
coupon bonds of maturities of 1, 2 and 3 years are 5%, 6% and 7% respectively
The rst cash ow, the $100 coupon paid at the end of the rst year, is discounted at 5%. The
second cash ow, the $100 coupon at the end of the second year, is discounted at 6%
Final CF (nal coupon plus par value = $1,100) discounted at 7% Bond value:
100
1.05
+
100
1.06
2
+
1100
1.07
3
= 1, 082.17
Its YTM is 6.88%: While its maturity matches that of the 3-year zero, its yield is a bit lower
This reects the fact that the 3-year coupon bond may usefully be thought of as a portfolio of
three implicit zero-coupon bonds, one corresponding to each cash ow.
Yield on coupon bond = amalgam of yields on each of the three portfolio components
The pure yield curve refers to the curve for stripped, or zero-coupon, Treasuries
In contrast, the on-the-run yield curve refers to the plot of yield as a function of maturity for
recently issued coupon bonds selling at or near par value
On-the-run Treasuries have greatest liquidity Traders interested in their yield curve
The yield curve and future interest rates
The yield curve under certainty
If interest rates are certain, what should we make of the fact that the yield on the 2-year zero
coupon bond is greater than that on the 1-year zero?
It cant be that one bond is expected to provide a higher rate of return than the other
Upward-sloping yield curve: Evidence that short-term rates are going to be higher next year
Terminology to distinguish yields on long-term bonds from short-term rates available in future:
They call the yield to maturity on zero-coupon bonds the spot rate y, meaning the rate that
prevails today for a time period corresponding to the zeros maturity
89
In contrast, the short rate r for a given time interval (e.g., 1 year) refers to the interest rate
for that interval available at dierent points in time
E.g., the 2-year spot rate y
2
is an average of todays short rate r
1
and next years short rate r
2
.
But because of compounding, that average is a geometric one: (1 +y
2
)
2
= (1 +r
1
) (1 +r
2
)
At least in part, the yield curve reects the markets assessments of coming interest rates
When next years short rate r
2
is greater than this years short rate r
1
, the average of the two
rates is higher than todays rate, so y
2
> r
1
, and the yield curve slopes upward
If next years short rate were less than r
1
, the yield curve would slope downward
Example - Finding a future short rate
Compare 3yr strategies: (i) Buy a 3yr zero, with YTM = 7%, and hold to maturity, (ii) Buy
a 2yr zero yielding 6%, and roll proceeds in year 3, at short rate r
3
We must have: Buy/hold 3-year zero = Buy 2-year zero/roll into 1-year bond
(1 +y
3
)
3
= (1 +y
2
)
2
(1 +r
3
) r
3
= (1 +y
3
)
3
/(1 +y
2
)
2
1
Yield on 3-year bond reects geometric average of discount factors for next 3 years:
1 +y
3
= [(1 +r
1
) (1 +r
2
) (1 +r
3
)]
1/3
Yield/spot rate on long-term bond reects path of short rates anticipated by market
Holding-period returns
Multiyear cumulative returns on all of our competing bonds ought to be equal
This conclusion holds for holding-period returns over shorter periods such as a year
In a world of certainty, all bonds must oer identical returns, or investors will ock to the higher-
return securities, bidding up their prices, and reducing their returns
Example - Holding-period returns on zero-coupon bonds
1-year bond can be bought today for 1, 000/1.05 = 952.38 and matures to par value in 1 year
Rate of return is (1, 000 952.38)/952, 38 = .05
The 2-year bond can be bought for 1, 000/1.06
2
= 890, 00. Next year, the bond will have a
remaining maturity of 1 year and the 1-year interest rate will be 7.01%
Therefore, its price next year will be 1, 000/1.0701 = 934.49, and its 1-year holding-period rate
of return will be (934.49 890.00)/890.00 = .05, for an identical 5% rate of return
Forward rates
Generalization to inferring a future short rate from the yield curve of zero-coupon bonds:
Equate total return on two n-year strategies: (i) Buying/holding n-year zero-coupon bond vs.
(ii) Buying (n 1)-year zero and rolling over proceeds into 1-year bond
(1 +y
n
)
n
= (1 +y
n1
)
n1
(1 +r
n
) (1)
Given the observed yield curve, we can solve Eq. (1) for the short rate in the last period:
(1 +r
n
) =
(1 +y
n
)
n
(1 +y
n1
)
n1
(2)
Numerator on RHS: Total growth factor of investment in n-year zero held to maturity
Similarly, the denominator is the growth factor of an investment in an (n 1)-year zero
The dierence in these growth factors is the rate of return available in year n when the (n1)-
year zero can be rolled over into a 1-year investment
Forward interest rate
Future interest rates are uncertain Interest rate infered is the forward interest rate rather
than the future short rate (need not be interest rate that actually prevails at future date)
If the forward rate for period n is denoted f
n
, we then dene f
n
by:
(1 +f
n
) =
(1 +y
n
)
n
(1 +y
n1
)
n1
(1 +y
n
)
n
= (1 +y
n1
)
n1
(1 +f
n
) (3)
Forward rates equal future short rates in the special case of interest rate certainty
90
Interest rate uncertainty and forward rates
Short-term investors
When we account for risk, it is clear that short-term investors will shy away from the long-term
bond unless it oers an expected return greater than that of the 1-year bond
Investors require a risk premium to hold longer-term bond: Risk-averse investors will hold long-
term bond only if expected value of short rate < break-even value f
2
(the lower the expectation of
r
2
, the greater the anticipated return on the long-term bond)
If most individuals are short-term investors, bonds must have prices that make f
2
> E(r
2
)
The forward rate will embody a premium compared with the expected future short-interest rate
This liquidity premium compensates short-term investors for the uncertainty about the price at
which they will be able to sell their long-term bonds at the end of the year
Long-term investors
Whenever that risk is important, the long-term investor will not be willing to engage in the rollover
strategy unless its expected return exceeds that of the 2-year bond
LT investor requires that expected value of next years short rate > forward rate: E(r
2
) > f
2
Therefore, if all investors were long-term investors, no one would be willing to hold short-term
bonds unless those bonds oered a reward for bearing interest rate risk
Bond prices would be at levels such that rolling over short bonds resulted in greater expected return
than holding long bonds Forward rate < expected future spot rate
The liquidity premium f
2
E(r
2
) therefore is negative
Whether forward rates equal expected future short rates depends on investors readiness to bear interest
rate risk and willingness to hold bonds that do not correspond to their investment horizons
Theories of the term structure
The expectations hypothesis
Simplest theory of the term structure: States that the forward rate equals the market consensus
expectation of the future short interest rate: f
2
= E(r
2
) and liquidity premiums are zero
If f
2
= E(r
2
), we may relate yields on long-term bonds to expectations of future interest rates
Can use forward rates derived from yield curve to infer market expectations of future short rates
Liquidity preference
Short-term investors unwilling to hold long-term bonds unless f
2
> E(r
2
), whereas long-term
investors unwilling to hold short-term bonds unless E(r
2
) > f
2
Both groups require a premium to hold bonds with maturities ,= investment horizons
Liquidity preference theory
Short-term investors dominate the market so that forward rate > expected short rate
The excess of f
2
over E(r
2
), the liquidity premium, is predicted to be positive
F
o
rw
a
rd
ra
te
Expected short rate is increasing
Yield curve )
rises sharply
Liquidity premium
Increases with
maturity
Yield curve is humped
Forward rate
Expected short rate is falling
Constant liquidity premium
F
o
rw
a
rd
ra
te
I
n
t
e
r
e
s
t
R
a
t
e
(
%
)
I
n
t
e
r
e
s
t
R
a
t
e
(
%
)
Year
Year
interest rates
despite falling expected
Rising yield curve )
maturity
Increases with
Liquidity premium
Expected short rate is falling
Y
ield curve
Year
I
n
t
e
r
e
s
t
R
a
t
e
(
%
)
Constant liquidity premium
Yield curve is upward-sloping
Forward rate
Expected short rate is constant
Year
I
n
t
e
r
e
s
t
R
a
t
e
(
%
)
Figure 1: Yield curves
91
Interpreting the term structure
Yield curve reects expectations of future short rates Powerful tool for xed-income investors
If we can use the term structure to infer future interest rate expectations of other investors in the
economy, we can use those expectations as benchmarks for our own analysis
Unfortunately, the yield curve also reects other factors such as liquidity premiums
Moreover, forecasts of interest rate changes may have dierent investment implications depending
on whether those changes are driven by changes in the expected ination rate or the real rate
Certainty vs. uncertainty
Under certainty, 1 plus the yield to maturity on a zero-coupon bond is simply the geometric average
of 1 plus the future short rates that will prevail over the life of the bond
1 +y
n
= [(1 +r
1
)(1 +r
2
) (1 +r
n
)]
1/n
When future rates are uncertain, we replace future short rates with forward rates:
1 +y
n
= [(1 +r
1
)(1 +f
2
) (1 +f
n
)]
1/n
(4)
Direct relationship between yields on various maturity bonds and forward interest rates
What factors can account for a rising yield curve?
The yield curve is upward-sloping at any maturity date n for which the forward rate for the coming
period is greater than the yield at that maturity
What can account for that higher forward rate?
The forward rate can be related to the expected future short rate according to:
f
n
= E(r
n
) + Liquidity premium (5)
Two explanations for a higher forward rate:
1. Either investors expect rising interest rates, meaning that E(r
n
) is high
2. Or they require a large premium for holding longer-term bonds
Although expectations of increases in future interest rates can result in rising yield curve,
converse not true: Rising yield curve does not necessarily imply expectations of higher future rates
Liquidity premiums eects confound simple attempt to extract expectations from term structure
Rough approach to deriving expected future spot rates: Assume that liquidity premiums are constant
Markets expected interest rate = [Forward rate] - [Estimate of liquidity premium]
This approach has little to recommend it for two reasons:
1. It is next to impossible to obtain precise estimates of a liquidity premium:
The general approach to doing so would be to compare forward rates and eventually realized
future short rates and to calculate the average dierence between the two
However, the deviations between the two values can be quite large and unpredictable
because of unanticipated economic events that aect the realized short rate
2. There is no reason to believe that the liquidity premium should be constant
Still, steep yield curves are interpreted by professionals as warning signs of impending rate increases:
In fact, the yield curve is a good predictor of the business cycle as a whole, because long-term rates
tend to rise in anticipation of an expansion in economic activity
When steep curve, far lower probability of recession next year than when curve inverted/falling
The yield curve is a component of the index of leading economic indicators
Valid to interpret downward-sloping yield curve as evidence that interest rates are expected to decline?
If term premiums (spread between yields on long- and short-term bonds) generally are positive,
then downward-sloping curve signals anticipated declines in rates (possible impending recession)
Why might interest rates fall? Two factors: (i) The real rate, and (ii) The ination premium
Nominal interest rate composed of real rate plus factor to compensate for ination:
1 + Nominal rate = (1 + Real rate)(1 + Ination rate)
92
Important to distinguish between changes in expected real rate/expected ination rate because
economic environments associated with them may vary substantially
High real rates indicate rapidly expanding economy, high gov. decits, tight monetary policy
High ination rates can arise out of rapidly expanding economy, but also caused by rapid
expansion of money supply/supply-side shocks to the economy (oil supplies trouble)
Forward rates as forward contracts
In general, forward rates ,= eventually realized short rate, or even todays expected short rate
But there is still an important sense in which the forward rate is a market interest rate
Suppose that you wanted to arrange now to make a loan at some future date:
Need to agree today on the interest rate that will be charged
Interest rate on such forward loan would be the forward rate of interest for the loan period
To construct synthetic forward loan, sell (1 +f
2
) 2-year zeros for every 1-year zero that you buy
Denote B
0
(T) todays price of a zero maturing at time T
Pay B
0
(1) for a zero maturing in 1 year, receive B
0
(2) for each zero you sell maturing in 2 years
Initial CF = 0 because prices of the 1-/2-year zeros dier by factor (1 +f
2
)
B
0
(1) =
$1, 000
(1 +y
1
)
while B
0
(2) =
$1, 000
(1 +y
2
)
2
=
$1, 000
(1 +y
1
)(1 +f
2
)
Therefore, selling (1 +f
2
) 2-year zeros generates just enough cash to buy one 1-year zero
Both zeros mature to face value of $1,000 Dierence between cash inow at t = 1 and cash
outow at t = 2 is 1 +f
2
(see Fig. 2) f
2
is the rate on the forward loan
Present
2
f The rate that can be locked in for a one-period-ahead loan is
Amount repaid one year after borrowing:
Amount borrowed one year from now:
(which is observable
1
r General forward rate: The short rates in the two periods are
(which is not).
2
r today) and
2
r
1
r
2
1 0
)
2
f -$1,000(1 +
$1,000
Figure 2: Engineering a synthetic forward loan
93
94
Hull - Ch. 6.1: Day count and quotation conventions
Day counts
The day count denes the way in which interest accrues over time
The day count convention is usually expressed as X/Y
X = Number of days between the two dates, as measured by convention
Y = Total number of days in the reference period, as measured by convention
The interest earned between the two dates is:
Number of days between dates
Number of days in reference period
Interest earned in reference period
Three day count conventions commonly used in the US:
1. Actual/actual (in period)
2. 30/360
3. Actual/360
Actual/actual day count
Used for Treasury bonds in the US
The interest earned between two dates is based on the ratio of the actual days elapsed to the actual
number of days in the period between coupon payments
Suppose that the coupon payment dates are March 1 and September 1, the coupon rate is 8%, and
we wish to calculate the interest earned between March 1 and July 3
The reference period is from March 1 to September 1
There are 184 (actual) days in this period, and interest of $4 is earned during the period
There are 124 (actual) days between March 1 and July 3
The interest earned between March 1 and July 3 is therefore
124
184
4 = 2.6957
30/360 day count
Used for corporate and municipal bonds in the US
Assumes 30 days per month and 360 days per year when carrying out calculations
With the 30/360 day count, the total number of days between March 1 and September 1 is 180
The total number of days between March 1 and July 3 is (4 30) + 2 = 122
In a corporate bond with the same terms as the Treasury bond above, the interest earned between
March 1 and July 3 would therefore be:
122
180
4 = 2.7111
Actual/360 day count
Used for money market instruments in the US
The reference period is 360 days
The interest earned during part of a year is calculated by dividing the actual number of elapsed
days by 360 and multiplying by the rate
The interest earned in a whole year of 365 days is 365/360 times the quoted rate
Conventions vary from country to country and from instrument to instrument
LIBOR is quoted on an actual/360 for all currencies except sterling (actual/365 basis)
Euro-denominated and sterling bonds are usually quoted on an actual/actual basis
Price quotations
The prices of money market instruments are sometimes quoted using a discount rate
This is the interest earned as a % of the nal face value rather than as a % of the initial price paid
95
Example: Treasury bills in the US
If price of 91-day T-bill is quoted as 8, the annualized rate of interest earned is 8% of face value
Interest of $2.0222 (= 100 0.08 91/360) is earned over the 91-day life
This corresponds to true rate of interest of 2.0222/(100 2.0222) = 2.064% for the 91-day period
In general, the relationship between the cash price and quoted price of a Treasury bill in the US is:
P =
360
n
(100 Y )
Where P = quoted price, Y = cash price, and n = remaining life of T-bill measured in calendar days
US Treasury bonds
Treasury bond prices in the United States are quoted in dollars and thirty-seconds of a dollar
The quoted price is for a bond with a face value of $100 A quote of 90-05 indicates that the
quoted price for a bond with a face value of $100,000 is $90,156.25 = 1, 000 (90 + 5/32)
Denitions
Clean price: The quoted price
Dirty price: The cash price paid by the purchaser of the bond
Cash price = Quoted price + Accrued interest since last coupon date
Example
Suppose on March 5, 2010, the bond under consideration is an 11% coupon bond maturing on July
10, 2018, with a quoted price of 95-16 or $95.50
Because coupons are paid semiannually on government bonds, the most recent coupon date is
January 10, 2010, and the next coupon date is July 10, 2010
The number of days between January 10, 2010, and March 5, 2010, is 54, whereas the number of
days between January 10, 2010, and July 10, 2010, is 181
On a bond with $100 face value, the coupon payment is $5.50 on January 10 and July 10
Accrued interest on March 5, 2010 = share of July 10 coupon accruing on March 5, 2010
Because actual/actual in period is used for Treasury bonds in the US, this is:
54
181
$5.5 = $1.64
The cash price per $100 face value for the bond is therefore: 95.50 + 1.64 = 97.14
The cash price of a $100,000 bond is $97,140
96
Hull - Ch. 22 - Part 1: Credit risk
Credit ratings
Credit risk arises from possibility that borrowers/counterparties in derivatives transactions may default
Rating agencies (Moodys, S&P, Fitch) provide ratings describing creditworthiness of corporate bonds
Moodys ratings
Moodys best rating: Aaa. Aaa bonds considered to have almost no chance of default
The next best rating is Aa. Following that comes A, Baa, Ba, B, Caa, Ca, and C
Only bonds with ratings of Baa or above are considered to be investment grade
S&P ratings
The S&P ratings corresponding to Moodys Aaa, Aa, A, Baa, Ba, B, Caa, Ca, and C are AAA,
AA, A, BBB, BB, B, CCC, CC, and C, respectively
Fitchs rating categories are similar to those of S&P
To create ner rating measures, Moodys divides its Aa rating category into Aal, Aa2, and Aa3, . . .
Similarly, S&P divides its AA rating category into AA+, AA, and AA-, . . .
Moodys Aaa/S&Ps AAA category not subdivided, nor are the two lowest rating categories
Historical default probabilities
The data produced by rating agencies shows the default experience during a 20-year period of bonds
that had a particular rating at the beginning of the period
For investment grade bonds, probability of default in a year is an increasing function of time
This is because the bond issuer is initially considered to be creditworthy, and the more time
that elapses, the greater the possibility that its nancial health will decline
For bonds with a poor credit rating, probability of default is often a decreasing function of time
Reason: For a bond with a poor credit rating, the next year or two may be critical
The longer the issuer survives, the greater the chance that its nancial health improves
Default intensities
Unconditional default probability
The probability of default during the n-th year year as seen at time 0
Conditional default probabilities - Default intensities / Hazard rates
Probability that bond defaults during the n-th year conditional on no earlier default
The default intensity (t) at time t is then dened so that (t)t is the probability of default
between time t and t + t conditional on no earlier default
Denoting V (t) the cumulative probability of the company surviving to time t (i.e., no default by
time t), the conditional probability of default between time t and t +t is [V (t) V (t +t)]/V (t)
Since this is equal to (t)t, it follows that:
dV (t)
dt
= (t)V (t) V (t) = e
t
0
()d
Dening Q(t) as the probability of default by time t, so that Q(t) = 1 V (t), gives:
Q(t) = 1 e
t
0
()d
= 1 e
(t)t
(1)
Where
(t) is the average default intensity (hazard rate) between time 0 and time t
Recovery rates
Recovery rate for a bond: Bonds market value immediately after default, as % of its face value
Senior secured debt holders had an average recovery rate of 54.44 cents per dollar of face value
Junior subordinated debt holders had avg. recovery rate of 24.47 cents per dollar of face value
Recovery rates are signicantly negatively correlated with default rates
Moodys looked at avg. recovery rates/avg. default rates each year between 1982-06
Recovery = 59.1 - 8.356 Default rate
The recovery rate is the average recovery rate on senior unsecured bonds in a year measured as a
% and the default rate is the corporate default rate in the year measured as a %
Bad year for default rate is doubly bad because accompanied by low recovery rate
97
Estimating default probabilities from bond prices
The probability of default for a company can be estimated from the prices of bonds it has issued
Assumption: Corporate bond sells for less than similar risk-free bond due to possibility of default
In practice, bond prices are aected by liquidity: The lower the liquidity, the lower the price
Quick calculation of the average default intensity
Suppose that a bond yields s basis points more than a similar risk-free bond and that the expected
recovery rate in the event of a default is R%
The holder of the bond must be expecting to lose s basis points per year from defaults
Given recovery rate of R%, probability of default per year conditional on no earlier default is:
=
s
1 R
where
= avg. default intensity (hazard rate) per year (2)
A more exact calculation
Suppose that the corporate bond we have been considering lasts for 5 years, provides a coupon 6%
per annum (paid semiannually) and that the yield on the corporate bond is 7% per annum (with
continuous compounding). The yield on a similar risk-free bond is 5%
The yields imply that corporate bond price is 95.34 and risk-free bond price is 104.09
The expected loss from default over the 5-year life of the bond is 104.09 95.34 = 8.75
Denoting the probability of default per year Q, the table below calculates the expected loss from
default in terms of Q on the assumption that defaults can happen at times 0.5, 1.5, 2.5, 3.5, and
4.5 years (immediately before coupon payment dates)
Time Default Recovery Risk-free Loss given Discount PV of expected
(years) probability amount ($) value ($) default ($) factor loss ($)
0.5 Q 40 106.73 66.73 0.9753 65.08Q
1.5 Q 40 105.97 65.97 0.9277 61.20Q
2.5 Q 40 105.17 65.17 0.8825 57.52Q
3.5 Q 40 104.34 64.34 0.8395 54.01Q
4.5 Q 40 103.46 63.46 0.7985 50.67Q
Total 288.48Q
Calculation example
Expected value of corp. bond at time 3.5 years (using forward interest rates, no default) is:
3 + 3e
0.050.5
+ 3e
0.051.5
+ 103e
0.051.5
= 104.34
Amount recovered if default = 40 loss given default is 104.34 40 = 64.34
The present value of this loss is 54.01 The expected loss is 54.01Q
Total expected loss = 288.48Q = 8.75 Value for Q of 8.75/288.48 = 3.03%
With several bonds, several parameters describe the term structure of default probabilities
Suppose we have bonds maturing in 3, 5, 7, and 10 years
We could use the rst bond to estimate a default probability per year for the rst 3 years, the
second bond to estimate default probability per year for years 4 and 5, . . .
Approach analogous to bootstrap procedure for calculating a zero-coupon yield curve
The risk-free rate
Key issue when bond prices used to estimate default probabilities: risk-free rate/risk-free bond
In Eq. (2), spread s = excess of corporate bond yield over yield on similar risk-free bond
In table above, risk-free value of the bond must be calculated using risk-free discount rate
Benchmark risk-free rate for corporate bond yields = yield on similar Treasury bonds
Traders usually use LIBOR/swap rates as proxies for risk-free rates when valuing derivatives
Traders also use LIBOR/swap rates as risk-free rates when calculating default probabilities
When they determine default probabilities from bond prices, the spread s in Eq. (2) is the
spread of the bond yield over the LIBOR/swap rate
Risk-free discount rates used in calculations in table above are LIBOR/swap zero rates
Credit default swaps can be used to imply the risk-free rate assumed by traders
Implied rate LIBOR/swap rate minus 10 basis points on average
98
Asset swaps
In practice, traders use asset swap spreads to extract default probabilities from bond prices
Asset swap spreads provide direct estimate of bond yields spread over LIBOR/swap curve
E.g., asset swap spread for a bond is quoted as 150 basis points. Three possible situations:
1. The bond sells for its par value of 100
The swap then involves one side (company A) paying the coupon on the bond and the
other side (company B) paying LIBOR plus 150 basis points
Only coupons are exchanged; Exchanges take place regardless of whether bond defaults
2. The bond sells below its par value, say, for 95
In addition to coupons, company A pays $5 per $100 of notional principal at outset
Company B pays LIBOR plus 150 basis points
3. The underlying bond sells above par, say, for 108
In addition to LIBOR plus 150 bp, company B pays $8 per $100 of principal at outset
Company A pays the coupons
The eect of all this is that the PV of the asset swap spread is the amount by which the price of
the corporate bond is exceeded by the price of a similar risk-free bond where the risk-free rate is
assumed to be given by the LIBOR/swap curve
Suppose that instead of knowing bonds price we know that asset swap spread is 150 basis points
This means that the amount by which the value of the risk-free bond exceeds the value of the
corporate bond is the present value of 150 basis points per year for 5 years
Assuming semiannual payments, this is $6.55 per $100 of principal
The total loss in the table above would in this case be set equal to $6.55
The default probability Q per year would be 6.55/288.48 = 2.27%
Comparison of default probability estimates
Default probabilities estimated from historical data those derived from bond prices
Default intensities from historical data [Eq. (1)]: Default intensities from bond prices [Eq. (2)]:
(t) =
1
t
ln[1 Q(t)]
(t) =
s
1 R
Ratio of default probability backed out from bond prices to default probability from historical data
is very high for investment grade companies and declines as a companys credit rating declines
Dierence between the two default probabilities tends to increase as the credit rating declines
Excess return
The expected excess return on bond (in basis points) is the bond yield spread over Treasuries
minus the spread of risk-free rate over Treasuries minus the spread for historical default
A large percentage dierence between default probability estimates translates into a small (but
signicant) excess return on the bond
The excess return tends to increase as credit quality declines
Real-world vs. risk-neutral probabilities
The default probabilities implied from bond yields are risk-neutral probabilities of default
The calculations assume that expected default losses can be discounted at the risk-free rate
The risk-neutral valuation principle shows that this is a valid procedure providing the expected
losses are calculated in a risk-neutral world
The default probability Q in the table above is a risk-neutral probability
By contrast, default probabilities from historical data are real-world default probabilities
Expected excess return arises from dierence real-world/risk-neutral default probabilities
If there were no expected excess return, then real-world = risk-neutral default probabilities
Why do we see such big dierences between real-world/risk-neutral default probabilities?
One reason for the results is that corporate bonds are relatively illiquid and the returns on bonds
are higher than they would otherwise be to compensate for this
But this is small part of what is going on: Explains 25 basis points of excess return
99
Other reason: The subjective default probabilities of bond traders are much higher
Bond traders may be allowing for depression scenarios much worse than anything seen earlier
However, dicult to see how this explains a large part of the observed excess return
By far, most important reason for results: Bonds do not default independently of each other
There are periods of time when default rates are very low and periods when very high
The year-to-year variation in default rates gives rise to systematic risk (i.e., risk that cannot
be diversied away) and bond traders earn an excess expected return for bearing the risk
Variation in default rates from year to year stems from overall economic conditions and because
a default by one company has ripple eect Defaults by other companies (credit contagion)
In addition to systematic risk, nonsystematic (idiosyncratic) risk associated with each bond
Bond returns are highly skewed with limited upside. This type of risk is dicult to diversify
It would require tens of thousands of dierent bonds
In practice, many bond portfolios are far from fully diversied
Bond traders earn extra return for bearing nonsystematic risk and systematic risk
When to use real-world or risk-neutral default probabilities in the analysis of credit risk?
When valuing credit derivatives/estimating impact of default risk, risk-neutral default prob-
abilities should be used because the analysis calculates the PV of expected future CFs and
almost invariably (implicitly or explicitly) involves using risk-neutral valuation
For scenario analyses to calculate potential losses from defaults, use real-world probabilities
Using equity prices to estimate default probabilities
Credit ratings are revised relatively infrequently Some analysts argue that equity prices can provide
more up-to-date information for estimating default probabilities
Merton model
In Mertons model, a companys equity is an option on the assets of the company
Suppose rm has one zero-coupon bond outstanding and that bond matures at T. Dene:
V
0
: Value of companys assets today D: Debt repayment due at time T
V
T
: Value of companys assets at time T
V
: Volatility of assets (assumed constant)
E
0
: Value of companys equity today
E
: Instantaneous volatility of equity
E
T
: Value of companys equity at time T
If V
T
< D, rational for company to default on debt at time T Value of equity = zero
If V
T
> D, company should make debt repayment at time T Value of equity at T = V
T
D
Mertons model gives the value of the rms equity at time T as: E
T
= max(V
T
D, 0)
Equity = call option on assets value with strike price equal to repayment required on debt
The Black-Scholes formula gives the value of the equity today as:
E
0
= V
0
N(d
1
) De
rT
N(d
2
) with d
1
=
ln V
0
/D+(r+
2
V
/2)T
T
, d
2
= d
1
T (3)
The value of the debt today is V
0
E
0
The risk-neutral probability that the company will default on the debt is N(d
2
)
However, neither V
0
nor
V
is directly observable
If the company is publicly traded, we can observe E
0
Eq. (3) provides one condition that must be satised by V
0
and
V
We can also estimate
E
from historical data or options
From Itos lemma,
E
E
0
=
E
V
V
V
0
E
E
0
= N(d
1
)
V
V
0
(4)
Eqs. (3) and (4) provide a pair of simultaneous equations that can be solved for V
0
and
V
How well do default probabilities from Mertons model correspond to actual default experience?
They produce a good ranking of default probabilities (risk-neutral or real-world)
Monotonic transformation can be used to convert probability of default output from Mertons model
into good estimate of either real-world/risk-neutral default probability
100
Altman: Measuring corporate bond mortality and performance
Introduction
In addition to default risk, investors also consider the eects of the two other major risk dimensions of
investing in xed-interest instruments: (i) Interest rate risk and, (ii) Liquidity risk
The appropriate measure of default risk and the accuracy of its measurement are critical in:
The pricing of debt instruments
The measurement of their performance
The assessment of market eciency
Analysts have concentrated their eorts on measuring the default rate for nite periods of time (e.g.,
one year) and then averaging the annual rates for longer periods
The rate of default has been measured simply as the value of defaulting issues for some specic
population of debt compared with the value of bonds outstanding that could have defaulted
Annual default rates are then usually compared with observed promised yield spreads in order to
assess the attractiveness of particular bonds or classes of bonds
A corollary approach is to compare default rates with ex-post returns to assess whether investors
were compensated for the risks they bear
Although it is informative to measure default rates and losses based on the average annual rate method,
that traditional technique has at least two deciencies:
1. It fails to consider that there are other ways in which a bond dies, namely redemptions from calls,
sinking funds, and maturation Fails to consider the surviving population of bonds
2. It does not answer the question of the probability of default for various time periods in the future
on the basis of an issues specic attributes at issuance, summarized into its bond rating
This study does explicitly consider the surviving population as the relevant basis or denominator in the
default calculation and addresses the initial default assessment by the following questions
Given an issues initial bond rating:
1. What is the probability of default/loss from default over a specic time horizon of N years?
2. What are the estimates of the cumulative annual mortality rates and losses for various time frames
as well as the marginal rates for specic one-year periods?
3. Given estimates of cumulative mortality losses suered by investors and expected return spreads
earned on the surviving population of bonds, what were the net return spreads earned or lost in
comparison with returns on risk-free securities?
Prior studies
Previous works in the area of default were of three general kinds:
1. Hickman-style (1958) reports
Presents statistics on annual default rates/actual returns over various time frames
2. Studies which emphasize default risk potential of individual-company debt by examining determi-
nants of risk premiums over risk-free securities, or by constructing uni-/multivariate classication
models based on combination of micro-nance measures/statistical classication techniques
Variants on those models were based on the gamblers ruin concept, recursive partitioning
techniques, and market indicators of survival
3. Fons study
Attempts to combine observed pricing and the inherent default risk premium with estimates
of corporate bond default experience
Incorporates default experience with risk-neutral investment strategy: The only factor that
matters is the return distribution of debt with no relevance for volatility or liquidity factors
Traditional measures of default rates and losses
Main distinction: Between so-called investment-grade and non-investment-grade categories
More precise ratings: 4 classes of investment-grade debt, 3 classes of lower quality junk bonds
Default rates are calculated on average annual basis, with individual rates for each year combined with
rates for other years, over longer time horizon to form estimate for the average annual rate
101
The rate for each year is based on the dollar amount of defaulting issues in that year divided by
the total population outstanding as of some point during that year
Average annual default rate, measured in such way, for period 1978-87 was 1.86% per year
Default losses
More relevant default statistic for investors: Not the rate of default but amount lost from defaults
E.g., measure amount lost by tracking price for the defaulting issue just after default and
assuming that investor had purchased issue at par value and sold just after default
The investor also is assumed to lose one coupon payment
Average annual default loss over the sample period has been approximately 1.2% per year
That lower percentage of loss compared with default rates stems from the fact that defaulting
debt, on average, sells for approximately 40% of par at the end of the defaulting month
The mortality rate concept
Begin with specic cohort (rating category) and track groups performance for multiple time periods
Consider mortalities in relation to survival population and input defaults to calculate mortality rates
Bonds can exit from the original population by means of at least four dierent events:
(i) Defaults, (ii) Calls, (iii) Sinking funds, and (iv) Maturities
The individual mortality rate for each year (marginal mortality rate = MMR) is calculated by:
MMR
(t)
=
Total value of defaulting debt in the year (t)
Total value of the population of bonds at the start of the year (t)
We then measure the cumulative mortality rate (CMR) over a specic time period (1, 2, , T years)
by subtracting the product of the surviving populations of each of the previous years from unity:
CMT
(T)
= 1
T
t=1
SR
(t)
= 1
T
t=1
(1 MMR
(t)
)
Where CMT
(T)
= Cumulative mortality rate in (T)
SR
(t)
= Survival rate in (t) = 1 MMR
(t)
The individual year marginal mortality rates for each bond rating are based on a compilation of that
years mortality measured from issuance
E.g., all one-year mortalities combined for 17-year sample period to get the one-year rate
All of the second-year mortalities are combined to get the two-year rate, etc . . .
Mortality rate = value-weighted rate for particular year after issuance, not simple average
Simple average Results susceptible to signicant specic-year bias
Weighted-average correctly biases results toward larger-issue years, especially more recent years
Empirical results
Mortality rates
The relative results across cohort groups are pretty much in line with expectations, with the
mortality rates very low for the higher-rated bonds and increasing for lower rated issues
AAA debt: Zero mortality rate for rst 5 yrs after issuance, 0.13% from 6-10 yrs
The mortality rates for BBB and lower bonds begin to increase almost immediately after
issuance, with BBB (the lowest investment-grade debt level) showing a cumulative rate of
0.91% after ve years and 2.12% after ten years
The marginal mortality rates are fairly constant after year three
The longer term mortality results should be analyzed with considerable caution with respect to
expectations about future mortality rates and return spreads because of thin volume of data
In addition, later years biased since portion of original population redeemed by then
The traditional default rates are calculated on the basis of the population on June 30, while our
mortality rates use survival population data from the start of each year
The old way probably understates default rates somewhat
102
Since we adjust population for all redemptions, mortality rates higher than if data unadjusted
Both the adjusted and unadjusted methods of calculating the results are meaningful
Mortality gures over time should adjust for changing population size, while unadjusted data
helpful for probability of default of specic rating from a given years issuance
Strictly speaking, however, the unadjusted gures are not rates
Losses
The loss to investors from defaults is of paramount importance
In analysis of net return spreads, we use the actual recovery amount for which investors were able
to sell the defaulting issue and also assume that one coupon payment was lost
The average recovery rate was slightly below 40% of par
We did look at the relation between individual bond ratings at issuance and the subsequent average
price that could be realized upon default and found essentially that no relationship existed
Virtually no correlation between initial bond rating and average price after default
No correlation between price after default and number of years bond in existence before default
While the marginal default rate is relatively low in the rst three years after issuance, the
recovery rate is unaected by the age of the issue
Net return performance
Analysis tracks performance of bonds from issuance, across ratings and over relevant time horizons
Compare performance of various risky bond categories with default risk-free US Treasuries
Factor into the analysis actual losses from defaults and yield spreads over Treasuries
We calculate actual return-spread performance
The spreads, expressed in terms of basis points compounded over a ten-year investment horizon,
are based on actual yield spreads for the 18-year period
Results: AAA bonds can be expected to earn 45 basis points (0.45%) more than Treasuries
over one year (two semiannual coupon payments) and 1245 basis points after ten years
BB bonds earn 326 basis points more than Treasuries after one year and 7637 basis points after
10 years. $100 would return $76.37 more than Treasuries over 10 years
Assumptions
These results use actual long-term Treasury coupon rates, yield spreads at birth for the dif-
ferent rating categories, the sale of defaulted debt, the loss of one coupon payment, and the
reinvestment of cash ows at the then prevailing interest rates for that bond-rating group
Cash ows are reinvested from coupon payments on the surviving population as well as the
reinvestment of sinking funds, calls, and the recovery from defaulted debt
The results assume no capital gains or losses over the measurement period, and the investor
follows a buy-and-hold strategy for the various horizons
For all holding periods, all bond types do well and have positive spreads over Treasuries
Average historical yield spreads ranged from 0.47% (AAA) to 3.05% (BB) to 7.07% (CCC)
Historical average 4.09% yield spread for B-rated debt provides an ample cushion to compensate
for losses, but performance relative to BB category is inferior in later years
This changes, however, if we adjust our initial yield spread assumptions to reect dierent
market conditions, assuming the same default experience
Implications
Despite higher than expected cumulative mortality rates over long holding periods, return spreads on
all corporate bonds are positive, with impressive results for high-yield, low-grade categories
Investors have been more than satisfactorily compensated for investing in high-risk securities
Indeed, if expected default losses are fully discounted in the prices (and yields) of securities, our
return spread results should be insignicantly dierent from zero
Possible explanation: Fixed-income market has been mispricing corporate debt issues and discrepancy
has persisted, perhaps because of lack of appropriate info Market ineciency
If default losses are consistently lower than yield spreads and this comparison is the only relevant
determinant of future yield spreads, ineciency is a reasonable conclusion
103
If all other things are not equal, however, for determining yield spreads on corporate bonds, then
the market ineciency conclusion is dicult to reach
Liquidity risk is often mentioned as important to price determination
If liquidity risk increases with lower bond ratings, then the excess returns noted earlier may in part
be the returns necessary to bear this risk
Indeed, during post-October 19, 1987 period, poor liquidity was cited as one cause of precipitous
drop in common stock prices and the rise in yields of certain high-yield debt issues
The other risk element that is not isolated in our study is interest rate or reinvestment risk
Actual returns on bonds are obviously aected by interest rate changes
Our results include actual reinvestment rates over time, and we have not factored in any capital
gains or losses, assuming a buy-and-hold strategy for investors
However, lower grade bonds have lower volatility from interest rate changes than Treasuries
Another explanation of the persistent positive return spreads attributed to lower rated bonds is the
variability of retention values after default
Our 40% recovery rate of par value just after default is an expected value
Investors might require positive spreads based on possibility that retention values < 40%
In addition, the 40% retention is relevant only for a portfolio of defaulting bonds
Investor not well-diversied is vulnerable to higher than avg. mortality losses on specic issues
If the market prices low-quality issues as individual investments and not as portfolios, required
spreads are likely to be higher than is perhaps necessary
On the other hand, if defaults are correlated with market returns, risks may not be as diversiable
as we assume to be the case for equities
Investors might also be restricted in relation to the risk class of possible investments, thereby creating
an articial barrier to supply-demand equilibrium
For instance, certain institutions are prohibited from investing in low-grade bonds or are limited
in the amount that they can invest in such securities
That reduces demand and inates yield and possibly return spreads
104
Cummins: CAT Bonds and other risk-linked securities
Introduction
Risk-linked securities are nancing devices that enable insurance risk to be sold in capital markets,
raising funds that insurers/reinsurers can use to pay claims arising from CAT/other loss events
The most prominent type of risk-linked security is the catastrophic risk (CAT) bond, which is a fully
collateralized instrument that pays o on the occurrence of a dened catastrophic event
CAT bonds and other risk-linked securities are potentially quite important because they have the ability
to access the capital markets to provide capacity for insurance and reinsurance markets
CAT bond market has expanded signicantly in recent years and seems to have reached critical mass
Although the CAT bond market is small in comparison with the overall nonlife reinsurance market, it
is of signicant size in comparison with the property-catastrophe reinsurance market
The structure of risk-linked securities
Early developments
Following Hurricane Andrew in 1992, eorts began to access securities markets directly as a mech-
anism for nancing future catastrophic events
First contracts launched by the Chicago Board of Trade (CBOT), which introduced catastrophe
futures in 1992 and later introduced catastrophe put and call options
Options based on aggregate CAT loss indices compiled by Property Claims Services (PCS)
The contracts were later withdrawn due to lack of trading volume
In 1997, Bermuda Commodities Exchange (BCE) also tried to develop a market in CAT op-
tions, but contracts were withdrawn within 2 years (lack of trading)
Insurers had little interest in the CBOT and BCE contracts for various reasons:
Thinness of the market
Possible counterparty risk on the occurrence of a major catastrophe
Potential for disrupting long-term relationships with reinsurers
Possibility of excessive basis risk, i.e., the risk that payos under the contracts would be
insuciently correlated with insurer losses
In 2007, the Chicago Mercantile Exchange (CME) and the New York Mercantile Exchange (NYMEX)
introduced futures and options contracts on US hurricane risk
Their introduction was motivated by the 2005 US hurricane season, which revealed the limi-
tations on the capacity of insurance and reinsurance markets
CME contracts settle on Carvill (reinsurance intermediary) Hurricane Indices. NYMEX con-
tracts settle on CAT loss indices by Gallagher Re based on PCS data
Given that both the CME and NYMEX contracts are based on broadly dened geographical
areas, they will be subject to signicant basis risk
Given the existence of a secondary market as well as dedicated CAT bond mutual funds, it is
possible that the CME or NYMEX contracts could be used for hedging purposes
Another early attempt at securitization involved contingent notes aka Act of God bonds
In 1995, Nationwide issued $400 million in contingent notes through a special trust
Proceeds from the sale of the bonds were invested in 10-year Treasury securities, and investors
were provided with a coupon payment equal to 220 basis points over Treasuries
Embedded in these contingent notes was a substitutability option for Nationwide: Given a
pre-specied event that depleted Nationwides equity capital, Nationwide could substitute up
to $400 million of surplus notes for Treasuries in the Trust at any time during a 10-year period
for any business reason, with surplus notes carrying a coupon of 9.22%
Structure did not achieve signicant segregation of Nationwides liabilities, leaving investors
exposed to general business risk and to risk that Nationwide might default on the notes
In addition, unlike CAT bonds, the withdrawal of funds from the trust would create the
obligation for Nationwide eventually to repay the Trust
Contingent notes have not emerged as a major solution to the risk-nancing problem
105
CAT bonds
Introduction to CAT Bonds
CAT bonds modeled on asset-backed-security transactions executed for a wide variety of -
nancial assets including mortgage loans, automobile loans, aircraft leases, and student loans
CAT bonds are part of event-linked bonds, which pay o on occurrence of a specied event
Most event-linked bonds issued to date have been linked to catastrophes such as hurricanes/earthquakes,
although bonds also have been issued that respond to mortality events
First successful CAT bond was $85 million issue by Hannover Re in 1994 (Swiss Re, 2001)
The rst CAT bond issued by a nonnancial rm, occurring in 1999, covered earthquake losses
in the Tokyo region for Oriental Land Company, the owner of Tokyo Disneyland
More recently CAT bonds more standardized Driven by need for bonds to respond to
requirements of principal stakeholders (sponsors, investors, rating agencies, and regulators)
Properties of CAT Bonds
CAT bonds often issued to cover the high layers of reinsurance (return period of 100+ years)
Higher layers of protection often go unreinsured by ceding companies for two reasons:
1. For huge events, ceding insurers more concerned about credit risk of reinsurer
2. High layers have highest reinsurance margins/pricing spreads above expected loss
CAT bonds are fully collateralized Eliminate concerns about credit risk
CATs have low correlations with investment returns CAT bonds may provide lower spreads
than high-layer reinsurance because attractive for diversication
CAT bonds also can lock in multi-year protection, unlike traditional reinsurance, and shelter
the sponsor from cyclical price uctuations in the reinsurance market
The multi-year terms (or tenors) of most CAT bonds also allow sponsors to spread the xed
costs of issuing the bonds over a multi-year period, reducing costs on an annualized basis
Typical CAT bond structure
Transaction begins with formation of single purpose reinsurer (SPR): SPR issues bonds and
invests proceeds in safe, short-term securities (gov. bonds, AAA corp.), held in trust
Embedded in the bonds is a call option triggered by a dened CAT event: On occurrence of
event, proceeds are released from SPR to pay claims arising from the event
In most CAT bonds, principal fully at risk. In return, insurer pays premium to investors
Fixed returns on securities held in trust are swapped for oating returns based on LIBOR
Immunize insurer/investors from interest rate (mark-to-market) risk and default risk
The investors receive LIBOR plus the risk premium in return for providing capital to the trust.
If no contingent event occurs, principal returned to investors upon expiration
Some CAT bonds include principal protected tranches, with guaranteed return of principal
In this tranche, the triggering event would aect the interest and spread payments and the
timing of the repayment of principal
E.g., a 2-year CAT bond subject to the payment of interest and a spread premium might
convert into a 10-year zero-coupon bond that would return only the principal
Principal-protected tranches have become relatively rare, primarily because they do not provide
as much risk capital to the sponsor as a principal-at-risk bond
Insurers and investores prefer to use a SPR
Insurers: To capture tax/accounting benets associated with traditional reinsurance
Investors: To isolate the risk of their investment from the general business and insolvency risks
of the insurer, thus creating an investment that is a pure play in catastrophic risk
Bonds are fully collateralized, with collateral in trust, insulating investors from credit risk
The issuer of the securitization can realize lower nancing costs through segregation
The transaction also is more transparent than a debt issue by the insurer, because the funds
are held in trust and are released according to carefully dened criteria
The bonds are attractive to investors because catastrophic events have low correlations with returns
from securities markets and hence are valuable for diversication purposes
106
Types of triggers
CAT bonds/other ILS structured to pay o on three types of triggering variables:
1. Indemnity triggers: Payouts based on size of the sponsoring insurers actual losses
2. Index triggers: Payouts based on an index not directly tied to sponsors losses
3. Hybrid triggers, which blend more than one trigger in a single bond
There are three broad types indices that can be used as CAT bond triggers:
(i) Industry loss indices, (ii) Modeled loss indices, and (iii) Parametric indices
With industry loss indices, the payo on the bond is triggered when estimated industry-
wide losses from an event exceed a specied threshold
A modeled-loss index is calculated using a model provided by one of the major catastrophe-
modeling rms (actual events physical parameters are used in running simulations)
With a parametric trigger, the bond payo is triggered by specied physical measures of
the catastrophic event such as the wind speed and location of a hurricane
Number of factors to consider in the choice of a trigger when designing a CAT bond: Trade-o
between moral hazard (transparency to investors) and basis risk
Indemnity triggers
Often favored by insurers/reinsurers because they minimize basis risk (risk that loss payout of
bond will be greater or less than sponsors actual losses)
However, indemnity triggers require investors to obtain info on risk exposure of sponsors
underwriting portfolio (dicult, especially for complex commercial risks)
Disadvantage to sponsor: Require disclosure of condential info on sponsors portfolio
Contracts based on indemnity triggers may require more time than nonindemnity triggers to
reach nal settlement because of the length of the loss adjustment process
Index triggers
Index triggers tend to be favored by investors because they minimize the problem of moral
hazard: I.e., they maximize the transparency of the transaction
Moral hazard occurs if issuer fails to settle CAT losses carefully/appropriately
Insurer might also excessively expand its premium writings in geographical areas covered
Although CAT bonds almost always contain copayment provisions to control moral hazard,
moral hazard remains a residual concern for some investors
Indices measurable more quickly after event Sponsor receives quicker payment
Disadvantage of index triggers: Expose sponsor to higher degree of basis risk
Degree of basis risk varies depending upon several factors. Parametric triggers tend to have
the lowest exposure to moral hazard but may have the highest exposure to basis risk
However, even with parametric trigger, basis risk can be reduced substantially by appropriately
dening location where event severity is measured. Similarly, industry loss indices based on
narrowly dened geographical areas have less basis risk than those based on wider areas
Modeled-loss indices may become the favored mechanism for obtaining benets of index trigger
without signicant basis risk. However, modeled-loss indices subject to model risk, dimin-
ishing over time as modeling rms rene their models
Sidecars
Sidecars are special purpose vehicles formed by insurance and reinsurance companies to provide
additional capacity to write reinsurance, usually for property catastrophes and marine risks, and
typically serve to accept retrocessions exclusively from a single reinsurer
Sidecars are typically o-balance sheet, formed to write specic types of reinsurance such as
property-catastrophe quota share or excess of loss, and generally have limited lifetimes
Reinsurers receive override commissions for premiums ceded to sidecars
Sidecars capitalized by private investors (hedge funds), but insurers/reinsurers also participate
Sidecars receive premiums for reinsurance underwritten and pay claims under contracts terms
Sidecars also enable sponsoring reinsurer to move some risks o balance sheet, improving leverage
Sidecars can also be formed quickly and with minimal documentation/administrative costs
107
Catastrophic Equity Puts (Cat-E-Puts)
Unlike CAT bonds, Cat-E-Puts are not asset-backed securities but options
In return for a premium paid to the writer of the option, the insurer obtains the option to issue
preferred stock at a preagreed price on the occurrence of a contingent event
Enables insurer to raise equity capital at good price after a CAT, when stock price depressed
Cat-E-Puts have lower transactions costs than CAT bonds because no need to set up an SPR
However, they are not collateralized Expose insurer to counterparty performance risk
In addition, issuing the preferred stock can dilute the value of the rms existing shares
Thus, although Cat-E-Puts have been issued, they are not nearly as important as CAT bonds
Catastrophe risk swaps
Catastrophe risk swaps are not prefunded but rely on agreement between two counterparties
Catastrophe swaps executed between two rms with exposure to dierent types of CAT risk
In some instances, a reinsurer may serve as an intermediary between the swap partners, but in
most instances CAT swaps are done directly between two (re)insurers
Swaps are facilitated by the Catastrophic Risk Exchange (CATEX), a web-based exchange where
insurers and reinsurers can arrange reinsurance contracts and swap transactions
Swap agreement
Event(s) that trigger payment under swap are carefully dened in the agreement
The swap can be designed such that the two sides of the risk achieve parity, i.e., such that the
expected losses under the two sides of the swap are equivalent
Swap denes amount to be paid if event occurs. Some contracts have sliding scale payos
Swaps can be annual or can span several years
Swaps also can be executed that fund multiple risks simultaneously
Swaps may be attractive substitutes for reinsurance/CAT bonds/other risk nancing devices:
Advantage that the reinsurer simultaneously lays of some of its core risk and obtains a new
source of diversication by exchanging uncorrelated risks with counterparty
Swaps may enable reinsurers to operate with less equity capital
Swaps also are characterized by low transactions costs and reduce current expenses because
no money changes hands until the occurrence of a triggering event
The potential disadvantages of swaps are that modeling the risks to achieve parity can be
challenging and is not necessarily completely accurate
Swaps also may create more exposure to basis risk than some other types of contracts and also
create exposure to counter-party nonperformance risk
Industry Loss Warranties (ILW)
Possible impediment to growth of CAT securitization market has to do with whether securities are
treated as reinsurance by regulators, and given favorable regulatory accounting treatment:
Properly structured indemnity CAT securities (those that pay o based on the losses of the
issuing insurer) will be treated as reinsurance
Sponsors and their bankers have found various ways to nesse potential regulatory problems:
E.g., even if the SPV is an oshore vehicle, the trust holding the assets can be onshore,
mitigating regulatory concerns regarding credit risk of oshore entities
Dual-trigger contracts, aka ILW, also overcome regulatory objections to nonindemnity bonds:
ILWs: Dual-trigger reinsurance contracts with retention trigger based on incurred losses of
insurer buying the contract and also a warranty trigger based on an industry-wide loss index
Both triggers have to be hit in order for the buyer of the contract to receive a payo
ILWs cover events from specied catastrophe perils in a dened geographical region
The term of the contact is typically 1 year
ILWs may have binary triggers, where full amount pays o once the two triggers are satised
or pro rata triggers where payo depends upon how much loss exceeds warranty
The principal advantages of ILWs are that they are treated as reinsurance for regulatory purposes,
and that they can be used to plug gaps in reinsurance programs
108
The ILW market is roughly of the same order of magnitude as the CAT bond market
Capital market participants provide majority of risk capital in ILW/CAT bonds markets
ILWs can be packaged and securitized, broadening the investor base
The risk-linked securities market
The CAT bond market: Size and bond characteristics
Although the CAT bond market seemed to get o to a slow start in late 1990s, the market has
matured and now has become a steady source of capacity for both primary insurers/reinsurers:
Market growing steadily. New records for market issuance volume in 2005, 2006, and 2007
CAT bonds make sound economic sense as a mechanism for funding mega-catastrophes
$100B < 0.5% of US securities markets value Easily absorbed with securitized transactions
Securities markets more ecient in reducing info asymmetries/facilitating price discovery
Recently, event-linked bonds have also been issued to cover third-party commercial liability, auto-
mobile quota share, and indemnity-based trade credit reinsurance
The amount of risk capital outstanding in CAT bond markets has also grown steadily
Risk-capital outstanding represents the face value of all bonds still in eect in each year
Characteristics of CAT bonds continue to evolve. Overall trend is toward higher standardization
Between 2000 and 2006, index or hybrid bonds accounted for 80% of total issue volume
Leading type of index by volume is the parametric index (34% of total issuance)
Market has converged on shorter-term issues, with mostly 3-yr bonds
Maturities greater than 1 year favored because they provide a steady source of risk capital that
is insulated from year-to-year swings in reinsurance prices and because they permit issuers to
amortize costs of issuance over a longer period, reducing per period transactions costs
Bonds > 5 years not favored: Market participants want to reprice risk periodically to reect
new info on frequency/severity of CATs and to recognize changes in u/w risk prole of sponsor
2000-06: Insurers account for 47.9% of bonds by issue volume, reinsurers for 47.5%
In 2006, the rst government issued disaster-relief bond placement was executed to provide funds
to the government of Mexico to defray costs of disaster recovery
Such bonds illustrate how securitization can be used by governments to prefund disaster relief
programs, rather than waiting for disaster relief from donor countries ex post
Obtaining a nancial rating is a critical step in issuing a CAT bond
Buyers use ratings to compare yields on CAT bonds with other corporate securities
Almost all bonds are issued with nancial ratings
The vast majority of CAT bonds issued in 2005 and 2006 have been below investment grade
(ratings below BBB). In 2007, there has been a resurgence in investment-grade bonds, although
the majority of CAT bonds are below investment grade in 2007 as well
Although lower than investment grade ratings are generally bad news for insurers/reinsurers
and other corporate bond issuers, they are not necessarily adverse in CAT bond market
Because CAT bonds are fully collateralized, CAT bond ratings tend to be determined by
the probability that the bond principal will be hit by a triggering event
Bond ratings indicate layer of CAT-risk coverage being provided by the bonds
The modeling rms analysis drives the price more than the actual rating
There is broad market interest in CAT bonds among institutional investors
1999: Insurers/reinsurers among leading investors in the bonds, 55% of the market
By 2007, insurers/reinsurers = 7% of demand External capital attracted to market
Dedicated CAT funds = 55% of market in 2007, money managers/hedge funds = 36%
The declining spreads and increasingly broad market interest suggest that bonds are attractive
to investors and are playing an increasingly important role vs. conventional reinsurance
In addition to CAT bonds, signicant amount of new capital raised through sidecars in 05 and 06
Indication that sidecars were competing with CAT bonds for risk capital of interested investors
in 2005, leading to rising prices and tightening capacity in the CAT bond market
109
The rst publicly acknowledged total loss of principal for a CAT bond took place in 2005
Short-term impact of wipeout: Increase investor wariness of indemnity-based transactions
Indemnity transactions rebounded in 07 due to surge of primary insurer CAT bond issues
Longer-term impact of the KAMP Re wipeout on CAT bond market is favorable
Smooth settlement of KAMP Re bond established important precedent, showing that CAT
bonds function well, with minimal confusion and controversy between sponsor/investors
CAT Bond prices
CAT bonds are priced at spreads over LIBOR, meaning that investors receive oating interest plus
a spread or premium over the oating rate
In the past, CAT bonds have been somewhat notorious for having high spreads
However, there are now signicant indications that the spreads are not as high as they might
seem relative to the cost of reinsurance, such that CAT bonds are more competitive
Because CAT bonds are not publicly traded, dicult to obtain data on CAT bond yields
However, there is an active (nonpublic) secondary market that provides guidance on yields
Prior to Katrina, there was a more or less steady decline in yields and a slight increase in the
expected loss, implying a general decline in the cost of nancing through CAT bonds
The ratio of the premium to expected loss was about six in early 2001
However, ratio of premium to expected loss steadyly declined to 2.1 for 2001-05
Not surprisingly, yields and spreads increased following Katrina as market tightened and in-
vestors had opportunities to place capital in other CAT-risk vehicles such as sidecars
The CAT bond market was able to withstand the post-Katrina competition for capital without
returning to the high relative spreads of earlier periods
This is the expected result in a market where there is growing investor interest and expertise
as well as growing volume, which adds to market liquidity
Comparison of CAT bond and catastrophe reinsurance pricing is dicult because of the general
lack of systematic data on reinsurance prices
The rate on line (ROL) is dened as the reinsurance premium divided by the policy limit
The loss on line (LOL) is the expected loss on the contract divided by the policy limit
Ratio ROL-to-LOL is analogous to ratio of yield/expected loss on CAT bonds
Like the CAT bond yield to ELR, the ratios of ROL to expected LOL are signicantly higher
in 2006 than in 2005, reecting the eects of Hurricanes Katrina, Rita, and Wilma
ROL-to-LOL ratios are signicantly larger for national insurers than for regional insurers
Finally, the ratios are lower for contracts with higher expected LOL, because policies with low
expected LOL are covering the more risky upper tails of the loss distribution
CAT bonds on average tend to have expected losses of between 1-3% of principal, and thus are
most comparable to catastrophe reinsurance contracts with relatively low LOLs
CAT bonds are in the ballpark in terms of pricing for national companies
CAT bonds do not appear to be expensive relative to catastrophe reinsurance
Moreover, investment banks have reduced transactions costs/time to market as they have
gained experience with ILS Bonds more attractive to insurers/reinsurers
CAT bond prices look less attractive relative to reinsurance for regional companies
However, because regional rms have not been active in the CAT bond market, it is not
clear what the bond premia would be for these rms
Another comparative indication of trends in CAT bond spreads is provided by a comparison of the
Mexican CAT bonds with previously issued earthquake bonds
The spreads on the Mexican bonds are very low in comparison to the prior bonds
Two phenomena (not precisely separated) inuence spreads:
1. The Mexican bonds are more recent and CAT bond spreads have been declining
2. The Mexican bonds are valuable to CAT bond investors for diversication purposes because
they cover a previously unsecuritized area of the world and permit investors to diversify
their current large proportionate exposure to US hurricane risk
110
Relevant to compare CAT bond yields relative to yields on comparably rated corporate bonds
BB CAT bond yields comparable to yields on BB corporate bond for 2001-05 (until Katrina)
At the peak, yields on CAT bonds were 2-3% higher than the yields on BB corporates
Nevertheless, considering reinsurance prices in 06 and uncertainty created by Katrina/recent
CATs, the CAT bond market has weathered the storms very well
Regulatory, Accounting, Tax (RAT), and rating issues
Regulatory and accounting issues do not pose a material impediment to the growth of the market at
the present time, and the statistics on market size and growth clearly seem to bear this out
Regulatory issues
Some commentators have argued that CAT bonds have mostly been issued o-shore for regulatory
reasons and that lack of onshore issuance represents a barrier to market developments
Encouraging onshore issuance might reduce transactions costs/facilitate market growth
However, oshore jurisdictions provide low issuance costs/high levels of expertise in issuing ILS
Transactions costs for onshore CAT bonds generally higher than for oshore issues
O-shore jurisdictions perform very eectively/eciently in issuing/settling ILS
Nonindemnity CAT bonds currently face uncertain prospects with respect to regulatory treatment
Regulators concerned about basis risk and use of securitized risk instruments to speculate
Denied reinsurance accounting for nonindemnity CAT bonds impedes market development
However, other industry experts indicate that regulatory treatment does not presently pose a
signicant obstacle to market development
Market participants have found a variety of structuring mechanisms to blunt regulatory con-
cerns about alternative risk nancing with respect to nonindemnity CAT bonds
E.g., contracts can be structured to pay o on narrowly dened geographical indices or com-
binations of indices that are highly correlated with the insurers losses
Concerns about speculative investing can be addressed through dual-trigger contracts that pay
o on an index but where the insurer cannot collect more than its ultimate net loss
In the area of risk-linked securities, it would be helpful if regulators were to codify the rules
and regulations relating to the SAP treatment of various types of risk-linked securities and avoid
imposing any unnecessary regulatory impediments in the future
Tax issues
Oshore CAT bonds do not create taxation problems for sponsors
No income/corporate/withholding taxes in oshore jurisdictions apply to CAT bonds
The bonds SPRs are also not taxable for US federal income tax purposes
Main tax issue for US investors: Treatment of bond premia under US tax law
Tax Code/IRS do not address tax treatment of income received from CAT bonds
Bonds are presently being treated as passive foreign investment companies (PFICs) Income
from CAT bonds included in taxable income as dividends rather than interest
US sponsors have been deducting premium payments on oshore bonds for income tax pur-
poses, i.e., bond interest is currently treated similarly to reinsurance premiums
Dissemination of information on bonds
Although the ultimate objective should be the development of a public market for CAT bonds,
privately placed bonds are likely to continue to play an important role
Market development impeded to the extent that info on existing bonds is not available
Under current securities regulations, bond prospectuses for privately placed bonds can be
distributed only to investors falling under the denition of accredited investors
These rules have the unintended consequence of inhibiting research on CAT bonds
The SEC rules should be changed to allow sponsors to distribute bond prospectuses to researchers
who are not necessarily accredited investors
E.g., post prospectuses on repository maintained by appropriate governmental entity
111
Issues to be explored
Insurance regulators in key jurisdictions such as the US, the EU, and Japan could mandate catas-
trophe loss reporting for events above a given industry threshold such as $1 billion
This would solve an important current problem, i.e., the lack of a PCS-equivalent index for
the EU and Japan, and would enhance the market by providing more info on US losses
Until loss turns into recoverable, credit quality of counterparty is ignored by regulators
In RBC, the charges for reinsurance are not graded by reinsurer credit quality
Explicitly incorporating reinsurance credit quality into regulatory capital calculations and re-
lated regulatory credit evaluations has the potential to provide an important boost to the ILS
market as well as improving insurance solvency regulation in more general terms
In several key CAT-prone states, regulators are most reluctant to allow price increases at precisely
the times when insurer loss expectations/reinsurance prices are increasing most rapidly
The best solution to this problem would be to deregulate prices at the state level so that
primary insurers would not be caught in this price-cost bind
Short of deregulating prices, regulators could help ease the problem by giving primary insurers
credit for locking in multi-year pricing and capacity by issuing ILS
The application of the Employee Retirement Income Security Act (ERISA) to CAT bond collateral
trusts is complex and would benet from some thorough research
Conclusions
The CAT bond market is thriving and seems to have reached critical mass
The market achieved record bond issuance in 2005, 2006, and 2007
Bond premia have declined signicantly since 2001
The bonds now are priced competitively with catastrophe reinsurance
The bonds now account for a signicant share of the property-catastrophe reinsurance market
CAT bonds have an important role to play for high coverage layers and in retrocession market
Regulatory and accounting issues such as the regulatory accounting treatment of non-indemnity CAT
bonds and the issuance of most bonds oshore, which have been cited as impediments to the development
of the market, do not presently seem to pose serious problems
However, there are a number of issues/reforms that should be explored:
Fostering better reporting of CAT losses to facilitate development of better index products
Solvency regulation should be adapted to recognize the credit quality of reinsurance receivables
and give recognition to the full collateralization provided by CAT bonds
Primary insurance prices should be deregulated, and primary insurers should receive credit for
entering into contracts providing multi-year pricing/capacity through ILS/conventional reinsurance
Applicability of ERISA to CAT bond collateral trusts and the US GAAP and SAP treatment of
triggers employed in ILW and similar contracts
Issuers of CAT bonds should be required to make available bond prospectuses to researchers who
could provide valuable analysis of CAT risk nancing
The future looks bright for the ILS market
CAT bonds, swaps, sidecars, ILW, and other innovative products will play an increasingly important
role in providing risk nancing for CAT events
Event-linked bonds are also being used increasingly by primary insurers for lower layers of coverage
and non-CAT coverages such as automobile and commercial liability insurance
It remains to be seen whether CAT futures and options will play an important role in catastrophe
risk management in the years to come
Basis risk and counterparty credit risk, as well as the need to educate insurance industry partici-
pants, are the primary impediments to the success of these contracts
112
Additional Notes
113
Additional Notes
114
Additional Notes
115
CAS Exam 8 Notes - Parts C, D, & E
Futures, Forwards and Swaps
Options
Asset-Liability Management
Part II
Table of Contents
C Futures, Forwards and Swaps 1
Hull - Ch. 2: Mechanics of futures markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Hull - Ch. 3: Hedging strategies using futures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Hull - Ch. 5: Determination of forward and futures prices . . . . . . . . . . . . . . . . . . . . . . . 17
Hull - Ch. 7: Swaps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
D Options 35
Hull - Ch. 8: Mechanics of options markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Hull - Ch. 9: Properties of stock options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Hull - Ch. 10: Trading strategies involving options . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Hull - Ch. 11: Binomial trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Fabozzi: Valuation of bonds with embedded options . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Hull - Ch. 12: Wiener processes and Itos lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
Hull - Ch. 13: The Black-Scholes-Merton model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Black: Using the holes in Black-Scholes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Hull - Ch. 15: Options on stock indices and currencies . . . . . . . . . . . . . . . . . . . . . . . . . 87
Hull - Ch. 16: Futures options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
E Asset-Liability Management 99
BKM - Ch. 16: Managing bond portfolios . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Feldblum: Asset liability matching for P&C insurers . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Noris: Asset/Liability management strategies for P&C companies . . . . . . . . . . . . . . . . . . . 119
Panning: Managing interest rate risk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
Additional Notes 129
C Futures, Forwards and Swaps
Hull - Ch. 2: Mechanics of futures markets
Background
Both futures/forward contracts are agreements to buy/sell an asset at a future time for a certain price
Futures are traded on an organized exchange, contract terms are standardized by the exchange
By contrast, forward contracts are private agreements between nancial institutions/clients
Chicago Board of Trade (CBOT), Chicago Mercantile Exchange (CME): Largest US futures exchanges
Futures trade
On March 5, a trader in New York might call a broker with instructions to buy 5,000 bushels of
corn for delivery in July of the same year. The broker would immediately issue instructions to a
trader to buy (i.e., take a long position in) one July corn contract
Another trader in Kansas might instruct a broker to sell 5,000 bushels of corn for July delivery.
This broker would then issue instructions to sell (i.e., take a short position in) one corn contract
A price would be determined and the deal would be done
Under traditional open outcry system, oor traders representing each party would physically meet
to determine price. With electronic trading, a computer matches trades/monitor prices
Denitions
The trader in New York who agreed to buy has a long futures position in one contract
The trader in Kansas who agreed to sell has a short futures position in one contract
The price agreed to is the current futures price for July corn
This price, like any other price, is determined by the laws of supply and demand
Closing out positions
The vast majority of futures contracts do not lead to delivery:
Most traders choose to close out their positions prior to delivery period specied in contract
Closing out a position means entering into the opposite trade to the original one
Investors gain/loss determined by change in futures price from 3/5 to when contract closes out
Nevertheless, it is the possibility of nal delivery that ties the futures price to the spot price
Specication of a futures contract
As stipulated by the exchange, the agreement between the two parties must specify the asset, the
contract size, where delivery will be made, and when delivery will be made
Sometimes alternatives are specied for grade of asset delivered/delivery locations
As a general rule, it is the party with the short position (the party that has agreed to sell the asset)
that chooses what will happen when alternatives are specied by the exchange
When party with short position ready, it les a notice of intention to deliver with the exchange
Notice indicates selections made wrt. grade of asset delivered and delivery location
The asset
When asset is a commodity, the exchange stipulates acceptable grade(s) of the commodity
For some commodities, range of grades can be delivered, but price received depends it
Often, substitutions are allowed with price being adjusted in a way established by the exchange
The nancial assets in futures contracts are generally well dened and unambiguous
The contract size
This is an important decision for the exchange
If the contract size is too large, many investors will be unable to use the exchange
If the contract size is too small, trading expensive (cost associated with each contract traded)
Delivery arrangements
The place where delivery will be made must be specied by the exchange
This is particularly important for commodities that involve signicant transportation costs
When alternative delivery locations are specied, the price received by the party with the short
position is sometimes adjusted according to the location chosen by that party
Price higher for delivery locations relatively far from main sources of commodity
3
Delivery months
A futures contract is referred to by its delivery month
The exchange must specify the precise period during the month when delivery can be made
At any time, contracts trade for closest delivery month/few subsequent delivery months
The exchange species when trading in a particular months contract will begin
The exchange also species the last day on which trading can take place for a given contract
Trading generally ceases a few days before the last day on which delivery can be made
Price quotes
The exchange denes how prices will be quoted
E.g., crude oil prices on NYMEX are quoted in dollars and cents
Treasury bond/Treasury note futures on CBOT quoted in dollars and thirty-seconds
Price limits and position limits
For most contracts, daily price movement limits are specied by the exchange
A limit move is a move in either direction equal to the daily price limit
Normally, trading ceases for the day once the contract is limit up or limit down
Purpose: Prevent large price movements from speculative excesses
Position limits are the maximum number of contracts that a speculator may hold
Purpose: Prevent speculators from exercising undue inuence on the market
Convergence of futures price to spot price
As delivery period approaches, the futures price converges to spot price of underlying asset
When delivery period is reached, futures price spot price
Suppose that futures price > spot price during delivery period Arbitrage opportunity:
(i) Sell (i.e., short) a futures contract, (ii) Buy the asset, and (iii) Make delivery
These steps lead to a prot equal to the amount by which the futures price > spot price
As traders exploit this arbitrage opportunity, the futures price will fall
Suppose next that the futures price is below the spot price during the delivery period:
Companies interested in acquiring the asset nd attractive to enter into long futures contract and
wait for delivery As they do, futures prices tend to rise
The result is that the futures price is very close to the spot price during the delivery period
Daily settlements and margins
Key role of the exchange: Organize trading so that contract defaults are avoided Margins
The operation of margins
Consider two December gold futures contracts (size = 100 ounces, current futures price is $600/ounce)
The broker will require the investor to deposit funds in a margin account
Initial margin: Amount that must be deposited at time contract is entered into
At the end of each trading day, the margin account is adjusted to reect investors gain/loss
This practice is referred to as marking to market the account
A trade is rst marked to market at the close of the day on which it takes place
It is then marked to market at the close of trading on each subsequent day
Marking to market is not merely an arrangement between broker and client
When there is a decrease in the futures price so that the margin account of an investor with a
long position is reduced by $600, the investors broker has to pay the exchange $600 and the
exchange passes the money on to the broker of an investor with a short position
Investor entitled to withdraw any balance in margin account in excess of initial margin
Maintenance margin (< initial margin): Ensures that margin account never < 0
If balance in margin account falls below maintenance margin, investor receives a margin call
and is expected to top up margin account to initial margin level the next day
The extra funds deposited are known as a variation margin
Further details
Many brokers allow an investor to earn interest on the balance in a margin account
Balance in the account ,= true cost, provided that interest rate is competitive
4
To satisfy the initial margin requirements (but not subsequent margin calls), an investor can
sometimes deposit securities with the broker
Treasury bills are usually accepted in lieu of cash at about 90% of their face value
Shares are also sometimes accepted in lieu of cash, at 50% of market value
Eect of marking to market: Futures contract settled daily rather than at end of its life
At the end of each day, the investors gain (loss) is added to (subtracted from) the margin
account, bringing the value of the contract back to zero
A futures contract is in eect closed out and rewritten at a new price each day
Minimum levels for initial and maintenance margins are set by the exchange
Individual brokers may require greater margins from clients than specied by exchange
Margin levels are determined by the variability of the price of the underlying asset
The higher this variability, the higher the margin levels
The maintenance margin is usually about 75% of the initial margin
Day trades/spread transactions give rise to lower margin requirements than hedge transactions
Day trade: Trader tells broker an intent to close out position the same day
Spread transaction: The trader simultaneously buys a contract on an asset for one maturity
month and sells a contract on the same asset for another maturity month
Margin requirements are the same on short futures positions as they are on long futures positions
The spot market does not have this symmetry
Long position in spot market involves buying asset for immediate delivery No problems
Short position involves selling an asset not owned More complex (may not be possible)
The clearinghouse and clearing margins
Clearinghouse
Acts as an intermediary in futures transactions
Guarantees the performance of the parties to each transaction
The clearinghouse has a number of members, who must post funds with the exchange
Brokers who are not members themselves must channel their business through a member
The main task of the clearinghouse is to keep track of all the transactions that take place
during a day, so that it can calculate the net position of each of its members
The broker is required to maintain a margin account with a clearinghouse member and the clear-
inghouse member is required to maintain a margin account with the clearinghouse
The latter is known as a clearing margin
The margin accounts for clearinghouse members are adjusted for gains and losses at the end
of each trading day in the same way as are the margin accounts of investors
However, for clearinghouse members, there is an original margin, but no maintenance margin
Brokers not clearinghouse members maintain a margin account with a clearinghouse member
In determining clearing margins, the exchange clearinghouse calculates the number of contracts
outstanding on either a gross or a net basis
Gross basis: The number of contracts equals the sum of the long and short positions
Net basis: These are oset against each other
Most exchanges currently use net margining
Credit risk
Purpose of margining system: Eliminate risk that a trader who makes a prot is not paid
Overall the system has been very successful
Collateralization in OTC markets
Credit risk has traditionally been a feature of the over-the-counter markets
To reduce credit risk, OTC markets now imitates the margining system with collateralization
Participants in OTC market can enter into collateralization agreement, valuing contract each day
Each day, if value to A increases, B required to pay A (cash amount = increase), and conversely
Interest is paid on outstanding cash balances
Collateralization signicantly reduces the credit risk in OTC contracts
5
Newspaper quotes
Many newspapers carry futures prices
The prices refer to the trading that took place on the previous day
For most commodities, contracts trade with much longer maturities than those shown
However, trading volume tends to decrease as contract maturity increases
The asset underlying the futures contract, the exchange that the contract is traded on, the contract
size, how the price is quoted, and the maturity month of the contract are usually shown
Quotes usually show opening price, highest/lowest prices achieved in trading during the day
Settlement price
The settlement price is the price used for calculating daily gains/losses and margin requirements
Price at which contract traded just before the bell signaling end of trading for the day
Quotes also usually give the change in the settlement price from the previous day
Open interest
This is the total number of contracts outstanding
The open interest is the number of long positions the number of short positions
When volume of trading in a day > open interest at end of day, it indicates many day trades
Patterns of futures prices
Normal market: Settlement prices increase with the maturity of the contract
Inverted market: Futures price is a decreasing function of maturity
Futures prices can also show a mixture of normal and inverted markets
Delivery
Period during which delivery can be made is dened by the exchange and varies by contract
The decision on when to deliver is made by the party with the short position (investor A)
When A ready to deliver, As broker issues a notice of intention to deliver to exchange clearinghouse
This notice states how many contracts will be delivered and, in the case of commodities, also
species where delivery will be made and what grade will be delivered
The exchange then chooses a party with a long position to accept delivery
Exchange usually passes notice of intention to deliver to party with oldest outstanding long position
Parties with long positions must accept delivery notices
However, if notices are transferable, long investors have a short period of time (half an hour) to
nd another party with a long position that is prepared to accept the notice from them
For commodities, delivery means accepting a warehouse receipt in return for immediate payment
The party taking delivery is then responsible for all warehousing costs
In the case of nancial futures, delivery is usually made by wire transfer
For all contracts, the price paid is usually the most recent settlement price
If specied by the exchange, this price is adjusted for grade, location of delivery, and so on
The whole delivery procedure from notice to delivery takes 2-3 days
There are three critical days for a contract:
First notice day: 1st day on which a notice of intention to make delivery can be submitted
Last notice day: The last such day
Last trading day: Generally a few days before the last notice day
To avoid risk of having to take delivery, long positions should be closed out prior to 1st notice day
Cash settlement
Some nancial futures (on stock indices) are settled in cash (impossible to deliver underlying asset)
When contract settled in cash, all outstanding contracts are declared closed on a predened day
Final settlement price = spot price of underlying asset at opening/close of that trading day
Types of traders and types of orders
There are two main types of traders executing trades: Commission brokers and locals
Commission brokers are following instructions of their clients and charge a commission
Locals are trading on their own account
6
Individuals taking positions can be categorized as hedgers, speculators, or arbitrageurs
Speculators can be classied as scalpers, day traders, or position traders
Scalpers are watching for very short-term trends and attempt to prot from small changes in the
contract price. They usually hold their positions for only a few minutes
Day traders hold positions < one trading day, unwilling to take risk of adverse overnight news
Position traders hold positions for much longer periods (prot from major market movements)
Orders
Market order: Request that trade be carried out immediately at best price available in market
Limit order: Species a particular price: Order executed only at this price or better
Stop order or stop-loss order: Also species a particular price: Order executed at the best
available price once a bid or oer is made at that particular price or a less-favorable price
Purpose of stop order: To close out a position if unfavorable price movements take place
It limits the loss that can be incurred
Stop-limit order: A combination of a stop order and a limit order
Suppose a stop-limit order to buy is issued with stop price of $40 and limit price of $41
As soon as there is a bid or oer at $40, the stop-limit becomes a limit order at $41
If stop price = limit price, the order is called a stop-and-limit order
Market-if-touched (MIT) order: Executed at the best available price after a trade occurs at a
specied price or at a price more favorable than the specied price
In eect, an MIT becomes a market order once the specied price has been hit
An MIT is also known as a board order
Consider investor who has a long position and is issuing instructions to close out contract:
Stop order: Designed to limit the loss that can occur if unfavorable price movements
MIT order: Designed to ensure that prots are taken if favorable price movements
Discretionary order or market-not-held order: Traded as a market order except that execu-
tion may be delayed at the brokers discretion in an attempt to get a better price
Some orders specify time conditions:
Unless otherwise stated, an order is a day order and expires at the end of trading day
Time-of-day order: Species a particular period of time during the day
Open order or a good-till-canceled order: In eect until executed/until end of trading
Fill-or-kill order: Must be executed immediately on receipt or not at all
Regulation
US futures markets regulated federally by the Commodity Futures Trading Commission (CFTC)
This body is responsible for licensing futures exchanges and approving contracts
All new contracts and changes to existing contracts must be approved by the CFTC
To be approved, contract must have some useful economic purpose (serve hedgers and speculators)
The CFTC looks after the public interest:
Responsible for ensuring that prices are communicated to the public and that futures traders report
their outstanding positions if they are above certain levels
Licenses all individuals who oer their services to the public in futures trading
Deals with complaints brought by public and ensures that disciplinary action is taken
Has authority to force exchanges to discipline members in violation of exchange rules
With formation of National Futures Association (NFA) in 1982, some responsibilities of CFTC shifted:
The NFA is an organization of individuals who participate in the futures industry
Its objective is to prevent fraud/ensure that market operates in best interests of general public
Authorized to monitor trading and take disciplinary action when appropriate
NFA set up an ecient system for arbitrating disputes between individuals/members
From time to time, other bodies, such as the SEC, the Federal Reserve Board, and the US Treasury
Department, have claimed jurisdictional rights over some aspects of futures trading
Concerned with eects of futures trading on spot markets for securities (stocks, T-bills, T-bonds)
SEC has eective veto over approval of new stock/bond index futures contracts
7
Trading irregularities
One type of trading irregularity occurs when an investor group tries to corner the market
The investor group takes a huge long futures position and also tries to exercise some control
over the supply of the underlying commodity
Regulators usually deal with this type of abuse of the market by increasing margin requirements
or imposing stricter position limits or prohibiting trades that increase a speculators open
position or requiring market participants to close out their positions
Other types of trading irregularity can involve the traders on the oor of the exchange:
Overcharging customers, not paying customers the full proceeds of sales, and traders using
their knowledge of customer orders to trade rst for themselves (front running)
Accounting and tax
Accounting
Accounting standards require changes in the market value of a futures contract to be recognized
when they occur unless the contract qualies as a hedge
If the contract does qualify as a hedge, gains or losses are generally recognized in the same period
in which the gains or losses from the item being hedged are recognized (hedge accounting)
Consider a company with a December year end. In September 2007 it buys a March 2008 corn
futures contract and closes out the position at the end of February 2008
Suppose: Futures prices are 250c per bushel when contract entered into, 270c at end of 2007,
and 280c when contract closed out. Contract is for 5,000 bushels
If the contract does not qualify as a hedge, the gains for accounting purposes are:
5, 000 (2.70 2.50) = 1, 000 in 2007 and 5, 000 (2.80 2.70) = 500 in 2008
If contract qualies for hedge accounting, entire gain ($1,500) realized in 08 for accounting
FAS 133
Issued in June 1998, applies to all types of derivatives (futures, forwards, swaps, options)
FAS 133 requires all derivatives to be included on the balance sheet at fair market value
It increases disclosure requirements
It also gives companies far less latitude than previously in using hedge accounting:
For hedge accounting to be used, the hedging instrument must be highly eective in o-
setting exposures and an assessment of this eectiveness is required every three months
A similar standard IAS 39 has been issued by the IASB
Tax
Key issues: (i) Nature of a taxable gain/loss and, (ii) Timing of recognition of the gain/loss
Gains/losses are either classied as capital gains/losses or as part of ordinary income
Corporate taxpayer
Capital gains taxed at same rate as ordinary income, and restricted ability to deduct losses
Capital losses are deductible only to the extent of capital gains
Corporation may carry back capital loss for 3 years/forward for up to 5 years
Non-corporate taxpayer
Short-term capital gains taxed at same rate as ordinary income, but long-term (capital asset
held for 1+ year) capital gains are subject to a maximum capital gains tax rate of 15%
Capital losses are deductible to the extent of capital gains plus ordinary income up to $3,000
and can be carried forward indenitely
Generally, positions in futures are treated as if closed out on last day of the tax year
For noncorporate taxpayer, this gives rise to capital gains/losses treated as if 60% long term
and 40% short term without regard to holding period (60/40 rule)
A noncorporate taxpayer may elect to carry back for three years any net losses from the 60/40
rule to oset any gains recognized under the rule in the previous three years
Hedging transactions are exempt from this rule. The denition of a hedge transaction for tax
purposes is dierent from that for accounting purposes
8
The tax regulations dene a hedging transaction as a transaction entered into in the normal course
of business primarily for one of the following reasons:
1. Reduce the risk of price changes or currency uctuations with respect to property that is held
or to be held by the taxpayer for the purposes of producing ordinary income
2. Reduce risk of price/interest rate/currency uctuations wrt. taxpayer borrowings
Hedging transaction must be identied before end of the day on which taxpayer enters into it
The asset being hedged must be identied within 35 days
Gains or losses from hedging transactions are treated as ordinary income
The timing of recognition of gains/losses from hedging transactions generally matches the
timing of recognition of income/expense associated with transaction being hedged
Forward vs. futures contracts
Comparison of forward and futures contracts
Forward Futures
Private contract between two parties Traded on an exchange
Not standardized Standardized contract
Usually one specied delivery date Range of delivery dates
Settled at end of contract Settled daily
Delivery/nal cash settlement usually takes place Contract usually closed out prior to maturity
Some credit risk Virtually no credit risk
Prots from forward and futures contracts
Under forward contract, whole gain/loss is realized at the end of the life of the contract
Under futures contract, gain/loss is realized day by day because of the daily settlement
Foreign exchange quotes
Both forward and futures contracts trade actively on foreign currencies
However, dierences in the way exchange rates are quoted in the two markets
Futures prices where one currency is USD are quoted as USD per unit of foreign currency
Forward prices are always quoted in the same way as spot prices:
For British pound, euro, Australian dollar, and New Zealand dollar, forward quotes show USD
per unit of foreign currency (directly comparable with futures quotes)
For other currencies, forward quotes show number of units of foreign currency per USD
9
10
Hull - Ch. 3: Hedging strategies using futures
Basic principles
Short hedges
A short hedge is a hedge that involves a short position in futures contracts
Appropriate when hedger already owns asset and expects to sell it in the future
A short hedge can also be used when an asset is not owned right now but will be owned
Example
Assume: On May 15, an oil producer negotiates a contract to sell 1 million barrels of crude oil
The price that will apply in the contract is the market price on August 15
The oil producer will gain $10,000 for each 1 cent increase in the price of oil over the next 3
months and lose $10,000 for each 1 cent decrease in the price during this period
On May 15, spot price = $60 per barrel and futures price for August delivery = $59
Because each futures contract on NYMEX is for the delivery of 1,000 barrels, the company
can hedge its exposure by shorting 1,000 futures contracts
If producer closes out position on August 15, he will lock in a price $59 per barrel
Suppose that the spot price on August 15 proves to be $55 per barrel
The company realizes $55 million for the oil under its sales contract
The futures price on August 15 should be very close to the spot price of $55 on that date
Company gains $59 $55 = $4 per barrel from the short futures position
Total from both futures position/sales contract $59 per barrel, or $59M total
Suppose that the price of oil on August 15 proves to be $65 per barrel
Company realizes $65 for oil and loses approximately $65 $59 = $6 per barrel on the futures
Again, the total amount realized is approximately $59 million
Long hedges
Hedges that involve taking a long position in a futures contract
Appropriate when company will purchase an asset in the future and wants to lock in a price now
Long hedges can be used to manage an existing short position:
Consider an investor who has shorted a certain stock
Part of the risk faced by the investor is related to the performance of the whole stock market
The investor can neutralize this risk with a long position in index futures contracts
Making or taking delivery can be costly and inconvenient
Delivery is not usually made even when hedger keeps futures contract until delivery month
Hedgers with long positions usually avoid any possibility of having to take delivery by closing out
their positions before the delivery period
In practice, daily settlement does have a small eect on the performance of a hedge
If the exposure is such that the company gains when the price of the asset increases and loses when the
price of the asset decreases, a short hedge is appropriate
If the exposure is the other way round (i.e., the company gains when the price of the asset decreases
and loses when the price of the asset increases), a long hedge is appropriate
Arguments for and against hedging
Arguments for hedging
Most companies have no particular skills/expertise in predicting interest rates, exchange rates, and
commodity prices Makes sense to hedge such risks to avoid unpleasant surprises
The companies can then focus on their main activities and expertise
Hedging and shareholders
Argument against hedging: Shareholders can do the hedging themselves
Argument assumes that shareholders have as much info about companys risks as management
The argument also ignores commissions and other transactions costs Hedging less expensive
when carried out by company rather than individual shareholders
One thing that shareholders can do far more easily than a corporation is diversify risk
11
If companies are acting in the best interests of well-diversied shareholders, it can be argued
that hedging is unnecessary in many situations (at least in theory)
Hedging and competitors
If hedging not the norm in an industry, it may not make sense for one company to be dierent
Competitive pressures within industry may be such that prices of goods/services produced uctuate
to reect raw material costs, interest rates, exchange rates, and so on
A company that does not hedge can expect its prot margins to be roughly constant
However, a company that does hedge can expect its prot margins to uctuate!
This emphasizes the importance of looking at the big picture when hedging
All the implications of price changes on a companys protability should be taken into account
in the design of a hedging strategy to protect against the price changes
Hedging can lead to a worse outcome
A hedge can result in decrease/increase in a companys prots relative to position without hedging
Treasurer may fear criticism if company gains from underlying asset/loses on the hedge
Ideally, hedging strategies are set by a companys board of directors and are clearly communicated
to both the companys management and the shareholders No misunderstandings
Basis risk
In practice, hedging is often not quite straightforward. Reasons:
1. Asset hedged may not be exactly same as asset underlying futures contract
2. The hedger may be uncertain as to the exact date when the asset will be bought or sold
3. The hedge may require the futures contract to be closed out before its delivery month
These problems give rise to what is termed basis risk
The basis
The basis (hedging situation): Dierence between spot price of an asset and its futures price:
Basis = Spot price of asset to be hedged Futures price of contract used
If the asset to be hedged and the asset underlying the futures contract are the same, the basis
should be zero at the expiration of the futures contract
As time passes, spot price/futures price do not change by same amount Basis changes
An increase in the basis is referred to as a strengthening of the basis
A decrease in the basis is referred to as a weakening of the basis
Notations
S
1
: Spot price at time t
1
F
1
: Futures price at time t
1
b
1
: Basis at time t
1
S
2
: Spot price at time t
2
F
2
: Futures price at time t
2
b
2
: Basis at time t
2
Assume that a hedge is put in place at time t
1
and closed out at time t
2
. E.g., consider the case
where the spot and futures prices at the time the hedge is initiated are $2.50 and $2.20, respectively,
and that at the time the hedge is closed out they are $2.00 and $1.90, respectively
This means that S
1
= 2.50, F
1
= 2.20, S
2
= 2.00, and F
2
= 1.90
From the denition of the basis, we have: b
1
= S
1
F
1
= 0.30 and b
2
= S
2
F
2
= 0.10
Consider hedger who knows that asset will be sold at t
2
and takes short futures position at t
1
The price realized for the asset is S
2
and the prot on the futures position is F
1
F
2
The eective price that is obtained for the asset with hedging is therefore: S
2
+F
1
F
2
= F
1
+b
2
If b
2
were also known at this time, a perfect hedge would result. The hedging risk is the
uncertainty associated with b
2
and is known as basis risk
Consider a short hedge (for a long hedge, the reverse holds):
If the basis strengthens (i.e., increases) unexpectedly, the hedgers position improves
If the basis weakens (i.e., decreases) unexpectedly, the hedgers position worsens
When asset of hedgers exposure is dierent from asset underlying the futures contract:
This increases the basis risk
S
2
= price of asset underlying futures contract at t
2
, S
2
= price of asset being hedged at t
2
By hedging, company ensures that price paid (or received) for the asset is:
S
2
+F
1
F
2
F
1
+ (S
2
F
2
) + (S
2
S
2
)
12
The terms (S
2
F
2
) and (S
2
S
2
) represent the two components of the basis:
(S
2
F
2
): Basis if asset being hedged = asset underlying futures contract
(S
2
S
2
): Basis arising from the dierence between the two assets
Choice of contract
Key factor aecting basis risk: Choice of futures contract used for hedging. Two components:
1. The choice of the asset underlying the futures contract
2. The choice of the delivery month
Necessary to carry out a careful analysis to determine which of the available futures contracts has
futures prices that are most closely correlated with the price of the asset being hedged
The choice of the delivery month is likely to be inuenced by several factors:
We assumed: If hedge expiration = delivery month, contract with that delivery month chosen
In fact, a contract with a later delivery month is usually chosen in these circumstances
Reason: Futures prices can be quite erratic during the delivery month
Moreover, a long hedger runs the risk of having to take delivery of the physical asset if the
contract is held during the delivery month (can be expensive and inconvenient)
Long hedgers prefer to close out futures contract and buy asset from their usual suppliers
Basis risk increases as time dierence between hedge expiration/delivery month increases
Choose a delivery month as close as possible to, but later than, the expiration of the hedge
Cross hedging
Cross hedging occurs when asset underlying futures contract ,= asset whose price is being hedged
Hedge ratio: Ratio of [size of the position taken in futures contracts] [size of the exposure]
When asset underlying futures = asset being hedged, natural to use hedge ratio = 1.0
When cross hedging is used, setting the hedge ratio equal to 1.0 is not always optimal
Choose a hedge ratio value that minimizes the variance of the value of the hedged position
Calculating the minimum variance hedge ratio
Notations
S: Change in spot price S during a period of time equal to the life of the hedge
F: Change in futures price F during a period of time equal to the life of the hedge
S
: Standard deviation of S
F
: Standard deviation of F
: Coecient of correlation between S and F
h
F
(1)
If = 1 and
F
=
S
, hedge ratio h
2
= h
2
2
F
2
S
Parameters ,
F
, and
S
in Eq. (1) usually estimated from historical data on S and F
The implicit assumption is that the future will in some sense be like the past
Equal nonoverlapping time intervals are chosen to observe values of S and F
Ideally, length of each interval = length of time interval for which hedge is in eect
13
Optimal number of contracts
Denitions Q
A
: Size of position being hedged (units)
Q
F
: Size of one futures contract (units)
N
Q
A
units of the asset
The number of futures contracts required is therefore given by:
N
=
h
Q
A
Q
F
(2)
Denoting the i-th observations on F and S by x
i
and y
i
, respectively, we have:
F
=
x
2
i
n 1
(
x
i
)
2
n(n 1)
and
S
=
y
2
i
n 1
(
y
i
)
2
n(n 1)
The estimate of is:
=
n
x
i
y
i
x
i
y
i
_
[n
x
2
i
(
x
i
)
2
][n
y
2
i
(
y
i
)
2
]
Tailing the hedge
When futures are used for hedging, a small adjustment, known as tailing the hedge, can be made
to allow for the impact of daily settlement. In practice this means that Eq. (2) becomes:
N
=
h
V
A
V
F
(3)
V
A
= dollar value of hedged position, V
F
= dollar value of 1 futures contract (futures price Q
F
)
Eect of tailing the hedge: Multiply hedge ratio in Eq. (2) by ratio of spot price to futures price
Ideally, futures position should be adjusted as V
A
/V
F
change (not feasible in practice)
Stock index futures
Stock index futures are used to hedge or manage exposures to equity prices
A stock index tracks changes in the value of a hypothetical portfolio of stocks
Weight of a stock in the portfolio equals proportion of the portfolio invested in the stock
% increase in stock index over t = % increase in value of hypothetical portfolio
Dividends usually not included Index tracks only capital gain/loss from investing in portfolio
If hypothetical portfolio xed, weights assigned to individual stocks do not remain xed
Some indices are constructed from portfolio consisting of one of each of many stocks
Weights assigned to stocks are then their market prices (with adjustments for stock splits)
Other indices constructed so that weights market cap (stock price shares outstanding)
Underlying portfolio then adjusted to reect stock splits, dividends, new issues
Stock indices
Dow Jones Industrial Average
Based on a portfolio consisting of 30 blue-chip stocks in the US
The weights given to the stocks are proportional to their prices
CBOT: 2 futures contracts on DJIA: One on $10 index, other on $5 index
Standard & Poors 500 (S&P 500)
500 stocks: 400 industrials, 40 utilities, 20 transportation, and 40 nancials
Weights of stocks at any given time market capitalizations
CME: 2 futures on S&P 500: $250 index, $50 index
Nasdaq 100
100 stocks using National Association of Securities Dealers Automatic Quotations Service
CME: 2 contracts, $100 index, and $20 index
14
Russell 1000
Index of the prices of the 1000 largest capitalization stocks in the US
US Dollar Index
Trade-weighted index of the values of six foreign currencies (the euro, yen, pound, Canadian
dollar, Swedish krona, and Swiss franc)
Futures contracts on stock indices are settled in cash, not by delivery of the underlying asset
Hedging an equity portfolio
Stock index futures can be used to hedge a well-diversied equity portfolio. Dene:
P: Current value of the portfolio
F: Current value of one futures contract (futures price contract size)
If the portfolio mirrors the index, the optimal hedge ratio h
= P/F (4)
When portfolio ,= index mirror, use from CAPM to determine the hedge ratio
Beta is the slope of the best-t line obtained when excess return on the portfolio over the
risk-free rate is regressed against the excess return of the market over the risk-free rate
In general, h
=
P
F
(5)
Formula assumes that maturity of futures contract is close to maturity of the hedge
Example
Suppose that a futures contract with 4 months to maturity is used to hedge the value of a
portfolio over the next 3 months in the following situation:
Value of S&P 500 index = 1,000 Risk-free interest rate = 4% per annum
S&P 500 futures price = 1,010 Dividend yield on index = 1% per annum
Value of portfolio = $5,050,000 Beta of portfolio = 1.5
One futures contract is for delivery of $250 times the index
F = 250 1, 010 = 252, 500 and from Eq. (5), the number of futures contracts that should
be shorted to hedge the portfolio is: 1.5 5, 050, 000/252, 500 = 30
Suppose the index turns out to be 900 in 3 months and the futures price is 902
The gain from the short futures position is: 30 (1010 902) 250 = $810, 000
The loss on the index is 10%. The index pays a dividend of 1% per annum, or 0.25% per 3
months. When dividends are taken into account, an investor in the index would therefore earn
-9.75% in the 3-month period. The risk-free interest rate is approximately 1% per 3 months
Because the portfolio has a of 1.5, the CAPM gives that the expected return (%) on the
portfolio during the 3 months is: 1.0 + [1.5 (9.75 1.0)] = 15.125
The expected value of the portfolio (inclusive of dividends) at the end of the 3 months is
therefore $5, 050, 000 (1 0.15125) = $4, 286, 187 The expected value of the hedgers
position, including gain on the hedge, is: $4, 286, 187 + $810, 000 = $5, 096, 187
Reasons for hedging an equity portfolio
The hedge results in the investors position growing at the risk-free rate
It is natural to ask why the hedger should go to the trouble of using futures contracts:
Hedger could simply sell portfolio/invest proceeds in risk-free instruments (T-bills)
Hedging can be justied if hedger feels that stocks in portfolio have been chosen well
A hedge using index futures removes the risk arising from market moves and leaves the hedger
exposed only to the performance of the portfolio relative to the market
Another reason for hedging may be that the hedger is planning to hold a portfolio for a long period
of time and requires short-term protection in an uncertain market situation
Alternative strategy of selling portfolio/buying back later has very high transaction costs
15
Changing the beta of a portfolio
Sometimes futures contracts are used to change the beta of a portfolio
Continuing with our earlier example, to reduce the beta of the portfolio from 1.5 to 0.75, the
number of contracts shorted should be 15 rather than 30
To increase the beta of the portfolio to 2.0, a long position in 10 contracts should be taken
To change the portfolio from >
, a short position in (
, a long position in (
= k
at t
Discounting at r
)e
r
Example
Suppose that a nancial institution has agreed to pay 6-month LIBOR and receive 8% per
annum (with semiannual compounding) on a notional principal of $100 million
28
The swap has a remaining life of 1.25 years. The LIBOR rates with continuous compounding
for 3-month, 9-month, and 15-month maturities are 10%, 10.5%, and 11%. The 6-month
LIBOR rate at the last payment date was 10.2% (with semiannual compounding)
The calculations for valuing the swap in terms of bonds are summarized in the table below
Time B
fix
cash ow B
fl
cash ow Discount factor PV B
fix
CF PV B
fl
CF
0.25 4.0 105.100 0.9753 3.901 102.505
0.75 4.0 0.9243 3.697
1.25 104.0 0.8715 90.640
Total: 98.238 102.505
The xed-rate bond has CFs of 4, 4, and 104 on the three payment dates. The discount factors
for these CFs are: e
0.10.25
, e
0.1050.75
, and e
0.111.25
[Col (4)]
The table shows that the value of the xed-rate bond (in millions) is 98.238
L = $100 million, k
= probability of up move:
22p
+ 18(1 p
) = 20e
0.163/12
p
= 0.7041
The expected payo from the option in the real world is then: p
1 + (1 p
) 0 = 0.7041
Unfortunately, correct discount rate to apply to expected payo in real world is unknown
A position in a call option is riskier than a position in the stock
The discount rate to be applied to the payo from a call option is greater than 16%
Using risk-neutral valuation is convenient because in a risk-neutral world the expected return on
all assets (and thus the discount rate to use for all expected payos) is the risk-free rate
Two-step binomial trees
We can extend the analysis to a two-step binomial tree such as that shown in Fig. 1
Stock price initially S
0
. During each time step, it moves up/down to [u/d] [its initial value]
Suppose that the risk-free interest rate is r and the length of the time step is t years
Option prices at nal nodes of tree are easily calculated: They are the option payos
58
]
dd
f
2
) p + (1{
ud
f ) p (1{ p + 2
uu
f
2
p [
t r {2
e = f )
]
d
f ) p + (1{
u
pf [
t r {
e = f
]
dd
f ) p + (1{
ud
pf [
t r {
e =
d
f
]
ud
f ) p + (1{
uu
pf [
t r {
e =
u
f
p
p 1{
p 1{
p
p 1{
p
d { u
| | | |
d {
t r
e
= p
t
uu
f
ud
f
dd
f
d
f
f
u
f
2
u
0
S
2
d
0
S
ud
0
S
d
0
S
0
S
u
0
S
Figure 1: Stock and option prices in general two-step tree
Eqs. (2) and (3) become:
f = e
rt
[pf
u
+ (1 p)f
d
] (5)
p =
e
rt
d
u d
(6)
Repeated application of Eq. (5) gives:
f
u
= e
rt
[pf
uu
+ (1 p)f
ud
] (7)
f
d
= e
rt
[pf
ud
+ (1 p)f
dd
] (8)
f = e
rt
[pf
u
+ (1 p)f
d
] (9)
Substituting Eqs. (7) and (8) into Eq. (9), we get:
f = e
2rt
[p
2
f
uu
+ 2p(1 p)f
ud
+ (1 p)
2
f
dd
] (10)
This is consistent with the principle of risk-neutral valuation
p
2
, 2p(1 p), (1 p)
2
are probabilities that upper/middle/lower nal nodes reached
Option price = its expected payo in risk-neutral world discounted risk-free
As we add more steps to the binomial tree, the risk-neutral valuation principle continues to hold
Option price always = its expected payo in risk-neutral world discounted risk-free
Put options
The procedures described above can be used to price puts as well as calls
American options
The procedure is to work back through the tree from the end to the beginning, testing at each
node to see whether early exercise is optimal
The value of the option at the nal nodes is the same as for the European option
At earlier nodes the value of the option is the greater of:
1. The value given by Eq. (5)
2. The payo from early exercise
Delta
is an important parameter in the pricing and hedging of options
= ratio of [change in option price] [change in underlying stock price]
= # of units of stock we should hold for each option shorted to create riskless portfolio
The construction of a riskless hedge is sometimes referred to as delta hedging
The delta of a call option is positive, whereas the delta of a put option is negative
changes over time To maintain a riskless hedge, we need to adjust holdings in stock periodically
59
Matching volatility with u and d
In practice, when constructing a binomial tree, we choose u and d to match volatility of stock price
Suppose that the expected return on a stock (in the real world) is and its volatility is
The step is of length t. Stock price starts at S
0
and moves either up to S
0
u or down to S
0
d
Probability of up move in real world is p
S
0
u + (1 p
)S
0
d
To match the expected return on the stock with the trees parameters, we must have:
p
S
0
u + (1 p
)S
0
d = S
0
e
t
p
=
e
t
d
u d
(11)
The volatility of a stock price is dened so that
u
2
+ (1 p
)d
2
[p
u + (1 p
)d]
2
To match the stock price volatility with the trees parameters, we must have:
p
u
2
+ (1 p
)d
2
[p
u + (1 p
)d]
2
=
2
t (12)
Substituting Eq. (11) into Eq. (12) gives: e
t
(u+d) ud e
2t
=
2
t and when terms in t
2
and higher powers of t are ignored, one solution to this equation is:
u = e
t
and d = e
t
(13)
These are values of u/d proposed by Cox, Ross, Rubinstein (1979) for matching volatility
Risk-neutral analysis
The variable p is given by Eq. (6) as:
p =
a d
u d
where a = e
rt
(14)
The expected stock price at the end of the time step is S
0
e
rt
The variance of the stock price return in the risk-neutral world is:
pu
2
+ (1 p)d
2
[pu + (1 p)d]
2
= [e
rt
(u +d) ud e
2rt
]
Substituting u/d from Eq. (13), this equals
2
t when terms in t
2
are ignored
When we move from the real world to the risk-neutral world the expected return on the stock changes,
but its volatility remains the same (at least in the limit as t tends to zero)
Girsanovs theorem
When we move from a world with one set of risk preferences to a world with another set of risk
preferences, the expected growth rates in variables change, but their volatilities remain the same
Moving from one set of risk preferences to another is sometimes referred to as changing the measure
Real-world measure: aka, P-measure. Risk-neutral world measure: aka, Q-measure
Increasing the number of steps
When binomial trees are used in practice, option life is typically divided into 30+ time steps
In each time step there is a binomial stock price movement
With 30 time steps, 31 terminal stock prices and 2
30
10
9
stock price paths implicitly considered
Options on other assets
We can construct and use binomial trees for options on indices, currencies, and futures contracts in
exactly the same way as for options on stocks except that the equations for p change
Options on stocks paying a continuous dividend yield
Consider a stock paying a known dividend yield at rate q
The total return from dividends and capital gains in a risk-neutral world is r
The dividends provide a return of q Capital gains must provide a return of r q
60
If stock starts at S
0
, its expected value after one time step t must be S
0
e
(rq)t
. Hence,
pS
0
u + (1 p)S
0
d = S
0
e
(rq)t
p =
e
(rq)t
d
u d
We match volatility by setting u = e
t
and d = 1/u
We can use Eqs. (13) to (14), except that we set a = e
(rq)t
instead of a = e
rt
Options on stock indices
We assume that the stocks underlying the index provided a dividend yield at rate q
Valuation of option on stock index Option on stock paying known dividend yield
Options on currencies
Foreign currency asset providing a yield at foreign risk-free rate of interest r
f
Analogy with stock index: For options on a currency, use Eqs. (13)/(14) and set a = e
(rr
f
)t
Options on futures
It costs nothing to take a long or a short position in a futures contract
In a risk-neutral world a futures price should have an expected growth rate of zero
If F
0
= initial futures price, expected futures price at end of time step t is also F
0
. Thus,
pF
0
u + (1 p)F
0
d = F
0
p =
1 d
u d
Therefore, we can use Eqs. (13)/(14) with a = 1
61
62
Fabozzi: Valuation of bonds with embedded options
Introduction
The complication in building a model to value bonds with embedded options and option-type derivatives
is that cash ows will depend on interest rates in the future
The rst step is to move from the yield curve to a valuation lattice
Lattice holds all the info required to value certain option-like interest-rate products
First, the lattice is used to generate the CFs across the life of the security
Next, the interest rates on the lattice are used to compute the PV of those CFs
Binomial model: Lattice model where only two rates are possible in next period
Regardless of underlying assumptions, models share common restriction: Interest-rate tree generated
must produce a value for an on-the-run optionless issue consistent with current par yield curve
Output from the model must be equal to observed market price for optionless instrument
Under these conditions, the model is said to be arbitrage free
A lattice that produces an arbitrage-free valuation is said to be fair
The lattice is used for valuation only when it has been calibrated to be fair
The interest rate lattice
Fig. 1 provides an example of a binomial interest-rate tree: Consists of number of nodes/legs
Each leg represents a one-year interval over time
The methodology is the same for smaller time periods
In practice, the selection of the length of the time period is critical
The distribution of future interest rates is represented by the nodes at each point in time
Each node is labeled as N and has a subscript, a combination of Ls and Hs
Subscripts indicate whether node is lower/higher on the tree, relative to other nodes
The root of the tree is N, the only point in time at which we know the interest rate with certainty:
The one-year rate today (at N) is the current one-year spot rate r
0
Today Year 1 Year 3
LLL
N
| | |
;LLL 3
r
HLL
N
| | |
;HLL 3
r
HHL
N
| | |
;HHL 3
r
HHH
N
| | |
;HHH 3
r
N
|
0
r
L
N
| |
;L 1
r
H
N
| |
;H 1
r
HL
N
| |
;HL 2
r
LL
N
| |
;LL 2
r
Year 2
HH
N
| | |
;HH 2
r
Figure 1: 3-yr binomial interest-rate tree
Year 2
N
|
0
r
Year 3 Year 1 Today
H
N
| |
2
e
1
r
L
N
| |
1
r
LL
N
| |
2
r
HL
N
| |
2
e
2
r
HH
N
| |
4
e
2
r
LLL
N
| | |
3
r
HLL
N
| | |
2
e
3
r
HHL
N
| | |
4
e
3
r
HHH
N
| | |
6
e
3
r
Figure 2: 3-yr binomial tree with 1-yr rates
We must make an assumption concerning the probability of reaching one rate at a point in time:
Here, rates at any point in time have the same probability of occurring
The probability is 50% on each leg
The interest-rate model used to construct the binomial tree assumes that the one-year rate evolves over
time based on a log-normal random walk with a known (stationary) volatility
The relationship between any two adjacent rates at a point in time is calculated via:
r
1,H
= r
1,L
e
2
t
where = assumed volatility of 1-yr rate and t = time in years
In the second year, there are three possible values for the one-year rate:
r
2,HH
= r
2,LL
e
4
and r
2,HL
= r
2,LL
e
2
and r
2,LL
This relationship between rates holds for each point in time, as shown in Fig. 2
63
Determining the value at a node
To get a securitys value at a node, we follow the fundamental rule for valuation:
The value is the PV of the expected cash ows
Discount rate for CFs one year forward = one-year rate at node where we are valuing
Fig. 3: Illustration with 1-yr rate r
r 1+
| | |
C +
L
V
+
r 1+
| | |
C +
H
V
[
2
|
1
= V
value is sought
node where bond's
One-year rate at
higher-rate state
Cash flow in
Bond's value in higher-rate
state one-year forward
state one-year forward
Bond's value in lower-rate
lower-rate state
Cash flow in
C +
L
V
r
|
V
C +
H
V
Figure 3: Calculating a value at a node
The future CFs include:
1. The coupon payment one year from now
2. The bonds value one year from now, both of which may be uncertain
Starting process from the last year in tree and working backwards resolves the uncertainty:
At maturity, the instruments value is known with certainty: Par value
Final coupon payment determined from coupon rate/prevailing rates to which it is indexed
This process of working backward is often referred to as recursive valuation
Using our notation, the cash ow at a node is either:
V
H
+C for the higher one-year rate
V
L
+C for the lower one-year rate
The PV of these two CFs using the one-year rate at the node r
is:
V
H
+C
(1 +r
)
= PV for the higher one-year rate,
V
L
+C
(1 +r
)
= PV for the lower one-year rate
Then the value of the bond at the node is found as:
Value at a node =
1
2
_
V
H
+C
(1 +r
)
+
V
L
+C
(1 +r
)
_
Calibrating the lattice
To ensure that no-arbitrage condition holds, lattice must be calibrated to current par yield curve
Ultimately, the lattice must price optionless par bonds at par
Assume the on-the-run par yield curve for a hypothetical issuer as it appears below:
Maturity Par Rate Market Price
1 year 3.50% 100
2 years 4.20% 100
3 years 4.70% 100
4 years 5.20% 100
We assume that volatility is 10% and construct a two-year tree using the two-year bond with a
coupon rate of 4.2%, the par rate for a two-year security
The root rate for the tree r
0
is simply the current one-year rate, 3.5%
At beginning of year 2, two possible one-year rates: The higher rate and the lower rate
64
We already know the relationship between the two
A rate of 4.75% at N
L
has been chosen arbitrarily as a starting point
An iterative process determines the proper rate
The steps are described below and illustrated in Fig. 4: The goal is a rate that, when applied in
the tree, provides a value of par for the two-year 4.2% bond
(98.486 + 4.2) / (1.035)
+ (99.475 + 4.2) / (1.035)]
99.691 = 1/2 [
98.486 = (100 + 4.2) / (1.0582)
99.475 = (100 + 4.2) / (1.0475)
98.486
4.2
5.8017%
99.691
3.5000%
N
H
N
L
N
4.7500%
4.2
99.475
4.2
100
100
4.2
4.2
100
HH
N
HL
N
LL
N
Year 2 Year 1 Today
Figure 4: 1-yr rates for Y
1
: 1st trial
(99.766 + 4.2) / (1.035)
+ (98.834 + 4.2) / (1.035)]
100.00 = 1/2 [
100.00
3.5000%
99.466 = (100 + 4.2) / (1.0445)
98.834 = (100 + 4.2) / (1.0543)
98.834
4.2
5.4289%
4.4448%
4.2
99.766
N
H
N
L
N
4.2
100
100
4.2
4.2
100
HH
N
HL
N
LL
N
Year 2 Year 1 Today
Figure 5: 1-yr rates for Y
1
using 2-yr 4.2% issue
Step 1
Select a value for r
1
, the lower one-year rate. In this rst trial, r
1
= 4.75%
Step 2
Determine the corresponding value for the higher one-year rate
This rate is related to the lower one-year rate as: r
1
e
2
The higher one-year rate is 5.8017% (= 4.75%e
20.10
), reported in Fig. 4 at node N
H
Step 3: Compute the bond values one year from now
1. Determine bonds value 2 years from now: Since we are using a two-year bond, the bonds
value is its maturity value ($100) plus its nal coupon payment ($4.2) It is $104.2
2. Calculate V
H
: Cash ows are known, the appropriate discount rate is the higher one-year rate
The present value is $98.486 (= $104.2/1.058017)
3. Calculate V
L
: CFs known, and discount rate assumed for lower one-year rate is 4.75%
The present value is $99.475 (= $104.2/1.0475)
Step 4: Calculate V
1. Add the coupon to both V
H
and V
L
to get the cash ow at N
H
and N
L
We have $102.686 for the higher rate and $103.675 for the lower rate
2. Calculate V : The one-year rate is 3.50%
V = $99.691 = 1/2($102.686/1.035 + $103.675/1.035)
Step 5
Compare the value in step 4 to the bonds market value
If the two values are the same, then the r
1
used in this trial is the one we seek
If the value found in step 4 is not equal to the market value of the bond, this means that the
value r
1
is not the one-year rate that is consistent with the current yield curve
In this case, the ve steps are repeated with a dierent value for r
1
The correct rate for r
1
in this example is 4.4448%
The corresponding binomial tree is shown in Fig. 5
The value at the root is equal to the market value of the two-year issue (par)
We can grow this tree for one more year by determining r
2
Now we will use the three-year on-the-run issue, the 4.7% coupon bond, to get r
2
Same 5 steps in iterative process to nd the 1-yr rates in the tree two years from now
Our objective is now to nd the value of r
2
that produces a bond value of $100
Note that the two rates one year from now of 4.4448% (the lower rate) and 5.4289% (the higher
rate) do not change: These are the fair rates for the tree one-year forward
65
Using the lattice for valuation
To use lattice for valuation, consider a 6.5% option-free bond with 4 years remaining to maturity
Since this bond is option-free, it is not necessary to use the lattice model to value it
All that is necessary to obtain an arbitrage-free value for this bond is to discount the cash ows
using the spot rates obtained from bootstrapping the yield curve shown in the table above
The spot rates are: 1-year 3.5000%, 2-year 4.2147%, 3-year 4.7345%, 4-year 5.2707%
Discounting the 6.5% 4-yr option-free bond (par = $100) at spot rates above Bond value = $104.64
5.0483%
6.5
101.38
100.31
6.5
6.1660%
7.5312%
6.5
99.041
9.1987%
6.5
97.529
102.53
4.6958%
6.5
6.5
5.7354%
100.42
97.925
7.0053%
6.5
103.38
4.4448%
6.5
6.5
5.4289%
100.23
Coupon
Short-term rate
Computed value
Year 2 Year 1
L
N
H
N
Today
N
6.5
100
100
6.5
Year 4
100
6.5
100
6.5
LLLL
N
HHHH
N
HLLL
N
HHHL
N
HHLL
N
6.5
100
LLL
N
HHH
N
HLL
N
HHL
N
Year 3
HH
N
HL
N
LL
N
3.5000%
104.64
Figure 6: Valuing an option-free bond with four years to maturity and 6.5% coupon
Fig. 6 shows the various values in the discounting process
Root of the tree shows the bond value of $104.64 = value found by discounting at spot rates
The lattice is consistent with the valuation of an option-free bond when using spot rates
Fixed-coupon bonds with embedded options
Embedded option requires adjustment to CFs on the tree depending on option structure:
Decision whether to call/put must be made at nodes where option is eligible for exercise
Valuing a callable bond
In the case of a call option, the call will be made when the PV of the future cash ows is greater
than the call price at the node where the decision to exercise is being made:
V
t
= min[Call price, PV (future CF)]
V
t
represents the PV of future cash ows at the node
This operation is performed at each node where the bond is eligible for call
Fig. 7 shows that two values are now present at each node of the binomial tree:
Discounting explained earlier is used to calculate 1st of the two values at each node
The second value is the value based on whether the issue will be called
Again, the issuer calls the issue if the PV of future CFs exceeds the call price
This second value is incorporated into the subsequent calculations
Value of call option: [Value of an optionless bond] [value of a callable bond]
Here, the value of the option-free bond is $104.64. The value of the callable bond is $102.899
Hence the value of the call option is $1.744 (= 104.634 102.899)
Valuing a putable bond
Putable bond: Bondholder has the right to force issuer to pay o the bond prior to maturity
The analysis of the putable bond follows closely that of the callable bond
If the PV of the future CFs is less than the put price (i.e. par), then the bond will be put:
V
t
= max[Put price, PV (future CFs)]
66
101.38
100.00
6.5
5.0483%
6.1660%
6.5
100.00
100.31
99.041
99.041
6.5
7.5312%
4.6958%
6.5
101.72
100.00
100.00
100.27
6.5
5.7354%
7.0053%
6.5
97.925
97.925
4.4448%
6.5
100.00
101.97
100.03
100.00
6.5
5.4289%
Call price / Computed value
Coupon
Short-term rate
Computed value
9.1987%
6.5
97.529
97.529
Year 2 Year 1
L
N
H
N
Today
N
6.5
100
100
6.5
Year 4
100
6.5
100
6.5
LLLL
N
HHHH
N
HLLL
N
HHHL
N
HHLL
N
6.5
100
LLL
N
HHH
N
HLL
N
HHL
N
Year 3
HH
N
HL
N
LL
N
3.5000%
102.90
Figure 7: Valuing a callable bond
The value of the putable bond is greater than the value of the corresponding option-free bond
Value of put option: [Value of the putable bond] [value of corresponding option-free bond]
Suppose that a bond is both putable and callable
Adjust the value at each node to reect whether the issue would be put or called
Specically, at each node, two decisions about exercising option must be made:
If it is called, the value at the node is replaced by the call price
The valuation procedure then continues using the call price at that node
If call not exercised, determine whether the put option will be exercised
If put exercised, then put price is substituted at that node and used subsequently
Valuation of more exotic structures
Valuing a step-up callable note
Callable instruments whose coupon rate is increased (stepped up) at designated times
Single step-up callable note: When coupon rate increased only once over securitys life
Multiple step-up callable note: Coupon increased more than one time
Valuation using lattice similar to that for valuing a callable bond except that the CFs are altered
at each node to reect the coupon characteristics of a step-up note
Value of embedded call option: [Optionless step-up note value] [step-up callable note value]
Valuing a range note
A range note is a security that pays the reference rate only if the rate falls within a band:
If reference rate falls outside the band, whether lower/upper boundary, no coupon paid
Typically, the band increases over time
The tree is modied to reect the fact that either the one-year reference rate is paid, or nothing,
depending on the interest rate at a given node for the calibrated tree
Using recursive method, we work back through the tree to current value of range note
Extensions
Option-adjusted spread
Market transactions determine actual price of xed income instrument, not lattice calculations
If markets provide meaningful price (function of liquidity), this price can be translated into an
alternative measure of value: The option-adjusted spread (OAS)
The OAS for a security is the xed spread (measured in basis points) over benchmark rates that
equates the output from valuation process with actual market price of the security
For an optionless security, the calculation of OAS is a relatively simple, iterative process
Process much more analytically challenging with the added complexity of optionality
And just as the value of the option is volatility-dependent, the OAS for a xed income security
with embedded options or an option-like interest-rate product is volatility-dependent
67
Suppose info from market that price of callable bond from Fig. 7 is actually $102.218
We need the OAS that equates the value from the lattice with the market price
Since the market price is lower than the valuation, the OAS is a positive spread to the rates
in the exhibit, rates that we assume to be benchmark rates
The solution here is 35 bp: Observed market value ($102.218) = value of callable bond calcu-
lated in Fig. 7 after adding 35 bp to each rate, at each node of original calibrated lattice
Eective duration and eective convexity
Duration/convexity provide measure of interest-rate risk inherent in xed income security
We rely on the lattice model to calculate the eective duration and eective convexity of a bond
with an embedded option and other option-like securities
The formulas for these two risk measures are given by:
Eective duration =
V
V
+
2V
0
r
and Eective convexity =
V
+
+V
2V
0
2V
0
(r)
2
Where V
/V
+
are values derived following parallel shift in yield curve down/up, by xed spread
The model adjusts for the changes in the value of the embedded call option that result from the
shift in the curve in the calculation of V
and V
+
Note that the calculations must account for the OAS of the security
Steps for the proper calculation of V
+
(calculation for V
is analogous):
Step 1: Given the market price of the issue, calculate its OAS
Step 2: Shift the on-the-run yield curve up by a small number of basis points r
Step 3: Construct a binomial interest-rate tree based on the new yield curve from step 2
Step 4: Shift the binomial interest-rate tree by the OAS to obtain an adjusted tree
The calculation of the eective duration and convexity assumes a constant OAS
Step 5: Use the adjusted tree in step 4 to determine the value of the bond V
+
To determine the value of V
, the same ve steps are followed except that in step 2, the on-the-run
yield curve is shifted down by a small number of basis points r
68
Hull - Ch. 12: Wiener processes and Itos lemma
Stochastic processes
Variable whose value changes over time in uncertain way is said to follow a stochastic process
Stochastic processes can be classied as discrete time or continuous time:
Discrete-time stochastic process: Variable can change only at certain xed points in time
Continuous-time stochastic process: Changes can take place at any time
Stochastic processes can also be classied as continuous variable or discrete variable:
In a continuous-variable process, the underlying variable can take any value within a range
In a discrete-variable process, only certain discrete values are possible
The Markov property
Markov process: Stochastic process where only the present value of a variable is relevant for the future:
Past history of variable and the way that present has emerged from past are irrelevant
Stock prices are usually assumed to follow a Markov process
The Markov property implies that the probability distribution of the price at any particular future time
is not dependent on the particular path followed by the price in the past
The Markov property of stock prices is consistent with the weak form of market eciency:
This states that stock present price impounds all info contained in past prices
Competition in the marketplace tends to ensure that weak-form market eciency holds
Due to the many investors watching closely the stock market, trying to make a prot from it leads
to a situation where a stock price, at any given time, reects the information in past prices
Continuous-time stochastic processes
Consider a variable that follows a Markov stochastic process:
Suppose that its current value is 10 and that the change in its value during 1 year is (0, 1), where
(m, v) denotes a Normal probability distribution with mean m and variance v
Change in 2 years = sum of two normal distributions, each of which has m = 0, v = 1.0:
Because the variable is Markov, the two probability distributions are independent
When we add two independent normal distributions, the result is a normal distribution where
the mean is the sum of the means and the variance is the sum of the variances
The change in the variable over 2 years has the distribution (0, 2)
More generally, change during any time period of length T distributed according to (0, T)
Wiener process
Type of Markov process with mean change = 0 and variance rate = 1.0/year (aka Brownian motion)
Formally, a variable z follows a Wiener process if it has the following two properties:
Property 1: The change z during a small period of time t is:
z =
i=1
t where the
i
(i = 1, , N) are distributed (0, 1) (2)
From the second property of Wiener processes, the
i
are independent of each other
From Eq. (2), z(T) z(0) is normally distributed, with:
mean of [z(T) z(0)] = 0
standard deviation of [z(T) z(0)] =
T
69
Our uncertainty about the value of the variable at a certain time in the future, as measured by its
standard deviation, increases as the square root of how far we are looking ahead
Two intriguing properties of Wiener processes related to
t t for small t:
1. The expected length of the path followed by z in any time interval is innite
2. The expected number of times z equals any particular value in any time interval is innite
Generalized Wiener process
Drift rate: The mean change per unit time for a stochastic process
Variance rate: The variance per unit time
A generalized Wiener process for a variable x can be dened in terms of dz as:
dx = a dt +b dz with a and b constants (3)
The a dt term implies that x has an expected drift rate of a per unit of time:
Without the b dz term, the equation is dx = a dt, which implies that: x = x
0
+at
In a period of time of length T, the variable x increases by an amount aT
RHS of Eq. (3): b dz term adding noise/variability to path followed by x
Amount of noise/variability is b [Wiener process] and has a std. dev. of b
In a small time interval t, the change x in the value of x is: x = at +b
t
Thus, x has a normal distribution with:
mean of x = at
standard deviation of x = b
t
variance of x = b
2
t
The change in the value of x in any time interval T is normally distributed with:
mean of change in x = aT
standard deviation of change in x = b
T
variance of change in x = b
2
T
Thus, the generalized Wiener process in Eq. (3) has expected drift rate a and variance rate b
2
Ito process
Generalized Wiener process where a and b are functions of x and t
dx = a(x, t)dt +b(x, t)dz (4)
Both expected drift rate/variance rate of an Ito process are liable to change over time:
Between [t, t + t], variable changes from x to x + x, with: x = a(x, t)t +b(x, t)
t
This relationship assumes that the drift and variance rate of x remain constant, equal to a(x, t)
and b(x, t)
2
, during the small time interval between t and t + t
The process for a stock price
Key aspect of stock prices: Expected % return required by investors is independent of stocks price
The assumption of constant expected drift rate is inappropriate and needs to be replaced by the
assumption that the expected return (i.e., expected drift divided by the stock price) is constant
If S = stock price at t, then expected drift rate in S should be S for some constant
In a short interval of time t, the expected increase in S is St
The parameter is the expected rate of return on the stock
If the volatility of the stock price is always zero, then this model implies that:
S = St S
T
= S
0
e
T
(5)
Eq. (5): When variance rate = 0, stock grows at continuously compounded rate of per unit of time
In practice, a stock price does exhibit volatility
Assumption: Variability of % return during t is the same regardless of stock price
Std. dev. of change during t should be stock price and the model becomes:
dS = Sdt +Sdz
dS
S
= dt +dz (6)
70
Eq. (6) is the most widely used model of stock price behavior
The variable is the volatility of the stock price
The variable is its expected rate of return
Model Limiting case of random walk (binomial trees in Ch. 11) as t 0
Discrete-time model
This model of stock price behavior is a geometric Brownian motion. Discrete-time version:
S
S
= t +
t (7)
The LHS of Eq. (7) is the return provided by the stock in a short period of time t
The term t is the expected value of this return
The term
t:
S
S
(t,
2
t) (8)
Monte Carlo simulation
Procedure for sampling random outcomes for a stochastic process
Path of stock price simulated by sampling repeatedly for from (0, 1)/substituting into Eq. (7)
Because simulated process is Markov, samples for should be independent of each other
Repeating simulations, complete distribution of stock price at end of time period is obtained
The parameters
The parameter is the expected return (annualized) earned by an investor in a short period of time
should depend on the risk of the return from the stock (higher risk higher )
More precisely, depends on that part of risk that cannot be diversied away by investor
also depends on level of interest rates (higher interest rates higher )
Fortunately, we do not have to concern ourselves with the determinants of in any detail because
the value of a derivative dependent on a stock is, in general, independent of
(stock price volatility) is by contrast critically important to determine many derivatives values
Typical values of for a stock are in the range 0.15 to 0.60
Std. dev. of proportional change in stock price in a small interval t is
t
Std. dev. of proportional change in stock price over long period T is
T (rough approx.)
Volatility can be approximated as the std. dev. of the change in stock price in 1 year
Itos lemma
Suppose that the value of a variable x follows the Ito process:
dx = a(x, t)dt +b(x, t)dz where dz Wiener process, a and b functions of x and t (9)
The variable x has a drift rate of a and a variance rate of b
2
Itos lemma shows that a function G of x and t follows the process:
dG =
_
G
x
a +
G
t
+
1
2
2
G
2
x
b
2
_
dt +
G
x
bdz (10)
dz is the same Wiener process as in Eq. (9) G also follows an Ito process, with drift rate of:
G
x
a +
G
t
+
1
2
2
G
2
x
b
2
and a variance rate of:
_
G
x
_
2
b
2
71
Earlier, we argued that dS = Sdt +Sdz with and constant, is a reasonable model of stock price
movements. From Itos lemma, it follows that the process followed by a function G of S and t is:
dG =
_
G
S
S +
G
t
+
1
2
2
G
2
S
2
S
2
_
dt +
G
S
Sdz (11)
Where both S and G are aected by the same underlying source of uncertainty dz
Application to forward contracts
Consider a forward contract on a non-dividend-paying stock
F
0
= S
0
e
rT
where F
0
is the forward price at time zero, S
0
is the spot price at time zero, T is
the time to maturity, and r is the risk-free rate of interest (constant for all maturities)
Dene F as the forward price at a general time t, and S as the stock price at time t, with
t < T. The relationship between F and S is given by:
F = Se
r(Tt)
(12)
Assume process for S is given by Eq. (6). Itos lemma determines process for F. From Eq. (12):
F
S
= e
r(Tt)
,
2
F
2
S
= 0,
F
t
= rSe
r(Tt)
From Eq. (11), the process for F is given by:
dF =
_
e
r(Tt)
S rSe
r(Tt)
_
dt +e
r(Tt)
Sdz
Substituting F for Se
r(Tt)
gives:
dF = ( r)Fdt +Fdz (13)
Like S, the forward price F follows geometric Brownian motion
It has an expected growth rate of r rather than
The growth rate in F is the excess return of S over the risk-free rate
The lognormal property
Use Itos lemma to derive process followed by lnS when S follows Eq. (6). We dene G = lnS. Since:
G
S
=
1
S
,
2
G
2
S
=
1
S
2
,
G
t
= 0,
It follows from Eq. (11) that the process followed by G is:
dG =
_
2
2
_
dt +dz (14)
Since and are constant, G = lnS follows a generalized Wiener process:
It has constant drift rate
2
/2 and constant variance rate
2
Change in lnS from 0 to T normally distributed, with mean (
2
/2)T and variance
2
T. Hence,
lnS
T
lnS
0
__
2
2
_
T,
2
T
_
lnS
T
_
lnS
0
+
_
2
2
_
T,
2
T
_
(15)
Eq. (15) shows that lnS
T
is normally distributed
A stocks price at time T, given its price today, is lognormally distributed
Std. dev. of log of stock price is
t so that:
S
S
(t,
2
t) (1)
The model implies that lnS
T
is normally distributed, so that S
T
has a lognormal distribution:
ln
S
T
S
0
__
2
2
_
T,
2
T
_
lnS
T
_
lnS
0
+
_
2
2
_
T,
2
T
_
(2)
The mean of lnS
T
is lnS
0
+ (
2
/2)T and the standard deviation is
T
Properties of the (lognormal) stock price distribution
A variable that has a lognormal distribution can take any value between zero and innity
95% chance that normally distributed variable is within 1.96 std. dev. of its mean
The expected value E(S
T
) of S
T
is given by:
E(S
T
) = S
0
e
T
(3)
The variance V ar(S
T
) of the stock price S
T
is given by:
V ar(S
T
) = S
2
0
e
2T
(e
2
T
1) (4)
The distribution of the rate of return
Dene the continuously compounded rate of return per annum realized between 0 and T as x. Then:
S
T
= S
0
e
xT
x =
1
T
ln
S
T
S
0
From Eq. (2), it follows: x
_
2
2
,
2
T
_
(5)
Thus, the continuously compounded rate of return is Normal, mean =
2
/2 and std. dev. = /
T
As T increases, the standard deviation of x declines
More certain about avg. return over 20 years than about return in any one year
The expected return
Expected return required by investors depends on riskiness of stock and on level of interest rates:
The higher the risk, the higher the expected return
The higher the level of interest rates, the higher the expected return required
Fortunately, we do not have to concern ourselves with the determinants of in any detail:
Value of stock option, when expressed in terms of underlying stock, does not depend on
Reason why expected continuously compounded return x ,= is subtle, but important:
Under our assumptions for stock price behavior, the average of the returns on the stock in each
interval of length t is close to t is close to arithmetic mean of the S
i
/S
i
However, the expected return over the whole period covered by the data E(x), expressed with a
compounding interval of t is close to
2
/2
Reason: Geometric mean of set of numbers (not all the same) < arithmetic mean
Volatility
From Eq. (5), the volatility of a stock price can be dened as the standard deviation of the return
provided by the stock in 1 year when the return is expressed using continuous compounding
73
_
1
n 1
n
i=1
(u
i
u)
2
=
_
1
n 1
n
i=1
u
2
i
1
n(n 1)
_
n
i=1
u
i
_
2
where u = mean of u
i
s
From Eq. (2), the standard deviation of the u
i
is
2n
Choosing an appropriate value for n is not easy:
More data generally lead to more accuracy, but does change over time and data that are too
old may not be relevant for predicting the future volatility
Compromise: Use closing prices from daily data over most recent 90-180 days
Alternatively, n set equal to # of days to which volatility is to be applied
The foregoing analysis can be adapted to accommodate dividend-paying stocks:
Return u
i
during a time interval that includes an ex-dividend day:
u
i
= ln
S
i
+D
S
i1
where D is the amount of dividend
The return in other time intervals is still: u
i
= lnS
i
/S
i1
However, as tax factors play a part in determining returns around an ex-dividend date, it is
probably best to discard altogether data for intervals that include an ex-dividend date
Trading days vs. calendar days
Important issue: Should time be measured in calendar/trading days when volatility is estimated/used?
Volatility is much higher when the exchange is open for trading than when it is closed
As a result, practitioners tend to ignore days when the exchange is closed when estimating
volatility from historical data and when calculating the life of an option
The volatility per annum is calculated from the volatility per trading day using the formula:
Volatility per annum = Volatility per trading day
_
# trading days per annum
The life of an option is also usually measured using trading days rather than calendar days:
The number of trading days in a year is usually assumed to be 252 for stocks
The life of an option is calculated as T years, where:
T =
Number of trading days until option maturity
252
It is natural to assume that the volatility of a stock is caused by new info reaching the market:
This is not true, volatility is to a large extent caused by trading itself
The idea underlying the Black-Scholes-Merton dierential equation
Equation that must be satised by price of any derivative dependent on a non-dividend-paying stock
Derivation: Set up a riskless portfolio consisting of positions in derivative/stock
If no arbitrage, the return from the portfolio must be the risk-free interest rate r
74
The reason a riskless portfolio can be set up is that the stock price and the derivative price are both
aected by the same underlying source of uncertainty: Stock price movements
In any short period, derivative price perfectly correlated with price of underlying stock
When an appropriate portfolio of the stock and the derivative is established, the gain or loss from
the stock position always osets the gain or loss from the derivative position
The portfolio overall value at end of the short period of time is known with certainty
Example
Suppose that at a particular time, relationship between small change S in stock price and
resultant small change c in price of a European call option is: c = 0.4S
The riskless portfolio would consist of:
1. A long position in 0.4 shares
2. A short position in one call option
Suppose that stock price increases by 10 cents: Option price increases by 4c and the 400.10 =
$4 gain on the shares = the 100 0.04 = $4 loss on short option position
Important dierence between the Black-Scholes-Merton analysis and binomial model in Ch. 11
In Black-Scholes-Merton, position in stock/derivative is riskless for only a very short period
To remain riskless, it must be adjusted, or rebalanced, frequently
Assumptions to derive the Black-Scholes-Merton dierential equation
1. The stock price follows the process developed in Ch. 12 with and constant
2. The short selling of securities with full use of proceeds is permitted
3. There are no transactions costs or taxes. All securities are perfectly divisible
4. There are no dividends during the life of the derivative
5. There are no riskless arbitrage opportunities
6. Security trading is continuous
7. The risk-free rate of interest r is constant and the same for all maturities
Derivation of the Black-Scholes-Merton (BSM) dierential equation
The stock price process we are assuming is:
dS = Sdt +Sdz (6)
Suppose f = price of derivative on S. f must be a function of S and t. Hence, from Eq. (12.11):
df =
_
f
S
S +
f
t
+
1
2
2
f
S
2
2
S
2
_
dt +
f
S
Sdz (7)
The discrete versions of Eqs. (6) and (7) are:
S = St +Sz (8)
f =
_
f
S
S +
f
t
+
1
2
2
f
S
2
2
S
2
_
t +
f
S
Sz (9)
From Itos lemma, the Wiener processes underlying f and S are the same
A portfolio of stock/derivative can be constructed so that the Wiener process is eliminated
Such portfolio is: 1: derivative
+f/S: shares
Holder is short one derivative and long f/S shares. Dene = portfolio value:
= f +
f
S
S (10)
The change in the value of the portfolio in the time interval t is:
= f +
f
S
S =
_
f
t
1
2
2
f
S
2
2
S
2
_
t (11)
75
Because z not involved, the portfolio must be riskless during time t. Hence:
= rt (12)
Substituting Eqs. (10) and (11) into (12), we obtain:
_
f
t
+
1
2
2
f
S
2
2
S
2
_
t = r
_
f
f
S
S
_
t
f
t
+rS
f
S
+
1
2
2
S
2
2
f
S
2
= rf (13)
Black-Scholes-Merton dierential equation
Eq. (13) is the Black-Scholes-Merton dierential equation
Many solutions, corresponding to dierent derivatives dened with S as underlying variable
Particular derivative obtained when equation solved depends on the boundary conditions
European call option: The key boundary condition is: f = max(S K, 0) when t = T
European put option: The key boundary condition is: f = max(K S, 0) when t = T
The portfolio used in the derivation of Eq. (13) is not permanently riskless:
As S and t change, f/S also changes
To keep portfolio riskless, frequently rebalance relative proportions of derivative/stock
Example
Forward contract on non-dividend-paying stock = derivative on stock Must satisfy Eq. (13)
The value of the forward contract f at a general time t is in terms of the stock price S at this time:
f = S Ke
r(Tt)
, where K is the delivery price. Hence,
f
t
= rKe
r(Tt)
,
f
S
= 1,
2
f
S
2
= 0
When substituted into LHS of Eq. (13), we get: rKe
r(Tt)
+rS = rf Eq. (13) satised
The price of tradeable derivatives
Any function f(S, t) solution of Eq. (13) is the theoretical price of a derivative
If a derivative with that price existed, it would not create any arbitrage opportunities
Conversely, if a function f(S, t) does not satisfy the dierential Eq. (13), it cannot be the price of
a derivative without creating arbitrage opportunities for traders
Risk-neutral valuation
BSM Eq. (13) does not involve any variables aected by the risk preferences of investors
Variables: Stock price, time, stock volatility, and risk-free rate, all independent of risk preferences
The Black-Scholes-Merton dierential equation would not be independent of risk preferences if it
involved the expected return on the stock because does depend on risk preferences
If risk preferences do not enter the equation, they cannot aect its solution
Any set of risk preferences can be used when evaluating f (e.g., risk neutral world)
Consider derivative with payo at one particular time. Can be valued using risk-neutral valuation by:
1. Assume the expected return from underlying asset is the risk-free interest rate r (i.e = r)
2. Calculate the expected payo from the derivative
3. Discount the expected payo at the risk-free interest rate
Risk-neutral valuation is a device for obtaining solutions to the BSM equation
Solutions obtained are valid in all worlds, not just when investors are risk neutral
When we move from a risk-neutral world to a risk-averse world, two things happen:
1. The expected growth rate in the stock price changes
2. The discount rate that must be used for any payos from the derivative changes
It happens that these two changes always oset each other exactly
Application to forward contracts on a stock
Consider a long forward contract that matures at time T with delivery price K. The value of the
contract at maturity is S
T
K where S
T
is the stock price at time T
From the risk-neutral valuation argument, the value f of the forward contract at time 0 is its
expected value at time T in a risk-neutral world discounted at the risk-free rate of interest
76
f = e
rT
E(S
T
K) where
E denotes the expected value in a risk-neutral world
Since K is a constant, this equation becomes:
f = e
rT
E(S
T
) Ke
rT
(14)
Expected return = r in risk-neutral world From Eq. (3),
E(S
T
) = S
0
e
rT
and f = S
0
Ke
rT
Black-Scholes pricing formulas
The Black-Scholes formulas for the prices at time 0 of a European call option on a non-dividend-paying
stock and a European put option on a non-dividend-paying stock are:
c = S
0
N(d
1
) Ke
rT
N(d
2
) (15)
p = Ke
rT
N(d
2
) S
0
N(d
1
) (16)
d
1
and d
2
are dened as:
d
1
=
ln(S
0
/K) + (r +
2
/2)T
T
and d
2
=
ln(S
0
/K) + (r
2
/2)T
T
= d
1
T
N(x) = cumulative probability distribution for standardized Normal distribution (0, 1)
Deriving Black-Scholes formulas
One way is by solving the dierential Eq. (13) subject to the boundary condition
Another approach is to use risk-neutral valuation
To provide an interpretation of the terms in Eq. (15), we note that it can be written:
c = e
rT
[S
0
N(d
1
)e
rT
KN(d
2
)] (17)
The expression N(d
2
) is the probability that the option will be exercised in a risk-neutral world,
so that KN(d
2
) is the strike price times the probability that the strike price will be paid
S
0
N(d
1
)e
rT
= expected value (risk-neutral) of variable equal to S
T
if S
T
> K and 0 otherwise
Never optimal to exercise early an American call option on a non-dividend-paying stock
Eq. (15) is the value of an American call option on a non-dividend-paying stock
No analytic formula for value of American put on a non-dividend-paying stock
Black-Scholes in practice
In practice, interest rate r is set equal to the zero-coupon risk-free rate for maturity T
Time measured as [# of trading days left in option life] [# of trading days in 1 year]
Properties of the Black-Scholes formulas
When the stock price S
0
becomes very large, a call option is almost certain to be exercised
It becomes very similar to a forward contract with delivery price K
From Eq. (5.5), we expect the call price to be S
0
Ke
rT
This is the call price given by Eq. (15) because when S
0
becomes very large, both d
1
and d
2
become very large, and N(d
1
) and N(d
2
) become close to 1.0
When the stock price becomes very large, the price of a European put p approaches zero
Consistent with Eq. (16) because N(d
1
) and N(d
2
) both 0 in this case
Consider next what happens when the volatility approaches zero:
Because the stock is virtually riskless, its price will grow at rate r to S
0
e
rT
at time T and the
payo from a call option is: max(S
0
e
rT
K, 0)
Discounting at rate r, call value today = e
rT
max(S
0
e
rT
K, 0) = max(S
0
Ke
rT
, 0)
Consider rst the case where S
0
> Ke
rT
. This implies that ln(S
0
/K) +rT > 0:
As 0, d
1
, d
2
+, and N(d
1
), N(d
2
) 1.0 Eq. (15) becomes: c = S
0
Ke
rT
When S
0
< Ke
rT
, it follows that ln(S
0
/K) +rT < 0:
As 0, d
1
, d
2
, and N(d
1
), N(d
2
) 0 Eq. (15) gives: c = 0
The call price is therefore always max(S
0
Ke
rT
, 0) as tends to zero
Similarly, the put price is always max(Ke
rT
S
0
, 0) as tends to zero
77
Cumulative normal distribution function
In calculating the cumulative normal distribution function N(x), we can use:
(i) Tables for N(x), (ii) The NORMSDIST function in Excel, and (iii) A polynomial approximation
Warrants and employee stock options
Warrants and employee stock options are dierent from regular call options in that exercise leads to the
company issuing more shares and then selling them to the option holder for the strike price
As the strike price is less than the market price, this dilutes the interest of existing shareholders
How should potential dilution aect the way we value outstanding warrants/employee stock options?
The answer is that it should not! Assuming markets are ecient, the stock price will reect
potential dilution from all outstanding warrants and employee stock options
Consider company contemplating a new issue of warrants (or employee stock options)
Company interested in the cost of issue assuming there are no compensating benets
Assume that the company has N shares worth S
0
each and the number of new options contemplated
is M, with each option giving the holder the right to buy one share for K
Company value today = NS
0
. This value does not change as result of warrant issue
Suppose that without warrant issue, share price will be S
T
at warrants maturity With/without
warrant issue, the total value of the equity and the warrants at T will be NS
T
If warrants are exercised, cash inow from strike price increasing this to NS
T
+ MK. This value
is distributed among N +M shares Share price immediately after exercise becomes:
NS
T
+MK
N+M
Therefore the payo to an option holder if the option is exercised is:
NS
T
+MK
N +M
K =
N
N +M
(S
T
K)
Thus, the value of each option is the value of N/(N +M) regular calls c on the companys stock
The total cost of the options is M times this, i.e., MNc/(N +M)
Example
A company with 1 million shares worth $40 each is considering issuing 200,000 warrants each giving
the holder the right to buy one share with a strike price of $60 in 5 years
Interest rate = 3%, volatility = 30% per annum. The company pays no dividends
From Eq. (15), the value of a 5-year European call option on the stock is $7.04
In this case, N = 1, 000, 000 and M = 200, 000, so that the value of each warrant is:
1, 000, 000
1, 000, 000 + 200, 000
7.04 = $5.87
The total cost of the warrant issue is 200, 000 5.87 = $1.17 million
If market perceives no benets from warrant issue, we expect stock price to decline by $1.17
Implied volatilities
The one parameter in BSM that cannot be directly observed is the volatility of the stock price
It can be estimated from a history of the stock price
In practice, traders usually work with what are known as implied volatilities
These are the volatilities implied by option prices observed in the market
Suppose that a European call is worth 1.875 when S
0
= 21, K = 20, r = 0.1, and T = 0.25
The implied volatility is the value of that, when substituted into Eq. (15), gives c = 1.875
An iterative search procedure can be used to nd the implied
Similar procedure with binomial trees used to nd implied volatilities for American options
Implied volatilities are used to monitor the markets opinion about the volatility of a particular stock
Whereas historical volatilities are backward looking, implied volatilities are forward looking
Traders often quote the implied volatility of an option rather than its price
This is convenient because the implied volatility tends to be less variable than the option price
78
The VIX index
CBOE publishes indices of implied volatility. The most popular index (SPX VIX) is an index of
the implied volatility of 30-day options on the S&P 500 calculated from wide range of calls/puts
Trade involving futures/options on S&P 500 is a bet on both future level of S&P 500/its volatility.
By contrast, a futures/options contract on VIX is a bet only on volatility
Example
A trader buys an April futures contract on VIX when futures price is 18.5 (30-day S&P 500
volatility of 18.5%) and closes out contract when futures price is 19.3
The trader makes a gain of $800 (one contract is on 1,000 the index)
Dividends
Modied Black-Scholes model to take account of dividends
Assume amount/timing of dividends during option life can be predicted with certainty
For long-life options, assume that the dividend yield rather cash dividend payments is known
The date on which the dividend is paid should be assumed to be the ex-dividend date. On this
date the stock price declines by the amount of the dividend
For tax reasons, stock price may go down by somewhat less than the cash amount of dividend
We need to interpret the word dividend in the context of option pricing as the reduction in
the stock price on the ex-dividend date caused by the dividend
European options
European options analyzed by assuming that stock price = sum of 2 components:
(i) Riskless component for known dividends during option life and, (ii) Risky component
The riskless component, at any given time, is the PV of all the dividends during the life of the
option discounted from the ex-dividend dates to the present at the risk-free rate
By time option matures, dividends have been paid and riskless component no longer exists
Thus, BSM correct if S
0
is equal to the risky component of the stock price and is the volatility
of the process followed by the risky component
In theory, this is ,= volatility of stochastic process followed by whole stock price
The volatility of the risky component is approximately equal to the volatility of the whole
stock price multiplied by S
0
/(S
0
D), where D is the PV of the dividends
However, adjustment necessary only when volatilities estimated using historical data
An implied volatility is calculated after the PV of dividends have been subtracted from
the stock price and is the volatility of the risky component
Operationally, Black-Scholes can be used provided that stock price is reduced by PV of all dividends
during option life, with risk-free discounting from ex-dividend dates
American options
Can only be optimal to exercise American call immediately before stock goes ex-dividend
Assume n ex-dividend dates anticipated, at times t
1
, , t
n
with t
1
< < t
n
. The dividends
corresponding to these times are D
1
, , D
n
Consider the possibility of early exercise just prior to nal ex-dividend date (i.e., at t
n
)
If the option is exercised at time t
n
, the investor receives S(t
n
) K
If the option is not exercised, the stock price drops to S(t
n
) D
n
The value of the option is then greater than: S(t
n
) D
n
Ke
r(Ttn)
If S(t
n
) D
n
Ke
r(Ttn)
S(t
n
) K, i.e.:
D
n
K
_
1 e
r(Ttn)
_
Not optimal to exercise at t
n
(18)
On the other hand, if:
D
n
> K
_
1 e
r(Ttn)
_
Optimal to exercise at t
n
for high value of S(t
n
) (19)
The inequality in Eq. (19) will tend to be satised when the nal ex-dividend date is fairly
close to the maturity of the option (i.e., T t
n
is small) and the dividend is large
79
Consider next time t
n1
If the option is exercised immediately prior to time t
n1
, the investor receives S(t
n1
) K
If the option is not exercised at time t
n1
, the stock price drops to S(t
n1
) D
n1
and the
earliest subsequent time at which exercise could take place is t
n
Hence, lower bound to option price if not exercised at t
n1
= S(t
n1
) D
n1
Ke
r(tnt
n1
)
It follows that if S(t
n1
) D
n1
D
n
Ke
r(tnt
n1
)
S(t
n1
) K, or:
D
n1
K
_
1 e
r(tnt
n1
)
_
Not optimal to exercise immediately prior to t
n1
Similarly, for any i < n, if:
D
i
K
_
1 e
r(t
i+1
t
i
)
_
Not optimal to exercise immediately prior to t
i
(20)
The inequality in Eq. (20) is approximately equivalent to D
i
Kr(t
i+1
t
i
)
Assuming that K is fairly close to current stock price, inequality is satised when dividend
yield on the stock is less than risk-free rate of interest (often the case)
We can conclude that, in many circumstances, the most likely time for the early exercise of an
American call is immediately before the nal ex-dividend date t
n
Blacks approximation
Black suggests an approximate procedure for taking account of early exercise in call options:
Calculate prices of European options that mature at T and t
n
, and then set the American price
equal to the greater of the two (approximation works well in most cases)
Example
Consider an American option with dividends D
1
= D
2
= 0.5 in two and ve months, S
0
= 40,
K = 40, r = 0.09, t
1
= 2/12, and t
2
= 5/12
Since K[1 e
r(t
2
t
1
)
] = 40(1 e
0.090.25
) = 0.89 > 0.5, it follows from Eq. (20) that the
option should never be exercised immediately before the rst ex-dividend date
Since K[1 e
r(Tt
2
)
] = 40(1 e
0.090.0833
) = 0.30 < 0.5, it follows that, when suciently
deep in the money, option should be exercised immediately before 2nd ex-dividend date
Blacks approximation to value the option
PV of 1st dividend is 0.5e
0.16670.09
= 0.4926 Value of option, assuming it expires just
before nal ex-dividend date, can be calculated using BSM with S
0
= 40 0.4926 = 39.5074,
K = 40, r = 0.09, = 0.30, and T = 0.4167. It is $3.52
PV of the two dividends is 0.5e
0.16670.09
+ 0.5e
0.41670.09
= 0.9711 Value of option
exercised at the end of 6 months can be calculated using BSM with S
0
= 400.9711 = 39.0259,
K = 40, r = 0.09, = 0.30, and T = 0.5. It is $3.67
Blacks approximation involves taking the max of these 2 options American call = $3.67
Two reasons for dierences between Binomial Model (BM) and Blacks approximation (BA):
(i) The timing of the early exercise decision, and (ii) The way volatility is applied
The timing of the early exercise decision tends to make BM greater than BA
BA: Holder has to decide today whether option will be exercised after 5/6 months
BM: Decision on early exercise at 5-month point depends on stock price at that time
The way in which volatility is applied tends to make BA greater than BM
In BA, when exercise after 5m, volatility is applied to stock price less PV of 1st dividend
When exercise after 6m, volatility is applied to stock price less PV of both dividends
80
Black: Using the holes in Black-Scholes
The formula
Using the Black-Scholes formula, the value of the option increases with increases in the stocks price,
the interest rate, the time remaining until the option expires, and the stocks volatility
Except for volatility, which can be estimated several ways, we can observe all other factors
The stocks expected return doesnt appear in the formula
A higher expected return on the stock means a higher expected return on the option, but it doesnt
aect the options value for a given stock price
This feature of the formula is very general
How to improve the assumptions
The original derivation of the Black-Scholes formula involves the following assumptions:
The stocks volatility is known, and doesnt change over the life of the option
The stock price changes smoothly: It never jumps up or down a large amount in a short time
The short-term interest rate never changes
Anyone can borrow or lend as much as he wants at a single rate
An investor who sells the stock or the option short will have the use of all the proceeds of the sale
and receive any returns from investing these proceeds
There are no trading costs for either the stock or the option
An investors trades do not aect the taxes he pays
The stock pays no dividends
An investor can exercise the option only at expiration
There are no takeovers or other events that can end the options life early
Volatility changes
The volatility of a stock is not constant
Changes in volatility may impact certain options, especially far-out-of-the-money options
One measure of that uncertainty is the volatility of the volatility
Formula with volatility includes both current/expected future levels of volatility
Other measure about future volatility: Relation between future stock price/its volatility
A decline in the stock price implies a substantial increase in volatility, while an increase in the
stock price implies a substantial decrease in volatility
Cox/Ross have formulas that account for relation between future stock price/its volatility
Cox-Ross formula implies lower values for out-of-the-money call options than Black-Scholes
But putting in uncertainty about future volatility often implies higher values for these options
We cant tell how the option values will change when we put in both eects
You should buy volatility if you think volatility will rise, and sell volatility if you think it will fall
To buy volatility, buy options. To sell volatility, sell options
Instead of buying stock, you can buy calls or buy stock and sell calls
Strongest position on volatility: Add long/short position in straddles to existing position
To buy pure volatility, buy both puts/calls in ratio giving no added exposure to the stock
To sell pure volatility, sell both puts and calls in the same ratio
Jumps
Up jumps and down jumps have dierent eects on option values than symmetric jumps
Merton has a formula that reects possible symmetric jumps
Compared to the Black-Scholes formula, his formula gives higher values for both in-the-money and
out-of-the-money options and lower values for at-the-money options
The dierences are especially large for short-term options
Short-term options show strikingly dierent eects for up jumps and down jumps
An increase in the probability of an up jump will cause out-of-the-money calls to go way up in
value relative to out-of-the-money puts
An increase in the probability of a down jump will do the reverse
81
If you expect a symmetric jump, buy short-term out-of-the-money options
Instead of stock, you can hold call options or more stock plus put options
Or you can sell at-the-money options
Instead of stock, you can hold more stock and sell call options
Pure play on symmetric jumps: Buy out-of-the-money calls/puts, and sell at-the-money calls/puts
For up jumps, use similar strategies that involve buying short-term out-of-the-money calls, or selling
short-term out-of-the-money puts, or both. For down jumps, do the opposite
Interest rate changes
If future changes in the interest rate are known, we can just replace the short-term rate with the yield
on a zero-coupon bond that matures when the option expires
When the stocks volatility is known, Merton has shown that the zero-coupon bond yield will still work
At any point in time, use the zero-coupon bond yield at that time for the short-term rate
Eects of interest rate changes on option values not nearly as great as eects of volatility changes
Better o with direct positions in xed-income securities vs. options to play interest rate changes
Interest rate changes may aect your decisions to buy or sell options
Higher interest rates mean higher call values and lower put values
If you think interest rates will rise more than market thinks, be more inclined to buy calls, and
more inclined to buy more stocks/sell puts, as a substitute for a straight stock position
If you think interest rates will fall more than market thinks, these preferences should be reversed
Borrowing penalties
Rate at which one can borrow, even with collateral, is higher than rate at which one can lend
Also, margin requirements/restrictions put on by lenders may limit amount one can borrow
High rates and limits on borrowing may cause a general increase in call option values, since calls provide
leverage that can substitute for borrowing
The interest rates implied by option values may be higher than lending rates
If this happens and you have borrowing limits but no limits on option investments, buy calls
If you can borrow freely at lending rate, get leverage by borrowing rather than by buying calls
When implied interest rates high, conservative investors buy puts/sell calls to protect a portfolio
Fixed-income investors might even choose to buy stocks and puts, and sell calls, to create a synthetic
xed-income position with a yield higher than market yields
Short-selling penalties
Short-selling penalties are generally even worse than borrowing penalties
An investor must go to the expense of borrowing stock if he wants to sell it short
Part of his expense involves putting up cash collateral, generally at interest below market rates
Also, investors may have to put up margin with brokers in cash, and may not receive interest
For options, the penalties tend to be much less severe
An investor need not borrow an option to sell it short
Investor loses much less interest income in selling an option short than in selling a stock short
Penalties on short selling that apply to all investors will aect option values
When even professional investors have trouble selling a stock short, we will want to include an
element in the option formula to reect the strength of these penalties
Sometimes we approximate this by assuming an extra dividend yield on the stock, in an amount
up to the cost of maintaining a short position as part of a hedge
If you want to short a stock but face penalties/restrictions:
You can short it indirectly by holding put options
Or by taking a naked short position in call options
When facing penalties in selling short, you face rewards for lending stock to those who want to short it
In this situation, strategies involving holding stock/lending it out may dominate
E.g., you might create a position with a limited downside by holding a stock and a put on the
stock, and by lending the stock to those who want to short it
82
Trading costs
Trading costs make it hard to create an option-like payo by trading in the underlying stock
They can also make it hard to create a stock-like payo by trading in the option
Sometimes trading costs can increase an options value, and sometimes they can decrease it
We cant tell how trading costs aect an options value They create a band of possible values
Within this band, it will be impractical for most investors to take advantage of mispricing by selling
the option and buying the stock, or by selling the stock and buying the option
Trading costs make options useful if you want to shift exposure to the stock after it goes up/down
To shift your exposure to the market as a whole, rather than to a stock, options even more useful
It is often more costly to trade in a basket of stocks than in a single stock
But you can use index options to reduce your trading in the underlying stocks or futures
Taxes
The very existence of taxes will aect option values
A hedged position giving the same return as lending may have a tax ,= tax on interest
The fact that investor tax rates dier will aect values too
Tax authorities adopt a variety of rules to restrict tax arbitrage
Rules to limit interest/capital loses deductions, rules to tax gains/losses before position closed out
E.g., most US index option positions are now taxed each year, partly as short-term capital gains
and partly as long-term capital gains, whether or not the taxpayer has closed out his positions
If you pay taxes on gains and cannot deduct losses, you may want to limit the volatility of your positions
and have the freedom to control the timing of gains and losses
This will aect how you use options, and may aect option values as well
Hard to predict, though, whether it will increase or decrease option values
Investors who buy a put have a capital gain/loss at end of year, or when option expires
Investors who simulate the put by trading in underlying stock sell after decline, buy after rise
By choosing which lots of stock to buy and which lots to sell, they will be able to generate a series
of realized capital losses and unrealized gains
The tax advantages of this strategy may reduce put values for many taxable investors
Similarly, tax advantages of a simulated call reduce call values for most taxable investors
Dividends and early exercise
Dividends reduce call option values and increase put option values
Dividends make early exercise of a call more likely, and early exercise of a put less likely
Several ways to change the BS formula to account for dividends:
1. Assume dividend yield constant for all possible stock price levels/all future times
2. Assume issuer has money set aside to pay dollar dividends due before option expires
3. Assume dividend depends in a known way on stock price at each ex-dividend date
Cox, Ross and Rubinstein tree approach is more exible than BS formula
Use dividend that the rm will pay for each possible future stock price at each future time
We can also test, at each node of the tree, whether an investor will exercise the option early
Option values reect markets belief about stocks future dividends and likelihood of early exercise:
If dividends higher than market thinks, buy puts or sell calls, other things equal
If option holders exercise too early/late, sell options to prot from opportunities the holders create
Takeovers
If A takes over B through exchange of stock, options on Bs stock will become options on As stock
We will use As volatility rather than Bs in valuing the option
If rm A takes over rm B through a cash tender oer, there are two eects:
1. Outstanding options on B will expire early Reduced values for both puts and calls
2. Bs stock price rises through tender oer premium Increased call values/decreased put values
But when market knows of a possible tender oer from A, Bs stock price higher than normal
All these factors work together to inuence option values
83
The chance of a takeover will make an options value sometimes higher and sometimes lower
For short-term out-of-the-money call, chance of takeover increases the option value
For short-term out-of-the-money put, chance of a takeover reduces the option value
Eects of takeover probability can be dramatic for these short-term out-of-the-money options
Portfolio insurance
Any strategy where you reduce your stock positions when prices fall, and increase them when prices rise
Some use option formulas to gure how much to increase/reduce positions as prices change
However, assumptions behind BSM aect portfolio insurance strategies that dont use the formula
The higher your trading costs, the less likely you are to create synthetic options
Sometimes, futures are priced against the portfolio insurers
This makes all portfolio insurance strategies less attractive
Portfolio insurance using synthetic strategies wins when market jumps big, but without much volatility
It loses when market volatility is high, because an investor will sell after a fall, and buy after a rise
The investor loses money on each cycle
But the true cost of portfolio insurance is a factor that doesnt even aect option values:
It is the mean reversion in market: Rate at which expected return on market falls as market rises
Mean reversion is what balances supply and demand for portfolio insurance:
High mean reversion will discourage portfolio insurers because it will mean they are selling when
expected return is higher and buying when expected return is lower
For the same reason, high mean reversion will attract value investors or tactical asset allocators
who buy after a decline and sell after a rise
Value investors use indicators like P/E ratios and dividend yields to decide when to buy/sell
They act as sellers of portfolio insurance
If mean reversion were zero, more investors would want to buy portfolio insurance than to sell it:
People have a natural desire to try to limit their losses
But, on balance, there must be as many sellers as buyers of insurance
What makes this happen is a positive normal level of mean reversion
The October 1987 crash
During 1987, investors shifted toward wanting more portfolio insurance
This increased the markets mean reversion
Because of mean reversion, market rise in 87 caused sharper-than-usual fall in expected return
But investors didnt see this at rst
They continued to buy, as their portfolio insurance strategies suggested
Around October 19, the full truth of what was happening hit investors:
They saw that at existing levels of market, expected return was much lower than assumed
They sold at those levels. Market fell, and expected return rose, until equilibrium restored
Mean reversion and stock volatility
Mean reversion and volatility estimates
If good estimate of volatility, stocks expected return wont aect option values
Neither will mean reversion
But mean reversion may inuence your estimate of the stocks volatility
With mean reversion day-to-day volatility will be higher than month-to-month volatility, which
will be higher than year-to-year volatility
Your volatility estimates for options with several years of life should be generally lower than
your volatility estimates for options with several days or several months of life
Using your views of mean reversion for investing
If mean reversion higher than markets, buy short-term options/sell long-term options
If you think mean reversion is lower, you can do the reverse
If you are a buyer of options, you will favor short-term options when you think mean reversion is
high, and long-term options when you think it is low
84
If you are a seller of options, you will favor long-term options when you think mean reversion is
high, and short-term options when you think its low
Eects most striking in stock index options, but also in individual stock options
Trend followers act like portfolio insurers, but they trade individual stocks rather than portfolios
When the stock rises, they buy. When it falls, they sell
They act as if the past trend in a stocks price is likely to continue
In individual stocks, as in portfolios, mean reversion should normally make implied volatilities
higher for short-term options than for long-term options
85
86
Hull - Ch. 15: Options on stock indices and currencies
Options on stock indices
Several exchanges trade options on stock indices
Some indices track movement of whole market, others based on performance of particular sector
One index option contract is on 100 times the index, index options are settled in cash
On exercise, holder of call receives (S K) 100 in cash and writer pays this amount in cash,
where S = index value at close of trading day and K = strike price
Similarly, holder of put option (K S) 100 in cash, paid by writer of the option
Portfolio insurance
Portfolio managers can use index options to limit their downside risk:
Suppose that value of an index today is S
0
, consider well-diversied portfolio ( = 1.0)
Assuming portfolio dividend yield = index dividend yield, we expect:
[% changes in portfolio value] [% changes in index value]
The portfolio value is protected against index falling below K if, for each 100 S
0
dollars in the
portfolio, the manager buys one put option contract with strike price K
When the portfolios beta is not 1.0
If portfolios ,= 1.0, put options must be purchased for each 100S
0
dollars in portfolio
Suppose a $500,000 portfolio has = 2.0 instead of 1.0, and S&P 500 index = 1,000
The number of put options required is: 2.0 500, 000/(1, 000 100) = 10
To calculate the appropriate strike price, the CAPM can be used
Suppose risk free rate = 12%, dividend yield on both index/portfolio is 4%, and protection is
required against portfolio value dropping below $450,000 in next three months
Calculation of expected value of portfolio when the index is 1,040 in three months and = 2.0
Value of index in three months: 1, 040
Return from change in index: 40/1, 000, or 4% per three months
Dividends from index: 0.25 4 = 1% per three months
Total return from index: 4 + 1 = 5% per three months
Risk-free interest rate: 0.25 12 = 3% per three months
Excess return from index over r
f
: 5 3 = 2% per three months
Expected excess return from portfolio over r
f
: 2 2 = 4% per three months
Expected return from portfolio: 3 + 4 = 7% per three months
Dividends from portfolio: 0.25 4 = 1% per three months
Expected increase in value of portfolio: 7 1 = 6% per three months
Expected value of portfolio: $500, 000 1.06 = $530, 000
Strike price for options purchased = index level corresponding to required protection level
Two reasons why the cost of hedging increases as the of a portfolio increases:
1. More put options are required
2. They have a higher strike price
Currency options
Currency options are primarily traded in the OTC market
The advantage of this market is that large trades are possible, with strike prices, expiration dates,
and other features tailored to meet the needs of corporate treasurers
Example - European call to buy 1M with USD at 1.2000 USD per
If actual exchange rate at maturity = 1.25, payo = 1, 000, 000 (1.25 1.20) = $50, 000
Similarly, example of European put option giving the holder the right to sell 10M Australian dollars
for USD at an exchange rate of 0.70 USD per AUD
If actual exchange rate at maturity = 0.67, payo is 10, 000, 000 (0.70 0.67) = $300, 000
To hedge FX exposure, foreign currency options are a good alternative to forward contracts:
If due to receive at known future time, hedge risk with a put on that mature at that time
Hedging strategy guarantees that exchange rate applicable to will not be < strike price, while
allowing company to benet from favorable exchange-rate movements
87
Similarly, a company due to pay sterling at a known time in the future can hedge by buying calls
on sterling that mature at that time
This hedging strategy guarantees that the cost of the sterling will not be greater than a certain
amount while allowing the company to benet from favorable exchange-rate movements
Forward contract locks in exchange rate for future transaction, option provides type of insurance
This is not free. It costs nothing to enter a forward, but options require a premium up front
Range forwards
Range forward: Variation on a standard forward contract for hedging FX risk:
Consider a US company that knows it will receive 1M in three months
Suppose that the three-month forward exchange rate is 1.9200 dollars per pound
Company can lock in this rate by entering a short forward to sell 1M in 3 months
This would ensure that the amount received for the 1M is $1,920,000
Short range forward contract
Alternative: Buy a European put with strike price of K
1
and sell a European call with strike
price K
2
, where K
1
< 1.9200 < K
2
. See payo in Fig. 1(a), both options are on 1M
If the exchange rate in three months proves to be less than K
1
, the put option is exercised and
as a result the company is able to sell the 1M at an exchange rate of K
1
If the exchange rate is between K
1
and K
2
, neither option is exercised and the company gets
the current exchange rate for the 1M
If the exchange rate is greater than K
2
, the call option is exercised against the company with
the result that the 1M is sold at an exchange rate of K
2
The exchange rate realized for the 1M is shown in Fig. 2
(b) Long range-forward contract
1
K
2
K
(a) Short range-forward contract
2
K
1
K Asset
Price
Price
Asset
P
a
y
o
f
f
P
a
y
o
f
f
Figure 1: Payos from short/long range-forwards
- Long range-forward contract used to hedge a future foreign currency outflow
- Short range-forward contract used to hedge a future foreign currency inflow
Exchange rate in market
Exchange rate realized with range-forward contract
2
K
1
K
1
K
2
K
Figure 2: FX rate realized with range-forwards
Long range forward contract
If company knew it was due to pay rather than receive 1M in 3 months, it could sell a
European put with strike price K
1
and buy a European call with strike price K
2
[Fig. 1(b)]
Exchange rate paid for 1M = rate received for 1M in short range-forward example
In practice, range forward set up so that price of put = price of call (select K
1
, K
2
such that p = c)
It costs nothing to set up the range forward ( costs nothing to set up regular forward)
As strike prices of call/put become closer in a range forward, it becomes a regular forward
A short range forward contract becomes a short forward contract
A long range forward contract becomes a long forward contract
Options on stocks paying known dividend yields
Dividends cause stock prices to reduce on the ex-dividend date by amount of dividend payment
The payment of a dividend yield at rate q therefore causes the growth rate in the stock price to be
less than it would otherwise be by an amount q
If, with a dividend yield of q, the stock price grows from S
0
today to S
T
at time T, then in the
absence of dividends it would grow from S
0
today to S
T
e
qT
at time T
Alternatively, in the absence of dividends it would grow from S
0
e
qT
today to S
T
at time T
88
We get the same probability distribution for the stock price at time T in each of the two cases:
1. The stock starts at price S
0
and provides a dividend yield at rate q
2. The stock starts at price S
0
e
qT
and pays no dividends
When valuing European option lasting for T on stock paying known dividend yield q, we reduce current
stock price from S
0
to S
0
e
qT
and then value the option as if the stock pays no dividends
Lower bounds for option prices
Substituting S
0
e
qT
for S
0
in Eq. (9.1), a lower bound for the European call c is:
c S
0
e
qT
Ke
rT
(1)
We can also prove this directly by considering the following two portfolios:
Portfolio A: One European call option plus an amount of cash = Ke
rT
Portfolio B: e
qT
shares with dividends being reinvested in additional shares
To obtain a lower bound for European put, we similarly replace S
0
by S
0
e
qT
in Eq. (9.2) to get:
p Ke
rT
S
0
e
qT
(2)
This result can also be proved directly by considering the portfolios:
Portfolio C: One European put plus e
qT
shares with dividends reinvested in more shares
Portfolio D: An amount of cash equal to Ke
rT
Put-call parity
Replacing S
0
by S
0
e
qT
in Eq. (9.3), put-call parity for option on stock paying dividend yield q:
c +Ke
rT
= p +S
0
e
qT
(3)
This result can also be proved directly by considering the two portfolios:
Portfolio A: One European call option plus an amount of cash = Ke
rT
Portfolio C: One European put plus e
qT
shares with dividends reinvested in more shares
Both portfolios are both worth max(S
T
, K) at time T
They must be worth the same today, and the put-call parity result in Eq. (3) follows
For American options, the put-call parity relationship is:
S
0
e
qT
K < C P < S
0
Ke
rT
Pricing formulas
By replacing S
0
by S
0
e
qT
in Black-Scholes, Eqs. (13.15) and (13.16), we obtain the price c of a
European call and the price p of a European put on a stock paying a dividend yield q as:
c = S
0
e
qT
N(d
1
) Ke
rT
N(d
2
) (4)
p = Ke
rT
N(d
2
) S
0
e
qT
N(d
1
) (5)
Since ln(S
0
e
qT
/K) = ln(S
0
/K) qT, d
1
and d
2
are given by:
d
1
=
ln(S
0
/K) + (r q +
2
/2)T
T
and d
2
=
ln(S
0
/K) + (r q
2
/2)T
T
= d
1
T
Dividend should, for the purposes of option valuation, be dened as the reduction in the stock
price on the ex-dividend date arising from any dividends declared
If the dividend yield rate is known but not constant during the life of the option, Eqs. (4) and (5)
are still true, with q equal to the average annualized dividend yield during the options life
Dierential equation and risk-neutral valuation
When including a dividend yield of q in analysis of Ch. 13, the equation (13.13) becomes:
f
t
+ (r s)S
f
S
+
1
2
2
S
2
2
f
S
2
= rf (6)
89
Like Eq. (13.13), this does not involve any variable aected by risk preferences
The risk-neutral valuation procedure can be used
Risk-neutral world: Expected growth rate must be r q Risk-neutral process for stock price:
dS = (r q)Sdt +Sdz (7)
To value a derivative dependent on a stock that provides a dividend yield equal to q, we set the
expected growth rate of the stock equal to r q and discount the expected payo at rate r
When expected growth rate in stock price is r q, expected stock price at T = S
0
e
(rq)T
The expected payo for a call option in a risk-neutral world is then:
e
(rq)T
S
0
N(d
1
) KN(d
2
)
Discounting at rate r for time T leads to Eq. (4)
Valuation of European stock index options
The index can be treated as an asset paying a known yield
Eqs. (1) and (2) provide a lower bound for European index options
Eq. (3) is the put-call parity result for European index options
Eqs. (4) and (5) can be used to value European options on an index
The binomial tree approach can be used for American options
In all cases, S
0
is equal to the value of the index, is equal to the volatility of the index, and q is
equal to the average annualized dividend yield on the index during the life of the option
Calculation of q should include only dividends whose ex-dividend date occurs during option life
If absolute amount of dividend paid on stocks underlying index (rather than dividend yield) is known,
Black-Scholes can be used with initial stock price reduced by PV of dividends
Dicult to implement for broad stock index (requires dividends on each stock underlying index)
Forward prices
Dene F
0
as the forward price of the index for a contract with maturity T:
As shown by Eq. (5.3), F
0
= S
0
e
(rq)T
European call/put price c/p in Eqs. (4)/(5) are:
c = F
0
e
rT
N(d
1
) Ke
rT
N(d
2
) (8)
p = Ke
rT
N(d
2
) F
0
e
rT
N(d
1
) (9)
d
1
=
ln(F/K) +
2
T/2
T
and d
2
=
ln(F/K)
2
T/2
T
The put-call parity relationship in Eq. (3) can be written:
c +Ke
rT
= p +F
0
e
rT
F
0
= K + (c p)e
rT
(10)
If, as common in exchange-traded markets, pairs of puts/calls with same strike price are traded
actively for a maturity date, Eq. (10) can estimate the forward price of index for that maturity
Once forward prices of the index for dierent maturities have been obtained, the term structure
of forward rates can be estimated, and other options valued using Eqs. (8)/(9)
Advantage: The dividend yield on the index does not have to be estimated explicitly
Implied dividend yields
If estimates of the dividend yield are required (e.g because an American option is being valued),
calls and puts with the same strike price and time to maturity can be used. From Eq. (3),
q =
1
T
ln
c p +Ke
rT
S
0
For a particular strike price and time to maturity, the estimates of q calculated from this equation
are liable to be unreliable. But when the results from many matched pairs of calls and puts are
combined, a clear picture of the dividend yield being assumed by the market emerges
90
Valuation of European currency options
For currency options, S
0
= spot exchange rate. S
0
= value of one unit of foreign currency in USD
A foreign currency is analogous to a stock paying a known dividend yield:
Owner of foreign currency receives a yield = risk-free interest rate r
f
in foreign currency
Eqs. (1) and (2) with q replaced by r
f
provide bounds for the European call/put price c/p:
c S
0
e
r
f
T
Ke
rT
and p Ke
rT
S
0
e
r
f
T
Eq. (3) with q replaced by r
f
provides the put-call parity result for currency options:
c +Ke
rT
= p +S
0
e
r
f
T
Finally, Eqs. (4) and (5) provide pricing formulas for currency options when q replaced by r
f
:
c = S
0
e
r
f
T
N(d
1
) Ke
rT
N(d
2
) (11)
p = Ke
rT
N(d
2
) S
0
e
r
f
T
N(d
1
) (12)
d
1
=
ln(S
0
/K) + (r r
f
+
2
/2)T
T
and d
2
=
ln(S
0
/K) + (r r
f
2
/2)T
T
= d
1
T
Both the domestic interest rate r and the foreign interest rate r
f
are the rates for a maturity T
Put and call options on a currency are symmetrical in that a put option to sell currency A for currency
B at strike price K is the same as a call option to buy B with currency A at strike price 1/K
Using forward exchange rates
Because banks and other nancial institutions trade forward contracts on foreign exchange rates
actively, foreign exchange rates are often used for valuing options
Eq. (5.9): Forward rate F
0
for maturity T is: F
0
= S
0
e
(rr
f
)T
Eqs. (11) and (12) simplify to:
c = e
rT
[F
0
N(d
1
) KN(d
2
)] (13)
p = e
rT
[KN(d
2
) F
0
N(d
1
)] (14)
d
1
=
ln(F
0
/K) +
2
T/2
T
and d
2
=
ln(F
0
/K)
2
T/2
T
= d
1
T
The maturity of forward/futures contract must be the same as maturity of European option
American options
Binomial trees can be used to value American options on indices and currencies
As with American options on non-dividend-paying stock, u determines size of up moves and is set
equal to e
t
, where = volatility and t = length of time steps
The parameter determining the size of down movements d is set equal to 1/u, or e
t
For a non-dividend-paying stock, the probability of an up movement is:
p =
a d
u d
where a = e
rt
For options on indices and currencies, the formula for p is the same, but a is dened dierently. In
the case of options on an index, with q as the dividend yield on the index: a = e
(rq)t
In the case of options on a currency, with r
f
as the foreign risk-free rate: a = e
(rr
f
)t
In some circumstances it is optimal to exercise American currency options prior to maturity:
Thus, American currency options are worth more than their European counterparts
In general, call options on high-interest currencies and put options on low-interest currencies are
the most likely to be exercised prior to maturity
Reason: A high-/low-interest currency is expected to depreciate/appreciate
91
92
Hull - Ch. 16: Futures options
Nature of futures options
A futures option is the right to enter into a futures contract at a certain futures price by a certain date
Futures options are generally American, can be exercised any time during contract life
A call futures option is the right to enter into a long futures contract at a certain price
If a call futures option is exercised, holder acquires a long position in underlying futures contract
plus cash amount equal to [most recent settlement futures price] [strike price]
Eective payo from call futures option = [futures price at time of exercise] [strike price]
A put futures option is the right to enter into a short futures contract at a certain price
If a put futures option is exercised, holder acquires a short position in underlying futures
contract plus cash amount equal to [strike price] [most recent settlement futures price]
Eective payo from put futures option = [strike price] [futures price at time of exercise]
Example
Suppose it is August 15 and an investor has one September futures call option contract on copper
with a strike price of 240 cents per pound. One futures contract is on 25,000 pounds of copper
Suppose that the futures price of copper for delivery in September is currently 251 cents, and at
the close of trading on August 14 (the last settlement) it was 250 cents
If the option is exercised, the investor receives a cash amount of 25, 000(250240) cents = $2,500
plus a long position in a futures contract to buy 25,000 pounds of copper in September
Position in futures contract can be closed out immediately: This leaves investor with $2,500 cash
payo plus 25, 000(251250) cents = $250 reecting change in futures price since last settlement
The total payo from exercising the option on August 15 is $2,750, which equals 25, 000(F K),
where F is the futures price at the time of exercise and K is the strike price
Expiration months
Futures options referred to by delivery month of underlying futures, not by expiration of option
Options on interest rate futures
The most actively traded interest rate options oered by exchanges in the US are those on Treasury
bond futures, Treasury note futures, and Eurodollar futures
Treasury bond futures option (traded on CBOT):
One Treasury bond futures contract is for the delivery of $100,000 of Treasury bonds
Price of T-bond futures option quoted as % of face value of underlying T-bonds to nearest
1
64
Option on Eurodollar futures (traded on the CME):
When the Eurodollar futures quote changes by 1 basis point, or 0.01%, there is a gain or loss
on a Eurodollar futures contract of $25
Similarly, in the pricing of options on Eurodollar futures, 1 basis point represents $25
Interest rate futures option contracts work in the same way as other futures options contracts
In addition to cash payo, holder of call option obtains a long position in the futures contract
when the option is exercised and option writer obtains a corresponding short position
The total payo from the call, including the value of the futures position, is max(F K, 0),
where F is the futures price at the time of exercise and K is the strike price
Interest rate futures prices increase when bond prices increase (i.e., when interest rates fall).
They decrease when bond prices decrease (i.e., when interest rates rise)
Example - Eurodollar futures options
It is February and the futures price for the June Eurodollar contract is 93.82 (corresponding
to a 3-month Eurodollar interest rate of 6.18% per annum)
Price of call option on the contract with strike price of 94.00 is quoted as 0.1, or 10 bp
Option attractive to investor who feels that interest rates are likely to come down
Suppose short-term rates drop by 100 bp and investor exercises the call when Eurodollar futures
price is 94.78 ( 3-month Eurodollar interest rate of 5.22% per annum)
The payo is 25 (94.78 94.00) 100 = $1, 950. The cost of the contract is 10 25 = $250
The investors prot is $1,700
93
Example - Treasury bond futures options
In August, futures price for December T-bond is 96-09 (or 96
9
32
= 96.28125)
An investor who feels that the yield on long-term government bonds will fall by December
might choose to buy December calls with a strike price of 98
Assume that the price of these calls is 1-04 (or 1
4
64
= 1.0625% of the principal)
If long-term rates fall and the Treasury bond futures price rises to 100-00, the investor will
make a net prot per $100 of bond futures of 100.00 98.00 1.0625 = 0.9375
Since one option contract is for the purchase or sale of instruments with a face value of $100,000,
the investor would make a prot of $937.50 per option contract bought
Reasons for the popularity of futures options
People choose to trade options on futures rather than options on the underlying asset because a futures
contract is, in many circumstances, more liquid and easier to trade than the underlying asset
Furthermore, a futures price is known immediately from trading on the futures exchange, whereas
the spot price of the underlying asset may not be so readily available
Futures on commodities are also often easier to trade than the commodities themselves
An important point about a futures option is that exercising it does not usually lead to delivery of the
underlying asset, as in most circumstances the underlying futures contract is closed out prior to delivery
Futures options are therefore normally eventually settled in cash
This is appealing to many investors, particularly those with limited capital who may nd it dicult
to come up with the funds to buy the underlying asset when an option is exercised
Other advantage: Futures and futures options are traded in pits side by side in the same exchange
This facilitates hedging, arbitrage, and speculation and make the markets more ecient
Futures options also tend to entail lower transactions costs than spot options in many situations
European spot and futures options
The payo from a European call option with strike price K on the spot price of an asset is max(S
T
K, 0)
where S
T
is the spot price at the options maturity
The payo from a European call option with the same strike price on the futures price of the asset is
max(F
T
K, 0) where F
T
is the futures price at the options maturity
If futures contract matures at same time as option, then F
T
= S
T
and the two options are equivalent
Similarly, a European futures put option is worth the same as its spot put option counterpart when
the futures contract matures at the same time as the option
European futures options can be used to value corresponding European spot options
Put-call parity
Consider European call/put futures options, both with strike price K/time to expiration T
Portfolio A: European call futures option plus an amount of cash = Ke
rT
Portfolio B: European put futures option + long futures contract + cash = F
0
e
rT
In portfolio A, the cash can be invested at the risk-free rate r and grows to K at time T
Let F
T
be the futures price at maturity of the option
If F
T
> K, the call option in portfolio A is exercised and portfolio A is worth F
T
If F
T
< K, the call is not exercised and portfolio A is worth K
The value of portfolio A at time T is therefore max(F
T
, K)
In portfolio B, the cash can be invested at the risk-free rate to grow to F
0
at time T
The put option provides a payo of max(K F
T
, 0)
The futures contract provides a payo of F
T
F
0
The value of portfolio B at time T is therefore F
0
+ (F
T
F
0
) + max(K F
T
, 0) = max(F
T
, K)
Because portfolios have same value at T and no early exercise, they are worth the same today
The value of portfolio A today is: c +Ke
rT
where c is the price of the call futures option
Marking-to-market ensures that futures contract in portfolio B is worth zero today. Portfolio B is
therefore worth p +F
0
e
rT
where p is the price of the put futures option. Hence,
c +Ke
rT
= p +F
0
e
rT
(1)
94
The dierence between this put-call parity relationship and the one for a non-dividend-paying stock in
Eq. (9.3) is that the stock price S
0
is replaced by the discounted futures price F
0
e
rT
When futures contract matures at same time as option, European futures/spot options are the same
Eq. (1) gives a relationship between price of call option on spot price, price of put option on spot
price, and futures price when both options mature at same time as futures contract
For American futures options, the put-call relationship is:
F
0
e
rT
K < C P < F
0
Ke
rT
(2)
Bounds for futures options
Because the price of a put p cannot be negative, it follows from Eq. (1) that:
c (F
0
K)e
rT
(3)
Similarly, because the price of a call option cannot be negative, it follows from Eq. (1) that:
p (K F
0
)e
rT
(4)
Prices of European call/put options are close to lower bounds when options are deep in the money:
Call option deep in the money Put option deep out of the money p is very close to zero
Dierence between c and its lower bound equals p Price of call option very close to lower bound
Because American futures options can be exercised at any time: C F
0
K and P K F
0
Lower bound for American option price is always higher than lower bound for European option price,
because always some chance that an American futures option will be exercised early
Valuation of futures options using binomial trees
Key dierence between futures-/stock options: No up-front costs with a futures option
Consider futures price starting at F
0
and rising to F
0
u or declining to F
0
d over time period T
We consider an option maturing at time T
Suppose that its payo is f
u
if the futures price moves up and f
d
if it moves down
Riskless portfolio: Short position in one option combined with long position in futures contracts:
=
f
u
f
d
F
0
u F
0
d
The value of the portfolio at time T is then always: (F
0
u F
0
)f
u
Denoting the risk-free interest rate by r, the portfolio value today is: [(F
0
u F
0
)f
u
]e
rT
Another expression for the PV of the portfolio is f, where f is the value of the option today
Thus: f = [(F
0
u F
0
)f
u
]e
rT
. Substituting for :
f = e
rT
[pf
u
+ (1 p)f
d
] where p =
1 d
u d
(5)
Eq. (5) gives the risk-neutral probability of an up movement
Multistep trees
Multistep binomial trees are used to value American-style futures options like stock options
The parameter u dening up movements in the futures price is e
t
, where is the volatility of
the futures price and t is the length of one time step
The probability of an up movement in the future price is [Eq. (5)]: p = (1 d)/(u d)
Drift of a futures price in a risk-neutral world
In a risk-neutral world, a futures price behaves in the same way as a stock paying a dividend yield at
the domestic risk-free interest rate r. Clues:
p in a binomial tree for futures price is like for stock paying a dividend yield q when q = r
The put-call parity relationship for futures options prices is the same as that for options on a stock
paying a dividend yield q when the stock price is replaced by the futures price and q = r
95
Calculation of the drift of a futures price in a risk-neutral world:
Dene F
t
as the futures price at time t
If we enter into a long futures contract today, its value is zero
At time t (the rst time it is marked to market) it provides a payo of F
t
F
0
If r is the very-short-term (t-period) interest rate at time 0, risk-neutral valuation gives the value
of the contract at time 0 as: e
rt
E[F
t
F
0
] where
E = expectations in a risk-neutral world
Hence, e
rt
E[F
t
F
0
] = 0, showing that
E[F
t
] = F
0
Similarly,
E[F
2t
] = F
t
,
E[F
3t
] = F
2t
, . . .
E[F
T
] = F
0
for any time T
The drift of the futures price in a risk-neutral world is therefore zero. From Eq. (15.7), the futures price
behaves like a stock providing a dividend yield q equal to r
Usual assumption made for process followed by futures price F in risk-neutral world is ( constant):
dF = Fdz (6)
Dierential equation
The dierential equation satised by a derivative dependent on a futures price is:
f
t
+
1
2
2
f
S
2
2
F
2
= rf (7)
Same form as Eq. (15.6) with q r. This conrms that, for valuing derivatives, a futures price
can be treated like a stock providing a dividend yield at rate r
Blacks model for valuing futures options
Assuming that the futures price follows (lognormal) process in Eq. (6), European call/put price c/p for
futures option are given by Eqs. (15.4)/(15.5) with S
0
replaced by F
0
and q = r:
c = e
rT
[F
0
N(d
1
) KN(d
2
)] (8)
p = e
rT
[KN(d
2
) F
0
N(d
1
)] (9)
d
1
=
ln(F
0
/K) +
2
T/2
T
and d
2
=
ln(F
0
/K)
2
T/2
T
= d
1
T
When the cost of carry and the convenience yield are functions only of time, the volatility of the futures
price is the same as the volatility of the underlying asset
Blacks model does not require the option contract and the futures contract to mature at same time
Using Blacks model instead of Black-Scholes
Futures-/spot options are equivalent when option/futures contract matures at same time
Eqs. (8) and (9) allow to value European options on spot price of an asset
The variable F
0
in Eqs. (8) and (9) is set equal to either the futures or the forward price of the
underlying asset for a contract maturing at the same time as the option
Eqs. (15.13) and (15.14) show Blacks model being used to value European options on the spot
value of a currency: They avoid the need to estimate the foreign risk-free interest rate explicitly
Eqs. (15.8) and (15.9) show Blacks model being used to value European options on the spot value
of an index: They avoid the need to estimate the dividend yield explicitly
Blacks model useful to imply term structure of forward rates from actively traded index options
The forward rates can then be used to price other options on the index
The same approach can be used for other underlying assets
American futures options vs. American spot options
Traded futures options are in practice usually American
Assuming risk-free rate r > 0, it may be optimal to exercise an American futures option early
American futures options are worth more than their European counterparts
It is not generally true that an American futures option is worth the same as the corresponding American
spot option when the futures and options contracts have the same maturity
96
Assume a normal market with futures prices consistently higher than spot prices prior to maturity
Common with most stock indices, gold, silver, low-interest currencies, and some commodities
When futures option are exercised early, they provide a greater prot to the holder
American call futures option must be worth corresponding American spot call option
Similarly, American put futures option must be worth corresponding American spot put option
If there is an inverted market with futures prices consistently lower than spot prices, as is the case with
high-interest currencies and some commodities, the reverse must be true
The later the futures contract expires the greater the dierences tend to be
Futures-style options
These are futures contracts on the payo from an option
A futures-style option is a bet on what the payo from an option will be
Traders who buy/sell a futures-style option post margin like on a regular futures contract
The contract is settled daily, and the nal settlement price is the payo from the option
If interest rates constant, futures price in a futures-style option = forward price in forward contract on
the option payo Futures price for a futures-style option = price paid for the option if payment were
made in arrears It is the value of a regular option compounded forward at the risk-free rate
Dening d
1
and d
2
as in Eqs. (8) and (9):
The futures price in a call futures-style option is: F
0
N(d
1
) KN(d
2
)
The futures price in a put futures-style option is: KN(d
2
) F
0
N(d
1
)
Formulas ok for futures-style option on futures contract and futures-style option on spot value of asset
In the rst case, F
0
is the current futures price for the contract underlying the option
In the second case, it is the current futures price for a futures contract on the underlying asset
maturing at the same time as the option
The put-call parity relationship for a futures-style options is: p +F
0
= c +K
American futures-style option can be exercised early Immediate settlement at options intrinsic value
It is never optimal to exercise an American futures-style options on a futures contract early because
the futures price of the option is always greater than the intrinsic value
American futures-style option can be treated like European futures-style option
97
98
E Asset-Liability Management
BKM - Ch. 16: Managing bond portfolios
Introduction
A passive investment strategy takes market prices of securities as set fairly
Rather than attempting to beat the market by exploiting superior information or insight, passive
managers act to maintain an appropriate risk-return balance given market opportunities
Special case of passive management: Immunization strategy against interest rate risk
In contrast, active investment strategy tries to achieve returns greater than commensurate with risk
In the context of bond management this style of management can take two forms:
1. Either use interest rate forecasts to predict movements in the entire bond market
2. Or use intramarket analysis to identify particular sectors of the market/bonds mispriced
Interest rate risk is crucial to formulating both active and passive strategies
Interest rate risk
An inverse relationship exists between bond prices and yields
As interest rates rise (fall), bondholders experience capital losses (gains)
These gains/losses make xed-income investments risky, even if coupon/principal guaranteed
Interest rate sensitivity - Malkiels rules
6%
10%
10%
10%
30 years
30 years
30 years
5 years 12%
12%
3%
3% D
C
B
A
Coupon Initial YTM Maturity Bond
Change in Yield to Maturity (%)
4 2 -4 -2
100
150
50
0
-50
P
e
r
c
e
n
t
a
g
e
c
h
a
n
g
e
i
n
b
o
n
d
p
r
i
c
e
0
D
Figure 1: Change in bond price as a function of change in YTM
1. Bond prices/yields inversely related: As yields increase, bond prices fall, and conversely
2. Increase in bonds YTM results in smaller price change than decrease in YTM of same magnitude
3. Long-term bonds more sensitive to interest rate changes than short-term bonds
4. The sensitivity of bond prices to changes in yields increases at a decreasing rate as maturity
increases. I.e., interest rate risk is less than proportional to bond maturity
5. Interest rate risk is inversely related to the bonds coupon rate: Prices of low-coupon bonds are
more sensitive to changes in interest rates than prices of high-coupon bonds
6. Sensitivity of bonds price to change in its yield is inversely related to current YTM
Maturity is a major determinant of interest rate risk, but other factors as well
Higher-coupon-rate bonds have higher fraction of value tied to coupons vs. nal par payment
Portfolio of coupons more heavily weighted toward earlier, short-maturity payments Lower
eective maturity Malkiels rule #5 - Price sensitivity falls with coupon rate
Similarly, for rule #6 - Price sensitivity falls with YTM: Higher yield reduces PV of all bonds
payments, but more so for distant payments
At higher yield, higher fraction of bonds value due to its earlier payments (lower eective
maturity/interest rate sensitivity) Overall sensitivity to changes in yields is lower
Duration
To deal with the ambiguity of the maturity of a bond making many payments, we need a measure
of the average maturity of the bonds promised CFs to serve as a useful summary statistic
Macaulays duration
The eective maturity concept is called the duration of the bond
101
Macaulays duration = wtd avg. of times to each coupon/principal payment
Weight associated with each payment time related to importance of that payment to bond
value: It is the proportion of bond total value accounted for by that payment
I.e., weight w
t
associated with CF at t = [PV of payment] [bond price]:
w
t
=
CF
t
/(1 +y)
t
Bond price
where y is the bonds YTM
weights = 1.0 because the sum of the CFs discounted at YTM is the bond price
Macaulays duration formula
D =
T
t=1
t w
t
(1)
Duration is a key concept in xed-income portfolio management for at least three reasons:
1. It is a simple summary statistic of the eective average maturity of the portfolio
2. It turns out to be an essential tool in immunizing portfolios from interest rate risk
3. Duration is a measure of the interest rate sensitivity of a portfolio
The duration measure enables us to quantify the relationship that long-term bonds are more sen-
sitive to interest rate movements than are short-term bonds
When interest rates change, proportional change in bonds price related to change in YTM y:
P
P
= D
_
(1 +y)
1 +y
_
(2)
Proportional price change = proportional change in [1 + bonds yield] [bonds duration]
Modied duration D
=
D
1 +y
P
P
= D
y D
=
1
P
dP
dy
(3)
% change in bond price = [modied duration] [change in bonds YTM]
Because the % change in the bond price is proportional to modied duration, modied duration
is a natural measure of the bonds exposure to changes in interest rates
What determines duration?
If we speculate on interest rates, duration tells us how strong a bet we are making. Conversely, if we
remain neutral on rates, and simply match the interest rate sensitivity of a chosen bond-market
index, duration allows to measure that sensitivity and mimic it in our portfolio
Rule 1 for duration: The duration of a zero-coupon bond equals its time to maturity
Rule 2 for duration: At constant maturity, bonds duration lower when coupon rate higher
This property corresponds to Malkiels fth relationship and is attributable to the impact of
early coupon payments on the weighted-average maturity of a bonds payments
The higher these coupons, the higher the weights on the early payments and the lower is the
weighted average maturity of the payments
Rule 3 for duration: Holding coupon rate constant, bonds duration generally increases with
time to maturity. Duration always increases with maturity for bonds selling at par/premium
Duration need not always increase with time to maturity
For deep-discount bonds (3% coupon in Fig. 2), duration can fall when maturity increases
For coupon bonds, duration increases < 1-yr with 1-yr increase in maturity (slope 1.0)
Rule 4 for duration: All else equal, duration of coupon bond higher when bonds YTM lower
While a higher yield reduces the PV of all of the bonds payments, it reduces the value of
more distant payments by a greater proportional amount At higher yields, higher fraction
of bonds total value lies in earlier payments, thereby reducing eective maturity
102
15% Coupon
YTM = 15%
Z
e
r
o
-
C
o
u
p
o
n
B
o
n
d
YTM = 15%
3% Coupon
YTM = 6%
15% Coupon
20
30
10
20 15 10 5 Maturity (years)
D
u
r
a
t
i
o
n
(
y
e
a
r
s
)
0
0
Figure 2: Bond duration vs. bond maturity
Rule 5 for duration: The duration of a level perpetuity is:
Duration of perpetuity =
1 +y
y
Maturity and duration can dier substantially (4)
Notice from Fig. 2 that as their maturities become ever longer, the durations of the two coupon
bonds with yields of 15% both converge to the duration of the perpetuity with the same yield
Convexity
Duration rule P/P = D
t=1
_
CF
t
(1 +y)
t
(t
2
+t)
_
Where CF
t
= coupon payment before maturity or nal coupon plus par value at maturity
Convexity allows to improve the duration approximation for bond price changes. Eq. (3) becomes:
P
P
= D
y +
1
2
Convexity (y)
2
(5)
Bond with positive convexity: 2nd term 0, regardless of whether yield rises/falls
Duration rule always underestimates new value of a bond following a change in its yield
For small y, the linear approximation given by duration rule is suciently accurate
Convexity is more important as a practical matter when potential rate changes are large
Why do investors like convexity?
Bonds with greater curvature gain more in price when yields fall than they lose when yields rise
If interest rates are volatile, this is an attractive asymmetry that increases the expected return on
the bond, because bond will benet more from rate decreases/suer less from rate increases
103
Bond B (less convex) - Actual price change
Bond A - Actual price change
Bond A - Duration approximation
D
0 P
e
r
c
e
n
t
a
g
e
c
h
a
n
g
e
i
n
b
o
n
d
p
r
i
c
e
-50
0
50
150
100
-2 -4 2 4
Change in Yield to Maturity (%)
Figure 3: Bond price convexity
Of course, if convexity is desirable, it will not be available for free
Investors pay higher prices and accept lower YTM on bonds with greater convexity
Duration and convexity of callable bonds
When interest rates high, price-yield curve for callable bond is convex (as for straight bond)
But as rates fall, there is a ceiling on price: Bond cannot be worth more than its call price
As rates fall, the bond is subject to price compression
Its value is compressed to the call price
In this region, price-yield curve lies below its tangency line, and has negative convexity
In the region of negative convexity, the price-yield curve exhibits an unattractive asymmetry
Rate increases result in larger price decline than price gain for rate decreases of equal magnitude
Asymmetry arises from the fact that bond issuer has an option to call back the bond
Investors are compensated for this unattractive situation when purchasing the bond: Callable
bonds sell at lower prices (higher initial yields) than comparable straight bonds
The eect of negative convexity is highlighted in Eq. (5)
When convexity negative, 2nd term on RHS is necessarily negative Bond price performance
will be worse than would be predicted by the duration approximation
However, callable bonds/bonds with embedded options dicult to analyze with Macaulays
duration. Because of such options, future CFs from the bonds no longer known
10% 5%
Price
Call
Region of
Positive Convexity
Region of
Negative Convexity
Interest Rate (%)
B
o
n
d
P
r
i
c
e
Figure 4: Price-yield curve for a callable bond
For bonds with embedded options, the convention is to compute the eective duration
Eective duration: % change in bond price per unit change in market interest rates:
Eective duration =
P/P
r
(6)
Eective duration is not computed relative to a change in bonds own YTM because for bonds
with embedded options, the YTM is often not a relevant statistic
104
Eective duration formula relies on pricing methodology that accounts for embedded options
The eective duration will be a function of variables that would not matter to conventional
duration, e.g. the volatility of interest rates
In contrast, modied/Macaulay duration comes directly from promised bond CFs/YTM
Example of eective duration calculation
Suppose that a callable bond with a call price of $1,050 is selling today for $980
If the yield curve shifts up by .5%, the bond price will fall to $930
If it shifts down by .5%, the bond price will rise to $1,010
To compute eective duration, we compute:
r = Assumed increase in rates Assumed decrease in rates = .5%(.5%) = 1%
P = Price at .5% rate increase Price at 5% rate decrease = 930 1,010 = -$80
Then the eective duration of the bond is: (80/980)/0.01 = 8.16 years
The bond price changes by 8.16% for a 1% point swing in rates around current values
Duration and convexity
Biggest market where call provisions are important is market for mortgage-backed securities
Such securities are called pass-throughs because CFs from borrowers are rst passed through to an
agency (Fannie Mae/Freddie Mac) and then passed through to ultimate purchaser of MBS
Example
Suppose that ten 30-yr mortgages, each with principal of $100k, are grouped into $1M pool
If the mortgage rate is 8%, then the monthly payment on each loan would be $733.76
Owner of the MBS receives $7,337.60 = total payment from the 10 pooled mortgages
The right to prepay the loan is precisely analogous to the right to refund a callable bond:
The call price for the mortgage is simply the remaining principal balance on the loan
The MBS is best viewed as a portfolio of callable amortizing loans
Mortgage-backs are subject to the same negative convexity as other callable bonds:
When rates fall and homeowners prepay mortgages, repayment of principal is passed through
Rather than capital gains on investment, investors receive the principal balance on the loan
Dierences between the mortgage-backs and callable corporate bonds:
Mortgage-backs are commonly selling for more than their principal balance because homeown-
ers do not renance as soon as rates drop MBS exhibit negative convexity at low rates, but
implicit call price (principal balance) is not a rm ceiling
Credit enhancement: Common feature of the asset-backed market: The credit risk of the un-
derlying borrower is enhanced by the guarantee of Freddie or Fannie that any mortgage default
will be treated from the point of view of the investor as if the mortgage had been prepaid
Simple mortgage-backs have also given rise to a rich set of mortgage-backed derivatives that can
be used to help investors manage interest rate risk:
E.g., a CMO (collateralized mortgage obligation) further redirects the CF stream of the MBS
to several classes of derivative securities called tranches
Tranches designed to allocate interest rate risk to investors most willing to bear that risk
The following table is an example of a very simple CMO structure
The underlying mortgage pool is divided into three tranches, each with a dierent eective
maturity and therefore interest rate risk exposure
Suppose the original pool is subdivided into three tranches as:
Tranche A $4 million principal Short-pay tranche
Tranche B $3 million principal Intermediate-pay tranche
Tranche C $3 million principal Long-pay tranche
In each period, each tranche receives interest owed based on promised coupon/principal
But initially, all principal payments (prepayments and amortization) go to tranche A, tranches
B and C receive only interest payments until tranche A is retired
Once tranche A is fully paid o, all principal payments go to tranche B
Finally, when B is retired, all principal payments go to C
105
This makes tranche A a short-pay class, with the lowest eective duration, while tranche C
is the longest-pay tranche Simple way to allocate interest rate risk among tranches
In essence, the mortgage pool is treated as a source of CFs that can be reallocated to dierent
investors in accordance with the tastes of dierent investors
Passive bond management
Passive managers take bond prices as fair and only seek to control only risk of their xed-income portfolio
Two broad classes of passive management are pursued in the xed-income market:
1. An indexing strategy that attempts to replicate the performance of a given bond index
2. Immunization techniques shield from exposure to interest rate uctuations
Although indexing and immunization strategies are alike in that they accept market prices as correctly
set, they are very dierent in terms of risk exposure
A bond-index portfolio has same risk-reward prole as bond market index to which it is tied
In contrast, immunization seeks to establish a virtually zero-risk prole (no impact from rate moves)
Bond-index funds
In principle, bond market indexing is similar to stock market indexing: The idea is to create a
portfolio that mirrors the composition of an index that measures the broad market
Indexes of the broad bond market: (i) Lehman Aggregate Bond Index, (ii) Salomon Smith Barney
Broad Investment Grade (BIG) Index, and (iii) Merrill Lynch US Broad Market Index
All are market-value-weighted indexes of total returns
All three include government, corporate, mortgage-backed, and Yankee bonds in their universes
All three indexes include only bonds with maturities greater than 1 year
Problems in the formation of an indexed bond portfolio
Indexes include more than 5,000 securities Dicult to replicate
Many bonds thinly traded Dicult to identify owners/buy securities at fair market price
Bond-index funds also face more dicult rebalancing problems than do stock-index funds
Bonds are continually dropped from the index as their maturities fall below 1 year
As new bonds are issued, they are added to the index Securities used to compute bond
indexes constantly change. As they do, the manager must update/rebalance portfolio
Bonds generate considerable interest income that must be reinvested Further complication
In practice, a stratied sampling/cellular approach is pursued to replicate broad bond indexes:
First, the bond market is stratied into several subclasses
Criteria (maturity, issuer, bonds coupon rate, issuer credit risk) are used to form cells
Bonds falling within each cell are then considered reasonably homogeneous
Next, the percentages of the entire universe (i.e., the bonds included in the index to be matched)
falling within each cell are computed and reported
Finally, the portfolio manager establishes a bond portfolio with representation for each cell
that matches the representation of that cell in the bond universe
Characteristics of portfolio for maturity, coupon rate, credit risk, industrial representation match
index characteristics, and portfolio performance should match the index
Immunization
Many institutions try to insulate their portfolios from interest rate risk altogether:
1. Some institutions, such as banks, are concerned with protecting the current net worth or net
market value of the rm against interest rate uctuations
2. Other investors (pension funds) may face an obligation to make payments after a given number
of years and are more concerned with protecting the future values of their portfolios
Immunization techniques refer to strategies used by such investors to shield their overall nancial
status from exposure to interest rate uctuations
Banks/thrift institutions have natural mismatch between asset/liability maturities
Bank liabilities are primarily deposits from customers, short-term in nature (low duration)
Bank assets by contrast are composed largely of outstanding loans or mortgages. These assets
are of longer duration than deposits, and more sensitive to interest rate uctuations
106
In periods when interest rates increase unexpectedly, banks can suer serious decreases in net
worth, their assets fall in value by more than their liabilities
Similarly, a pension fund may have a mismatch between the interest rate sensitivity of the assets
held in the fund and the PV of its liabilities - the promise to make payments to retirees
In some recent years pension funds lost ground despite excellent investment returns
As interest rates fell, value of their liabilities grew even faster than value of their assets
The idea behind immunization is that duration-matched assets and liabilities let the asset portfolio
meet the rms obligations despite interest rate movements
Two osetting types of interest rate risk: (i) Price risk and (ii) Reinvestment rate risk
Increases in interest rates cause capital losses but at same time, reinvested income grows faster
If the portfolio duration is chosen appropriately these two eects will cancel out exactly
For horizon equal to portfolios duration, price risk/reinvestment risk exactly cancel out
Duration matching balances the dierence between the accumulated value of coupon payments
(reinvestment rate risk) and the sale value of a bond (price risk)
We can also analyze immunization in terms of present as opposed to future values
Even with immunization, bond value at the horizon can be ,= obligation
This happens because the bond and the obligation have dierent convexities
The bond and the obligation are not duration-matched across interest rate shifts
Importance of rebalancing immunized portfolios
As interest rates and asset durations change, a manager must rebalance the portfolio of xed-
income assets continually to realign its duration with the duration of the obligation
Moreover, asset durations will change solely because of the passage of time
Duration generally decreases less rapidly than does maturity
Even if obligation immunized at the outset, as time passes, durations of the asset/liability fall
at dierent rates Without portfolio rebalancing, durations become unmatched
Immunization is a passive strategy (does not involve attempts to identify undervalued securities)
However, immunization managers still actively update and monitor their positions
Constructing an immunized portfolio
An insurance company must make a payment of $19,487 in 7 years. The market interest rate
is 10%, so the PV of the obligation is $10,000. The portfolio manager wishes to fund the
obligation using 3-year zero-coupon bonds and perpetuities paying annual coupons
Duration of liability: Single-payment obligation with duration of 7 years
Duration of assets: The duration of the perpetuity is 1.10/.10 = 11 years. With a weight w
invested in the zero, the asset duration is: w 3 years + (1 w) 11 years
Asset mix such that assets and liabilities have same duration: w = 1/2
Fully fund the obligation: Purchase $5,000 of zero-coupon bond and $5,000 of perpetuity
Rebalancing
Suppose that 1 year has passed, and the interest rate remains at 10%. The portfolio manager
needs to reexamine his position: Is the position still fully funded? Is it still immunized?
First, examine funding:
PV of obligation = $11,000 (1 year closer to maturity)
Managers funds = $11,000: Zero-coupon bonds have increased from $5,000 to $5,500 with
the passage of time, and perpetuity paid its annual $500 coupons and remains worth $5,000
The obligation is still fully funded
Portfolio weights must be changed, however: Zero-coupon bond now has duration of 2 years,
while perpetuity duration remains at 11 years. Obligation now due in 6 years
The weights must now satisfy: w 2 + (1 w) 11 = 6 which implies that w = 5/9
To rebalance the portfolio and maintain the duration match, the manager now must invest a
total of $11, 000 5/9 = $6, 111 in the zero-coupon bond
Requires that entire $500 coupon payment be invested in the zero, with an additional
$111.11 of the perpetuity sold and invested in the zero-coupon bond
107
Rebalancing portfolio entails transaction costs, so one cannot rebalance continuously
In practice, compromise between desire for perfect immunization (continual rebalancing) and
need to control trading costs (less frequent rebalancing)
Cash ow matching and dedication
If we follow the principle of CF matching we automatically immunize portfolio from interest rate
movement because bond CF and obligation CF exactly oset each other
Dedication strategy: Cash ow matching on a multiperiod basis
In this case, the manager selects either zero-coupon or coupon bonds that provide total cash
ows in each period that match a series of obligations
Advantage of dedication: Once-and-for-all approach to eliminating interest rate risk
Once CFs are matched, no need for rebalancing: Dedicated portfolio provides the cash neces-
sary to pay the rms liabilities regardless of the eventual path of interest rates
CF matching is not more widely pursued because of constraints it imposes on bond selection:
Immunization/dedication are appealing to rms that do not wish to bet on general movements
in interest rates, but they may want to use bonds perceived as undervalued
Cash ow matching, however, places so many more constraints on the bond selection process
that it can be impossible to pursue a dedication strategy using only underpriced bonds
Firms looking for underpriced bonds give up exact and easy dedication for the possibility of
achieving superior returns from the bond portfolio
Sometimes, cash ow matching is simply not possible:
Pension fund obligated to pay out perpetual ow of income to current/future retirees would
need to purchase xed-income securities with maturities ranging up to hundreds of years
Such securities do not exist, making exact dedication infeasible
Other problems with conventional immunization
Duration in Eq. (1) strictly valid only for at yield curve (all payments discounted at same rate)
If yield curve not at, then denition of duration must be modied and [CF
t
/(1+y)
t
] replaced with
[PV of CF
t
] calculated by discounting with the appropriate spot interest rate from the zero-coupon
yield curve corresponding to the date of the particular CF
Even with this modication, duration matching immunizes only for parallel shifts in yield curve
Multifactor duration models have been developed to allow for tilts and other distortions in the
shape of the yield curve, in addition to shifts in its level
Added complexity does not pay o in terms of substantially greater eectiveness
Finally, immunization can be an inappropriate goal in an inationary environment:
Immunization is essentially a nominal notion and makes sense only for nominal liabilities
No sense to immunize an obligation that will grow with price level using nominal assets (bonds)
Active bond management
Sources of potential prot
Two sources of potential value in active bond management:
1. Interest rate forecasting
Tries to anticipate movements across the entire spectrum of the xed-income market
If interest rate declines anticipated, managers increase portfolio duration (and vice versa)
2. Identication of relative mispricing within the xed-income market
An analyst might believe that the default premium on one particular bond is unnecessarily
large and therefore that the bond is underpriced
These techniques generate abnormal returns only if insight superior to the market
Interest rate forecasters have a notoriously poor track record
Portfolio rebalancing activities can be characterized as one of ve types of bond swaps
In 1st two swaps, investor believes that yield relationship between bonds/sectors is temporarily out
of alignment When aberration eliminated, gains realized on underpriced bond
The period of realignment is called the workout period
108
1. The substitution swap
Exchange of one bond for a nearly identical substitute
Substituted bonds: Essentially equal coupon, maturity, quality, call features, SF provisions
Swap motivated by a belief that the market has temporarily mispriced the two bonds, and that
the discrepancy between the prices of the bonds represents a prot opportunity
2. The intermarket spread swap
The yield spread between two sectors of the bond market seems temporarily out of line
E.g., if current spread between corporate/government bonds is considered too wide and is
expected to narrow, investor will shift from government bonds into corporate bonds
Consider carefully whether there is a good reason that yield spread seems out of alignment
3. The rate anticipation swap
Pegged to interest rate forecasting
If investors believe that rates will fall, they will swap into bonds of longer duration
Conversely, when rates are expected to rise, they will swap into shorter duration bonds
4. The pure yield pickup swap
Pursued not in response to mispricing, but to increase return by holding higher-yield bonds
When yield curve is upward-sloping, yield pickup swap entails moving into longer-term bonds
5. Tax swap
A swap to exploit some tax advantage
E.g., an investor may swap from one bond that has decreased in price to another if realization
of capital losses is advantageous for tax purposes
Horizon analysis
Horizon analysis is one form of interest rate forecasting
Select a particular holding period and predict the yield curve at the end of that period
Given a bonds time to maturity at the end of the holding period, its yield can be read from the
predicted yield curve and its end-of-period price calculated
Then, add coupon income/prospective capital gain to obtain the total return over holding period
Example of horizon analysis
A 20-yr maturity bond with a 10% coupon rate (annual) sells at a YTM of 9%. An investor
with a 2-yr horizon needs to forecast the total return on the bond over the coming 2 years
In 2 years, the bond will have an 18-yr maturity. The analyst forecasts that 2 years from
now, 18-year bonds will sell at YTM of 8%, and that coupon payments can be reinvested in
short-term securities over the coming 2 years at a rate of 7%
Calculation of the 2-year return on the bond:
1. Current price = $100 Annuity(9%, 20 years) + $1,000 PV(9%, 20 years) = $1091.29
2. Forecast price = $100 Annuity(8%, 18 years) + $1, 000 PV(8%,18 years) = $1,187.44
3. The future value of reinvested coupons will be ($100 1.07) + $100 = $207
4. The 2 -year return is [$207 + ($1, 187.44 $1, 091.29)]/1, 091.29 = 27.8%
Contingent immunization
Contingent immunization: Mixed passive/active strategy suggested by Liebowitz/Weinberger
E.g., the manager wishes to pursue active management but is willing to risk losses only to the
extent that the terminal value of the portfolio would not drop lower than $11 million
Manager can risk some losses at outset and start o with active strategy vs. immunizing
Key: Calculate funds needed to lock in FV of $11M at current rates, via immunization
If T = time left until horizon date, and r = market interest rate at any particular time, then
the value of the fund necessary to guarantee the min. acceptable terminal value is $11M/(1 +
r)
T
: This portfolio size, if immunized, will grow risk-free to $11M by the horizon date
This value becomes the trigger point
If and when the actual portfolio value dips to the trigger point, active management will cease
Contingent upon reaching the trigger, an immunization strategy is initiated instead, guarantee-
ing that the minimal acceptable performance can be realized
109
110
Feldblum: Asset liability matching for P&C insurers
The asset liability matching problem
If a life insurers writings are large and stable, and its investment returns are steady, then its prots will
depend primarily on internal pricing and external competition
Complications arise when interest rates change:
If rates decrease, insurers II may be insucient to satisfy policy obligations
If rates increase, insureds may take policy loans or lapse their policies to obtain higher returns
elsewhere Insurer may be forced to sell bonds at capital losses to meet cash needs
Life insurers have responded to these uncertainties in three ways:
1. Write term vs. permanent policies Short duration of term policies reduces interest rate risk
2. Shift investment reward/risk to p/h through variable/universal insurance policies/annuities
3. Match durations/CFs of liabilities/assets to mitigate eect of interest rate uctuations
Characteristics of P&C insurers
First-party coverages:
Insurers pay claims soon after the accident date
Such P&C policies are similar to term life insurance
Long-tailed commercial liability lines
GL, PL, Medmal, CAL, CMP
Average loss paid about four years after accident date
Moreover, investment risk on assets supporting loss reserves can not be shifted to p/h
Insurers seek ways to match their investment and insurance portfolios
Crucial dierences between the characteristics of P&C and life insurance operations:
1. Nominal vs. ination sensitive
Traditional life insurance and pension fund liabilities are expressed in nominal terms
P&C loss determined at settlement Ination from accident/settlement dates impacts liability
In other words, P&C liabilities are ination sensitive
2. ALM involves holding asset portfolio whose duration = duration of liabilities
P&C: Short duration assets whose returns vary directly with ination (T-bills, CP)
But with normal, upward-sloping yield curve, these assets have lower returns than long duration
assets (corporate bonds) Balance benets of immunization with overall portfolio yield
3. Many actuaries/nancial analysts ascribe long durations to common stocks
True if one examines only CFs resulting from current and expected dividends, but not price
changes or expected dividend changes due to changes in interest rates
However, common stock prices are ination sensitive, just as insurance liabilities between the
acquisition/occurrence and the disposal/payment dates
Asset/liability matching risk stems from factors other than interest rate changes
For common stocks: Systematic stock market uctuations
For insurance liabilities: Contagion risks and changes in legal interpretation of coverage
4. P&C insurers do not face the same disintermediation problems that life insurers face
Regardless of interest rate changes, P&C insurers expect a steady stream of premium inow
Moreover, P&C insurers do not segment funds. Investment returns must be sucient for the
company as a whole, not for any given block of policies
5. Measurements of asset risk often concentrate on nominal returns
Long-term bonds with amortized book values show high/steady nominal returns
But except for SAP, P&C depends on real returns. By this measure, long term bonds are risky
P&C insurers have a dicult balancing act
Loss reserves vary directly with interest rates
Immunization theory for xed income assets recommends holding short-term bonds
But these securities have lower yields than long-term bonds
Moreover, interest rates have a minor inuence on insurance CFs, since premium income does not
vary greatly with investment returns for most LOBs
111
As a result, insurers invest in long-term bonds, which are risky when marked-to-market
Yet insurers do not want to add investment risks to uctuations of insurance u/w cycle
A common solution is to diversify into equity investments, such as stocks and real estate
Real estate holdings are limited by state statutes, illiquid, and require investment expertise
Common stocks reported at market values on AS Book values uctuate more than bonds
Long-term bonds therefore are the investment of choice for P&C industry
Were bonds reported on AS at their market values, instead of amortized values, their actual riskiness
would be apparent, and insurers would invest more heavily in common stocks
Accounting rules inuence security selection as much as operating income does
Federal income tax laws also inuence nancial portfolios
Tax rate on long term capital gains was lower than rate on net investment income
Current law taxes both equally
Growth stocks have lost their tax advantage over income stocks/corporate bonds
The reduced tax exemptions for municipal bond/corporate dividend income reduces the tax
advantages of these securities over corporate bonds, federal bonds, and growth stocks
The strengthening of the Alternative Minimum Tax rules reduces the tax advantages of mu-
nicipal bonds and income stocks over corporate bonds, federal bonds, and growth stocks
Nominal versus ination sensitive liabilities
Asset liability matching theory is grounded on two characteristics of conventional life insurance policies:
(i) Nominally valued liabilities and (ii) Disintermediation
1. Traditional life liabilities stated in nominal terms, but funded (partially) by investment returns
If the assumed interest rate used for pricing the policy is conservative enough, i.e. if it is
suciently below actual investment returns, interest rate changes pose little risk
But investment returns have become more volatile, and competitive pressures have forced
insurers to be less conservative in their interest rate assumptions
Interest rate changes have a large eect on life insurance protability
2. Life insurers faced strong disintermediation risks in the 1970s
No disintermediation in P&C because policy terms are short/reserves do not accumulate
When a change in new money interest rates aects the market values of liabilities and assets dierently,
the liabilities and assets are mismatched
The greater the mismatch, the greater the interest rate risk
This is a speculative risk, not a pure risk: Insurer can gain/lose from interest rate changes
Aggressive insurer, condent in forecasting rate changes, may seek asset/liability mismatch
A conservative insurer would attempt to match assets and liabilities more closely
Matching techniques
Two common methods of ALM: (i) Cash ow matching and, (ii) Duration matching
Cash ow matching
Creates an asset/liability portfolio impervious to interest rate changes
The insurer forecasts net insurance cash ows from its book of business and buys xed income
securities whose coupons and maturities provide the needed monies at the needed times
Exact cash ow matching can be cumbersome, inecient, and costly
One bond may closely match liability CFs, but alternative bond may provide better yield
Exact cash ow matching is worthwhile only when the benets of risk reduction outweigh the
costs of lower yields and administrative expenses (rare)
Duration matching
Hedges against small interest rate changes
A change in new money interest rates has two eects on bond prices:
1. Coupons reinvested at new money interest rate (higher returns when rates rise)
2. The bonds price declines when interest rates rise and rises when interest rates fall
The relative importance of changes in reinvestment returns and prices depends on the bonds
term, its coupons, and the date of the liability payment
112
At a bonds expiration date, only the par value is received
Interim price changes have no eect on the proceeds
But reinvestment rate changes do inuence the nal wealth
Macaulay duration
Macaulay duration of the bond: The point at which a small change in interest rates just
balance the change in bonds market value
Formally, the Macaulay duration of a bond, or of any group of assets or liabilities, is the
weighted average of the cash ow dates, where the weights are the PV of each CF
Consider a 10-year 10% annual coupon bond issued on 1/1/88. CFs consist of $100 coupon pay-
ments each January 1, and the $1,000 principal repayment on 1/1/97
At a 10% current yield, the bonds duration is:
1 90.9 + 2 82.6 + + 10 424.1
90.9 + 82.6 + + 424.1
= 6.76 yrs
At a 5% yield, the duration is:
1 95.2 + 2 90.7 + + 10 675.3
95.2 + 90.7 + + 675.3
= 7.29 yrs
Duration of two bonds = wtd avg. of individual durations, weights = current market prices
More precisely, the weights are the sums of the discounted values of each bonds cash ows.
Barring marketplace imperfections, these are the market prices of the bonds
Match assets/liabilities by equating their (overall) durations
First determine the duration of the liabilities, either by examining total annual CFs or by
computing the weighted average of the durations of all policies
Then purchase FI securities whose overall duration matches duration of liabilities
Duration matching eliminates asset/liability mismatch risk only for small interest rate changes
As the rates change, the bond durations change too Even when asset/liability portfolios
have the same duration at one interest rate, they may have dierent durations at another
Fortunately, interest rates change slowly, giving insurer time to rebalance asset portfolio
Duration matching for P&C insurers
First determine loss payout patterns by line of business
Data are available either from internal company reports or from statutory AS
Loss reserve payout patterns for the Schedule P LOBs can be determined for 10 years
Loss payouts after 10 years may be estimated by an exponential decay model
Accident Development Year
Year 1 2 3 4 5 6 7 8 9 10 11+
1983 16.1% 15.3% 14.6% 11.9% 9.2% 7.0% 5.3% 4.2% 3.3% 2.6% 10.5%
Loss reserve durations then determined by discounting nominal loss at appropriate investment rate
The timing of loss payments within each CY, as well as the pattern of payments after the tenth
CY, do not have a major eect on the duration
For simplicity, assume that all loss payments are made at mid-year, and that loss reserves still
held after 10 years are paid out evenly over the subsequent ve years
The General Liability 1983 loss reserve duration is therefore:
(16.1)(0.5)(1.12)
0.5
+ (15.3)(1.5)(1.12)
1.5
+ + (2.1)(14.5)(1.12)
14.5
(16.1)(1.12)
0.5
+ (15.3)(1.12)
1.5
+ + (2.1)(1.12)
14.5
= 3.2 yrs
Interest rate
Risk free rate ne for discounting loss reserves Separate insurance/investment returns
For duration matching, must use same interest rate for liabilities and new investable assets
Payout pattern
For liability durations, Woll uses the payout pattern for loss reserves, not for incurred losses
113
The two types of loss payment patterns are quite dierent
Many losses paid during year of occurrence Not shown as reserve liabilities on year-end
statement. Losses outstanding as of 12/31 are generally slower settling ones
The loss reserve payout pattern is usually slower than the incurred loss payout pattern
The proper payout pattern depends on the type of matching:
If match assets held/liability obligations as of 12/31, use loss reserve payout pattern
If match assets purchased/liabilities incurred, use loss incurred payout pattern
The apparent conclusion is that to duration match a GL liability portfolio, you should invest in
medium term bonds with an average duration of 3.2 years. This conclusion is misleading
Asset/liability matching mitigates the risk of interest rate changes
But procedure outlined does not model true eects of interest rate changes on loss payments
Ination sensitive cash ows
ALM either: (i) Balances insurance and investment nominal cash ows or (ii) Balances insurance
cash ows with changes in investment cash ows plus capital gains/losses
If losses sensitive to ination through settlement date, then reserve asset with 0 duration
To eliminate inuence of interest rate changes on net worth, invest either in short term securities
(T-Bills, CP) or in securities ination sensitive (common stocks, real estate)
In practice, most reserves are not fully ination sensitive through the settlement date:
WC indemnity payments are largely xed at the accident date
BI wage loss/medical bills determined soon after accident, often 1-2yr before settlement
Nevertheless, general damages which form the bulk of insurance payments in GL, PL, Med-
mal, CMP liability coverages, and BI are ination sensitive through settlement date
Ination is increasingly important for insurance liability losses through the settlement date
When the losses are not easily quantied, such as pain and suering awards, juries are
inuenced by the value of money at the settlement date, not at the accident date
Medical bills depend on time of treatment, which falls between accident/settlement dates
Even in disability cases, the plaintis attorneys usually incorporate the eects of ination and
expected earnings changes in the demand for damages
Property reserves, on the other hand, are not ination sensitive
Since they are paid out quickly, however, they have equally short duration
Short-term commercial paper has a duration similar to that of GL loss reserves:
If interest rates increase, the ultimate expected loss payment increases
Reinvestment returns from the commercial papers maturity payments increase similarly
Since the assets are short term, there is little change in market price
Assets (commercial paper) and liabilities (GL losses) change in the same direction
If LC trends and commercial paper yields change by approximately the same amount, the
magnitudes of the asset and liability changes are also equal
This is by no means a recommendation for investments in commercial paper
Short-term CP provides lower yields/incurs higher transaction costs than other FI securities
Equity durations
Introduction
Securities whose annual payments and maturity values vary directly with ination rates and
interest rates, such as common stocks, are like casualty insurance liabilities
Macaulay durations are misleading for asset/liability matching
Asset and liability durations help quantify eects of interest rate changes on market values:
In general, [proportional change in market value] [duration] [change in interest rate]
Long-term bonds more sensitive to interest rate changes than CP and T-bills
Example
Consider two zero coupon bonds, with ve and ten year terms, respectively
Each has a par value of $1,000, and each pays 10% per annum
The issue prices, therefore, are 1, 000 (1/1.10)
5
= $621 and 1, 000 (1/1.10)
10
= $386
114
A zero coupon bonds duration equals its term The two durations are 5 and 10 years
Change in Price = 1 Duration Price Change in Interest Rate
Accordingly, % change in market price for the 10-yr bond is double that for the 5-yr bond
Common stocks
Traditional measurement of common stock duration uses the DDM of equity valuation
This model views a common stock as a perpetual bond that pays dividends for an innite term
If dividends grow at G% per annum, and values are discounted at K%, PV[stocks dividends] is:
(Current dividend)
(1 +G)
(K G)
The duration of a xed income security equals the negative of the derivative of the natural log of
its PV with respect to the discount rate For common stocks: