Ulcer Index
An Alternative Approach to the Measurement of
Investment
Risk & Risk-Adjusted Performance
by Peter G. Martin
This page also available in PDF format
What
is the Ulcer Index?
Ulcer Index (UI) is a method for measuring investment risk
that addresses the real concerns of investors, unlike the widely used standard
deviation of return (SD). UI is a measure of the depth and duration of
drawdowns in prices from earlier highs.
Using UI instead of SD can lead to very different conclusions
about investment risk and risk-adjusted return, especially when evaluating
strategies that seek to avoid major declines in portfolio value (market
timing, dynamic asset allocation, hedge funds, etc.).
The Ulcer Index was originally developed by the author
of this page in 1987. Since then, it has been widely recognized and adopted by the
investment community. According to Nelson Freeburg, editor of Formula
Research, Ulcer Index is “perhaps the most fully realized statistical
portrait of risk there is.”
The Index was first described in The Investor's
Guide to Fidelity Funds: Winning Strategies for Mutual Fund Investors,
by myself and Byron McCann. Originally
published by John Wiley & Sons in 1989, this out-of-print book is available
in PDF format from http://www.tangotools.com/ui/fkbook.pdf.
Ulcer Index is also explained on Wikipedia at http://en.wikipedia.org/wiki/Ulcer_Index
Note: There have been instances of the term "Ulcer Index" being
used for risk measures that do not strictly follow the details described
here. This document explains the correct use of the concept.
What's
Wrong with Standard Deviation of Return?
Standard deviation is a statistical measure of the variability
or unpredictability of an investment's return. As a measure of risk, it suffers from a number
of serious drawbacks:
-
Both upward and downward changes in value add to the calculated
SD. Real investors associate risk only with the downside. Rising prices
create profits, not risk.
-
The calculated value of SD is not affected by the sequences
in which gains and losses occur. Thus, SD does not recognize the strings
of losses that result in significant drawdowns in value. The three hypothetical
investments in the chart below have the same annualized return (-0.52%/year) and the
same SD (4.66%/month), but no rational investor would consider them as having the same
risk.

This chart shows the NAV of an S&P 500 Index
mutual fund with dividends reinvested, using monthly data over the 10 years
2000-2009. The Actual (dark blue) line is the fund's actual performance. The
Sorted (magenta) line was obtained by re-arranging the monthly percentage returns, so
that the worst months occur first and the best ones occur last. The Flattened
(yellow) line was obtained by shuffling the monthly returns in such a way as to
minimize the depth and duration of drawdowns. The set of monthly returns is the
same in each case; only their order has been changed. See http://www.tangotools.com/ui/SD.xls.
-
When SD is used to measure the risk of a market timing strategy,
it will tell you roughly how often you were out of the market, but nothing
about whether you were out at the right times. SD doesn't tell you if your
strategy reduced risk by avoiding market downturns.
Because of these weaknesses, SD does not reward an investment
strategy for avoiding market downturns. Using Ulcer Index as a risk measure
avoids all of these problems.
What
About Other Risk Measures?
Other established risk measures have weaknesses too. For
example:
-
Some are based on the single worst event over a time period,
which by definition has no statistical significance (e.g. Worst Trade and
Maximum Drawdown).
-
Some are based on absolute rather than percentage price changes,
which distorts results over longer periods with strong price trends (e.g. Average
Maximum Retracement).
-
Some tell you how often you lost money, but
not how much (e.g.
Percentage Losing Quarters).
-
Some cannot be used to compare investment alternatives.
For example, Percentage Losing Trades cannot be used to compare a market timing strategy
with a buy-and-hold approach, because the latter has no
trades.
How is Ulcer Index
Calculated?
Ulcer Index measures the depth and duration of
percentage drawdowns
in price from earlier highs. The greater a drawdown in
value, and the longer it takes to recover to earlier highs, the higher
the UI. Technically, it is the square root of the
mean of the squared percentage drawdowns in value. The squaring effect penalizes large drawdowns proportionately more
than small drawdowns (the SD calculation also uses squaring).
In effect, UI measures the "severity"
of drawdowns, as represented by the dark regions in the charts below:
Drawdowns in value: S&P 500 index with dividends reinvested
Drawdowns in value: Fidelity Select Precious Metals & Minerals
fund
The algorithm for computing UI is simple, and can be seen
in the pseudo-code fragment below:
SumSq = 0
MaxValue = 0
for T = 1 to NumOfPeriods do
if Value[T] > MaxValue then MaxValue
= Value[T]
else SumSq = SumSq + sqr(100 * ((Value[T]
/ MaxValue) - 1))
UI = sqrt(SumSq / NumOfPeriods)
An Excel spreadsheet showing how to calculate the
Ulcer Index is available at http://www.tangotools.com/ui/UlcerIndex.xls
UI has a further advantage over SD. Its
calculated value is essentially the
same over a wide range of time intervals between data points. Weekly price data
is a robust choice, but daily data can be used as well. As the interval
is extended beyond a week, there is an increasing danger of missing significant
intra-period drawdown-and-recovery events. The use of quarterly or longer
intervals is strongly discouraged for this reason.
Unlike UI, the calculated value of SD depends
directly on the time
period used. For example, the SD of annual return is roughly 7.2 times the SD of
weekly return (7.2 is the square root of 52 weeks per year). Since the time
period is often unstated, this creates opportunities for serious
misunderstandings about an investment's risk.
Measuring
Investment Performance
A popular method for measuring investment "performance" is
to divide the excess return of an investment by its risk. (Excess return
is total return minus the return offered by risk-free investments such as money
market funds.) This
calculation provides a single number that accounts for both return and
risk. It reports the additional return achieved per unit of risk assumed.
Traditionally the Sharpe Ratio is used, where risk is again represented
by the standard deviation of return:
Sharpe Ratio = (Total return - Risk-free
return) / SD
Just as SD is a poor risk measure, so is this formula a poor
performance measure. This problem is solved by simply replacing SD with
UI. This new performance measure is known as the "Ulcer Performance Index"
(UPI) or
"Martin Ratio".
UPI = (Total return - Risk-free
return) / UI
When plotting investments on a risk vs return chart, UI
can be used instead of SD for the horizontal (risk) axis.
If a line is drawn between the points representing the
risk-free return and a risky investment, the slope of the line is equal
to the Ulcer Performance Index. As with the Sharpe Ratio, if an investment lies above the line joining
the risk-free return with the S&P 500, the investment is "beating the
market" on a risk-adjusted basis.
Market
Timing Example
The table below shows the results achieved with both UI and
SD. We compared two strategies over the period 1940-1997: buy-and-hold
the S&P 500 index, and timing the index with a simple momentum indicator. Results
include reinvestment of dividends in both cases.
|
Buy-and-Hold |
Timing System |
% Change |
| Annualized Total Return (%/yr) |
12.59 |
14.79 |
17.5 |
| Ulcer Index (%) |
8.85 |
5.14 |
-41.9 |
| Ulcer Performance Index (UPI) |
0.92 |
2.01 |
118.9 |
| Standard Deviation (%/yr) |
16.10 |
13.18 |
-18.1 |
| SD Performance (Sharpe Ratio) |
0.51 |
0.78 |
52.9 |
For the timing system, annualized total return is increased
by a modest 2.2 percentage points over buy-and-hold. SD reports 18% lower risk and 53% higher performance
(Sharpe Ratio). UI reports 42% lower risk and 119% higher performance (UPI). Thus, UI places a much higher value on the market timing system than
does SD.
Other experimental work has shown that many popular market
timing systems have little value when SD is used to measure risk, but significant
value when UI is used instead. This arises because SD fails to
recognize the success of timing systems in avoiding major market downturns.
Caveat
Emptor
With any method for computing investment risk and performance, it is
important to use data covering as long a time period as possible. In particular,
the measurement period must include both bull and bear markets for the investments
of interest. Needless to say, when comparing multiple investments, the
same time period must be used in each case.
All risk and return calculations should include reinvestment of interest, dividends
and other distributions; and should be net of all recurring fees, transaction
costs and trading slippage.
Any use of Ulcer Index or UPI that falls
outside of these constraints would be inappropriate and of little value.
This page
also available in PDF format
(C) Copyright
1987-2011 by Peter G. Martin.
All rights reserved.
|