Winton: Value, Size And Momentum On Equity Indices – A Likely Example Of Selection BiasVW Staff
Value, Size And Momentum On Equity Indices – A Likely Example Of Selection Bias by Winton Global Capital Management
by Allan Evans, PhD, Senior Researcher and Carsten Schmitz, PhD, Head of Research (Zurich)
Value, size and momentum have a long history as stock price predictors, and similar indicators have been applied to stock indices in order to predict the performance of one national index against another. Published back tests of trading systems based on these ideas have shown impressive performance, but in this paper we find that this performance does not continue past the publication dates.
We argue that selection bias at the time of publication has a part to play in the disappointing out?of?sample performance of these indicators. We show how the combination of estimation uncertainty and selective reporting can readily explain the observed deterioration in performance. Importantly, with a fuller understanding of these effects, the long?term poor performance of the indicators could have been anticipated at the time.
Value, Size And Momentum On Equity Indices – A Likely Example Of Selection Bias – Introduction
Efforts to find predictors for stock returns have a long history. Quantitative work on momentum goes back at least to the 1960s, with the observation that, over timescales of a few months, stocks that performed well in the past tend to also perform well in the future . Later work showed some evidence for a negative effect (mean reversion) on a longer timescale [2, 3]. Valuation ratios also have a long history. The controversy over the ‘Value Line’ system in the 1960s [4, 5] is a well?known example. Fama and French recently reviewed the evidence for value, size and momentum factors across the world’s stock markets .
Starting in the 1990s, a series of papers were published suggesting that similar effects could be seen in national stock indices [2, 3, 7, 8]. In analogy to cross?sectional equity systems , countries were ranked on size, value or momentum indicators to form portfolios with long positions in the indices for the highest?ranked countries and short positions for the lowest?ranked countries. The published work showed excellent historical performance for such trading systems.
This paper reviews the performance of these cross?country equity portfolios. Nearly twenty years have passed, so we have the advantage of a large set of historical data that was not available to the authors of the listed papers. In Section 1 we replicate the published evaluations in the period before 1995, but in Section 2 we show that the performance of the trading systems on more recent data is disappointing. We believe this is because of selection bias in the published results, and in Sections 3 and 4 we show how this may have led to over?optimistic assessments of the systems. More importantly we demonstrate how this could have been avoided at the time.
1. Replicating the published work
The papers [2, 3, 8] use national stock market indices provided by MSCI  across 18 developed countries in the period 1970?1995. Table 1 lists the four indicators calculated from this data set that were used in these papers.
The momentum indicator (MOM) is the total return of an index over the last year, ignoring the return from the last month. The value indicator (V) is the book?to?market ratio of the index and size (S) is the inverse free?float market capitalization. The book value and market capitalization are each calculated for the index as a whole by summing the values for the component stocks. These three indicators are the same as those used in . The mean?reversion indicator (MR) is defined in a number of different ways in  and . We choose the fractional return between 3 years and 1 year ago so that the MR indicator is nearly independent of the MOM indicator.
Each indicator defines a distinct trading system which will be associated with a series of portfolios through time. The indicator is calculated for each country at the end of a calendar month. The countries are then ranked by the values of their indicator. Out of the 18 countries, we take a positive position in the top six countries and a negative position in the bottom six. All positions are equal in their allocation, so the portfolio is dollar neutral. The positions are held for one month.
Table 1 shows Sharpe ratios from the published sources and from our own tests over the period 1970?1995. We believe that the remaining differences between the published values and our own are due to small differences in the definition of the trading systems (for example, the way in which ties in the ranks are resolved).
All four systems have worse performance in the out?of-sample period than in the original test period. Two of the four systems lose money after 1995, and even the momentum system shows almost no profit after 2000.
See full PDF below.