這是用戶在 2024-10-25 16:11 為 https://onlinelibrary.wiley.com/doi/full/10.1111/jofi.13099 保存的雙語快照頁面,由 沉浸式翻譯 提供雙語支持。了解如何保存?
ARTICLE 文章
Full Access 完全存取權限

Anomalies and the Expected Market Return
異常現象和預期市場回報

XI DONGYAN LIDAVID E. RAPACHGUOFU ZHOU

Corresponding Author

GUOFU ZHOU

Correspondence: Guofu Zhou, Washington University in St. Louis; e-mail: zhou@wustl.edu.

Search for more papers by this author
First published: 06 December 2021
Citations: 37

首次發布: 2021 年 12 月 6 日
引用次數: 37
OpenURL Soochow University (Taiwan)

Xi Dong is at the City University of New York Baruch College. Yan Li is at the Southwestern University of Finance and Economics. David E. Rapach is at Saint Louis University. Guofu Zhou is at Washington University in St. Louis. Rapach was a visiting professor at Washington University in St. Louis when this project was undertaken. We thank Daniel Andrei, Svetlana Bryzgalova, John Cochrane, Andrew Detzel, Victor DeMiguel, Michael Hasler, Dashan Huang, Lawrence Jin, Sicong (Allen) Li, Lin Peng, Joel Peress, Kelly Shue, Jack Strauss, Yajun Wang, Xuemin (Sterling) Yan, and Lu Zhang; as well as conference and seminar participants at the 2020 FinTech Conference in China, Chinese University of Hong Kong-Shenzhen, City University of New York Baruch College, London Business School, Syracuse University, University of Bath, University of North Carolina at Charlotte, University of Liverpool, and Washington University in St. Louis, for helpful comments. We are especially grateful to the editor (Stefan Nagel), associate editor, and two referees for insightful and detailed comments that substantially improved the paper. We have read The Journal of Finance disclosure policy and have no conflicts of interest to disclose.
董奚就讀於紐約市立大學巴魯克學院。李岩就讀於西南財經大學。 David E. Rapach 在聖路易大學。週國富就讀聖路易華盛頓大學。當這個計畫進行時,拉帕奇是聖路易斯華盛頓大學的客座教授。我們感謝Daniel Andrei、Svetlana Bryzgalova、John Cochrane、Andrew Detzel、Victor DeMiguel、Michael Hasler、Dashan Huang、Lawrence Jin、Sicong (Allen) Li、Lin Peng、Joel Peress、Kelly Shue、Jack Strauss、Yajun Wang、Xuemin (Sterling ) ) 嚴和張璐;以及2020年中國金融科技大會的會議和研討會參與者、香港中文大學(深圳)、紐約市立大學巴魯克學院、倫敦商學院、雪城大學、巴斯大學、北卡羅萊納大學夏洛特分校、大學利物浦大學和聖路易斯華盛頓大學的有用評論。我們特別感謝編輯(Stefan Nagel)、副主編和兩位審稿人富有洞察力和詳細的評論,這些評論極大地改進了論文。我們已閱讀《金融雜誌》的揭露政策,沒有需要揭露的利益衝突。

ABSTRACT 抽象的

We provide the first systematic evidence on the link between long-short anomaly portfolio returns—a cornerstone of the cross-sectional literature—and the time-series predictability of the aggregate market excess return. Using 100 representative anomalies from the literature, we employ a variety of shrinkage techniques (including machine learning, forecast combination, and dimension reduction) to efficiently extract predictive signals in a high-dimensional setting. We find that long-short anomaly portfolio returns evince statistically and economically significant out-of-sample predictive ability for the market excess return. The predictive ability of anomaly portfolio returns appears to stem from asymmetric limits of arbitrage and overpricing correction persistence.
我們提供了關於多空異常投資組合回報(橫斷面文獻的基石)與市場總超額回報的時間序列可預測性之間聯繫的第一個系統證據。利用文獻中的 100 個代表性異常,我們採用各種收縮技術(包括機器學習、預測組合和降維)來有效提取高維度環境中的預測訊號。我們發現,多空異常投資組合報酬率表現出對市場超額報酬的統計和經濟顯著的樣本外預測能力。異常投資組合報酬的預測能力似乎源自於套利的不對稱限制和定價過高修正的持續性。

Stock return predictability is a fundamental topic in finance. Two major—and voluminous—lines of research focus on this subject. The first examines whether firm characteristics can predict the cross-sectional dispersion in stock returns. These studies identify numerous equity market anomalies (e.g., Fama and French (2015), Harvey, Liu, and Zhu (2016), McLean and Pontiff (2016), Hou, Xue, and Zhang (2020)). The second line of research investigates the time-series predictability of the aggregate market excess return based on a variety of economic and financial variables, such as valuation ratios, interest rates, and inflation (e.g., Nelson (1976), Campbell (1987), Fama and French (1988, 1989), Pástor and Stambaugh (2009)). Studies in this vein attempt to shed light on the variables that affect the equity risk premium.1
股票回報的可預測性是金融領域的一個基本議題。有兩個主要且大量的研究領域集中在這個主題上。第一項研究檢視公司特徵是否可以預測股票報酬的橫斷面離散度。這些研究發現了許多股票市場異常現象(例如,Fama 和 French ( 2015 )、Harvey、Liu 和 Zhu ( 2016 )、McLean 和 Pontiff ( 2016 )、Hou、Xue 和 Zhu ( 2020 ))。第二條研究線是基於各種經濟和金融變量,例如估值比率、利率和通貨膨脹,調查總市場超額收益的時間序列可預測性(例如,Nelson( 1976 ),Campbell( 1987 ), Fama 和French ( 1988 , 1989 ), Pástor 和 Stambaugh ( 2009 ))。這方面的研究試圖揭示影響股票風險溢酬的變數。 1

In this paper, we investigate whether these two leading lines of the finance literature are linked. Specifically, we analyze the ability of long-short anomaly portfolio returns from the cross-sectional literature to predict the market excess return. Our investigation has a number of key features. First, we focus on a cornerstone of the cross-sectional literature, namely, long-short anomaly portfolio returns, as the literature often uses such returns as evidence of cross-sectional mispricing. Second, we employ out-of-sample tests, since such tests provide the most rigorous and relevant evidence on stock return predictability (e.g., Goyal and Welch (2008), Martin and Nagel (2021)). Third, we examine the predictive ability of a large number (100) of long-short anomaly portfolio returns representative of those from the cross-sectional literature and simultaneously aggregate the information in the set of anomaly portfolio returns. Because accommodating a large number of anomaly portfolio returns poses problems for the conventional multiple predictive regression approach, we apply a variety of shrinkage techniques—including machine learning, forecast combination, and dimension reduction—to guard against overfitting the data in a high-dimensional setting. Fourth, we explore economic rationales for the ability of long-short anomaly portfolio returns to predict the market return, thereby complementing our statistical findings with economic analysis.
在本文中,我們研究了金融文獻的這兩條主線是否有關聯。具體來說,我們從橫斷面文獻中分析了多空異常投資組合收益預測市場超額收益的能力。我們的調查有許多關鍵特徵。首先,我們關注橫斷面文獻的基石,即多空異常投資組合收益,因為文獻經常使用此類收益作為橫斷面錯誤定價的證據。其次,我們採用樣本外測試,因為此類測試提供了關於股票收益可預測性的最嚴格和相關的證據(例如,Goyal 和 Welch ( 2008 )、Martin 和 Nagel ( 2021 ))。第三,我們檢查了代表橫斷面文獻的大量(100)多空異常投資組合收益的預測能力,並同時匯總了異常投資組合收益集中的資訊。由於容納大量異常投資組合回報會為傳統的多重預測迴歸方法帶來問題,因此我們應用了各種收縮技術(包括機器學習、預測組合和降維)來防止高維度設定中的資料過度擬合。第四,我們探討多空異常投資組合報酬預測市場報酬能力的經濟原理,進而以經濟分析補充我們的統計結果。

Empirically, we find that the information in the group of 100 long-short anomaly portfolio returns is indeed useful for forecasting the monthly market excess return on an out-of-sample basis, provided that we rely on strategies that guard against overfitting the data. The out-of-sample R2 (ROS2) statistics are economically sizable, ranging from 0.89% to 2.81%, all of which exceed the 0.5% threshold for economic significance suggested by Campbell and Thompson (2008). According to the Clark and West (2007) test, forecasts based on long-short anomaly returns provide statistically significant reductions in mean squared forecast error (MSFE) vis-à-vis the prevailing mean (i.e., random walk with drift) benchmark forecast. Furthermore, a mean-variance investor with a relative risk aversion coefficient of three would be willing to pay 259 to 638 basis points in annualized portfolio management fees to have access to the market excess return forecasts based on the information in the group of 100 long-short anomaly portfolio returns. Overall, the evidence of out-of-sample market excess return predictability is statistically and economically strong and robust to various shrinkage methods.

Economically, the predictive power of long-short anomaly portfolio returns for the market return can be explained via asymmetric mispricing correction persistence (MCP), which arises from asymmetric limits of arbitrage (Shleifer and Vishny (1997)). To analyze the implications of MCP for market return predictability, we specify a data-generating process with stationary under- and overpricing components in the prices of the long and short legs, respectively, of the anomaly portfolio.2 Consider, for example, the short leg. We show that when the return momentum effect from the correction of old overpricing shocks dominates the return reversal effect from the immediate correction of a new overpricing shock, the short-leg return in period
從經濟上講,多空異常投資組合收益對市場收益的預測能力可以透過不對稱錯誤定價修正持續性(MCP)來解釋,MCP是由套利的不對稱限制引起的(Shleifer和Vishny( 1997 )) 。為了分析 MCP 對市場回報可預測性的影響,我們指定了一個資料產生過程,其中異常投資組合的多頭和空頭價格分別具有固定的定價過低和過高的成分。 2例如,考慮短腿。我們表明,當舊的定價過高衝擊修正的回報動量效應主導新的定價過高衝擊立即修正的回報反轉效應時,短期回報
t will be positively related to the market return in period
與期間市場回報呈正相關
t+1. Intuitively, the period
。直觀上看,時期
t short-leg return measures the correction of the overpricing identified by the anomaly characteristic at the end of period
短腿回報衡量期末異常特徵所識別的定價過高的糾正
t1 (Akbas et al. (2015), Engelberg, McLean, and Pontiff (2018)). If the overpricing correction process is sufficiently persistent, then there will be positive serial dependence in the short-leg return; because the short-leg return is part of the market return, this means that the short-leg return is positively related to the future market return. The short-leg return can also signal overpricing in market segments beyond the short leg, thereby magnifying the effect on the market return. Indeed, we present empirical evidence that short-leg returns contain relevant overpricing information for a broad swath of the market. Analogous reasoning explains how the long-leg return can be positively related to the future market return.
(Akbas 等人( 2015 年),Engelberg、McLean 和 Pontiff( 2018 年))。如果定價過高修正過程夠持久,那麼短線報酬將存在正序列依賴性;因為短邊收益是市場收益的一部分,這意味著短邊收益與未來市場收益呈正相關。短邊回報也可能表明短邊以外的細分市場定價過高,從而放大對市場回報的影響。事實上,我們提供的經驗證據表明,短期回報包含了大部分市場的相關定價過高資訊。類似的推理解釋了長腿回報如何與未來市場回報呈正相關。

Our empirical results indicate that long- and short-leg returns are positively related to the future market return. However, the out-of-sample predictive ability of long-leg returns is weak, indicating that MCP is considerably stronger for overpricing vis-à-vis underpricing. Consequently, long-short anomaly returns strongly negatively predict the market return. Via recursive regressions, we find evidence that such negative relationships remain important over time. Stronger MCP for overpricing relative to underpricing is consistent with arbitrage with respect to overpriced shares being less aggressive, due to factors such as short-sale constraints (Miller (1977)), feedback effects (Edmans, Goldstein, and Jiang (2015)), and price drops coupled with declines in liquidity (Dong, Krystyniak, and Peng (2019)). In line with relatively strong overpricing correction persistence, evidence suggests that overvaluation, as reflected in the short legs of anomaly portfolios, drives the profitability of many anomaly returns (e.g., Hong, Lim, and Stein (2000), Stambaugh, Yu, and Yuan (2012, 2015), Avramov et al. (2013)).
我們的實證結果表明,多頭和空頭回報與未來市場回報呈正相關。然而,長腿回報的樣本外預測能力較弱,這表明 MCP 對於定價過高相對於定價過低的預測能力要強得多。因此,多空異常報酬對市場報酬有強烈的負面預測。透過遞歸回歸,我們發現證據表明這種負面關係隨著時間的推移仍然很重要。由於賣空限制(Miller( 1977 ))、反饋效應(Edmans、Goldstein 和 Jiang( 2015 ))等因素,定價過高的MCP 相對於定價過低而言更強,這與定價過高股票的套利行為不那麼激進是一致的。與相對較強的高估修正持續性相一致,有證據表明,高估(反映在異常投資組合的空頭中)推動了許多異常回報的盈利能力(例如,Hong,Lim和Stein( 2000 ),Stambaugh ,Yu和Yuan) ( 2012 年2015 年),阿夫拉莫夫等人( 2013 年))。

If MCP is stronger in the short leg compared to the long leg, it may seem that using short-leg anomaly portfolio returns instead of long-short anomaly returns would work better for forecasting the market return. However, we show econometrically that long-short anomaly returns can provide a stronger predictive signal for the market return. Intuitively, the long- and short-leg returns contain a common component unrelated to the future market return. By taking the difference between the long- and short-leg returns, we filter the noise in the predictor variable and thereby provide a sharper signal for anticipating the market return; such filtering is akin to alleviating the errors-in-variables problem. We also show that aggregating information across long-short anomaly returns can further sharpen the predictive signal by filtering the idiosyncratic noise in the individual predictors.
如果短邊的 MCP 比長邊更強,那麼似乎使用短邊異常投資組合收益而不是多空異常收益來預測市場收益會更有效。然而,我們從計量經濟學角度表明,多空異常收益可以為市場收益提供更強的預測訊號。直觀上,多頭和空頭回報包含與未來市場回報無關的共同成分。透過獲取多頭和空頭回報之間的差異,我們過濾了預測變數中的噪聲,從而為預測市場回報提供了更清晰的訊號;這種過濾類似於減輕變數錯誤問題。我們還表明,聚合多空異常收益的資訊可以透過過濾各個預測變數中的特殊雜訊來進一步銳化預測訊號。

Methodologically, the most straightforward approach for incorporating the information in a large set of potential predictors is to specify a multiple predictive regression that includes all of the lagged predictors as explanatory variables. However, conventional ordinary least squares (OLS) estimation of a high-dimensional predictive regression is highly susceptible to overfitting. By construction, OLS maximizes the fit of the model over the in-sample estimation period, which can lead to poor out-of-sample performance; intuitively, OLS is prone to misinterpreting noise in the data for a predictive signal. Because we are interested in forecasting the monthly market excess return, which inherently contains a small predictable component, we are operating in a quite noisy environment that exacerbates the danger of overfitting. As anticipated, we find that the conventional forecast based on OLS estimation of the multiple predictive regression that includes all of the long-short anomaly returns exhibits symptoms of extreme overfitting: The forecast is highly volatile and substantially less accurate than the prevailing mean benchmark.
從方法上來說,將資訊納入大量潛在預測變數的最直接方法是指定多重預測迴歸,其中包括所有滯後預測變數作為解釋變數。然而,高維預測迴歸的傳統普通最小平方法 (OLS) 估計很容易出現過度擬合。透過構造,OLS 最大化模型在樣本內估計期間的適合度,這可能會導致樣本外表現不佳;直觀上,OLS 很容易將資料中的雜訊誤解為預測訊號。因為我們感興趣的是預測每月的市場超額收益,它本身包含一個小的可預測成分,所以我們在一個相當嘈雜的環境中操作,這加劇了過度擬合的危險。正如預期的那樣,我們發現基於包含所有多空異常收益的多重預測回歸的 OLS 估計的傳統預測表現出極端過度擬合的症狀:預測波動性很大,並且遠低於普遍的平均基準準確度。

We use a variety of forecasting strategies to guard against overfitting high-dimensional predictive regressions, all of which essentially rely on shrinkage. The first employs the elastic net (ENet; Zou and Hastie (2005)), a refinement of the well-known least absolute shrinkage and selection operator (LASSO; Tibshirani (1996)), to estimate the forecasting model. The LASSO and ENet are machine-learning techniques that use penalized regression to directly shrink the parameter estimates and thereby avoid overfitting the data.3 We also consider forecast combination (Bates and Granger (1969)). We use a simple combination approach that takes the arithmetic mean of univariate predictive regression forecasts based on the individual predictors (Rapach, Strauss, and Zhou (2010)), as well as a refinement due to Rapach and Zhou (2020) and Han et al. (2021) that incorporates insights from Diebold and Shin (2019) by using machine-learning techniques to select the individual forecasts to include in the combination forecast. Finally, we employ dimension-reduction techniques to combine the predictors into a single variable, which subsequently serves as the explanatory variable in a univariate predictive regression. We consider three dimension-reduction techniques: The first takes the cross-sectional average of the individual predictors, the second extracts the first principal component from the set of predictors (Ludvigson and Ng (2007, 2009)), and the third, an implementation of partial least squares (PLS; Wold (1966)), extracts the first target-relevant factor from the set of predictors (Kelly and Pruitt (2013, 2015), Huang et al. (2015)). As indicated previously, the forecasts based on strategies designed to circumvent overfitting all produce statistically and economically significant ROS2 statistics and generate large economic gains for an investor.
我們使用各種預測策略來防止過度擬合高維預測迴歸,所有這些策略本質上都依賴收縮。第一個採用彈性網絡(ENet;Zou 和 Hastie( 2005 )),這是對眾所周知的最小絕對收縮和選擇算子(LASSO;Tibshirani( 1996 ))的改進,來估計預測模型。 LASSO 和 ENet 是機器學習技術,它們使用懲罰回歸來直接縮小參數估計,從而避免過度擬合資料。 3我們也考慮了預測組合(Bates 和 Granger ( 1969 ))。我們使用簡單的組合方法,該方法採用基於各個預測變數的單變量預測回歸預測的算術平均值(Rapach、Strauss 和 Zhou( 2010 )),以及 Rapach 和 Zhou( 2020 )以及 Han 等人的改進。 ( 2021 ) 結合了 Diebold 和 Shin ( 2019 ) 的見解,透過使用機器學習技術來選擇要包含在組合預測中的單一預測。最後,我們採用降維技術將預測變數組合成單一變量,該變數隨後充當單變量預測迴歸中的解釋變數。 我們考慮三種降維技術:第一個採用各個預測變數的橫斷面平均值,第二個從預測變數集中提取第一個主成分(Ludvigson 和 Ng ( 2007 , 2009 )),第三個是實現偏最小平方法(PLS;Wold( 1966 )),從預測變數集中提取第一個目標相關因子(Kelly 和Pruitt2013,2015 ),Huang 等人( 2015 ))。如前所述,基於旨在避免過度擬合的策略的預測都產生了統計和經濟上的顯著影響 ROS2 統計並為投資者帶來巨大的經濟收益。

To further assess the relevance of asymmetric limits of arbitrage for generating our findings, we form subgroups of anomalies using three proxies for limits of arbitrage: bid-ask spread, idiosyncratic volatility, and market capitalization. Specifically, we use the proxies to construct subgroups of anomalies characterized by more stringent limits of arbitrage in their short legs vis-à-vis their long legs. To the extent that the proxies capture limits of arbitrage, we expect the subgroups of long-short anomaly portfolio returns that consist of anomalies with relatively strong (weak) limits of arbitrage in their short legs to display more (less) out-of-sample predictive ability for the market return. This is what we find.
為了進一步評估不對稱套利限制與得出我們的研究結果的相關性,我們使用三個套利限制代理形成異常子組:買賣價差、特殊波動性和市值。具體來說,我們使用代理來建立異常子組,其特徵是短腿相對於長腿的套利限制更嚴格。就代理捕獲套利限製而言,我們預期多空異常投資組合收益的子組(由短邊套利限制相對強(弱)的異常組成)將顯示更多(更少)樣本外對市場回報的預測能力。這就是我們發現的。

Limits of arbitrage emerge from a variety of frictions (Gromb and Vayanos (2010)). In the models of Gârleanu and Pedersen (2013, 2016) and Dong, Kang, and Peress (2020), frictions such as limited risk-bearing capacity and transactions costs induce arbitrageurs to correct mispricing only slowly, resulting in MCP. To examine the relevance of limits of arbitrage, we analyze how various frictions are related to the out-of-sample predictive ability of long-short anomaly portfolio returns. Intuitively, we expect arbitrage activity to be more constrained during periods when frictions are more acute, implying stronger out-of-sample predictive power for long-short anomaly returns during such periods. We consider a number of variables as proxies for frictions that can affect MCP in anomaly portfolios (especially in the short legs), including aggregate liquidity (Pástor and Stambaugh (2003)), idiosyncratic risk (Ang et al. (2006), Pontiff (2006)), trading noise (Hu, Pan, and Wang (2013)), the VIX, economic uncertainty and risk aversion indices (Jurado, Ludvigson, and Ng (2015), Bekaert, Engstrom, and Xu (2021)), and short costs (Asness et al. (2018)). Consistent with stronger MCP for overpricing vis-à-vis underpricing, we detect significant increases in out-of-sample market return predictability during periods of heightened frictions.
套利的限制源自於各種摩擦(Gromb 和 Vayanos ( 2010 ))。在Gârleanu和Pedersen( 2013,2016 )以及Dong、Kang和Peress( 2020 )的模型中,有限的風險承受能力和交易成本等摩擦導致套利者只能緩慢地糾正錯誤定價,從而導致MCP。為了檢驗套利限制的相關性,我們分析了各種摩擦與多空異常投資組合報酬的樣本外預測能力之間的關係。直觀上,我們預計,在摩擦更加嚴重的時期,套利活動會受到更多限制,這意味著在該時期,多空異常回報的樣本外預測能力更強。我們考慮了許多變數作為摩擦的代理,這些摩擦可能影響異常投資組合中的 MCP(尤其是短邊),包括總流動性(Pástor 和 Stambaugh( 2003 ))、特殊風險(Ang 等人( 2006 )、 Pontiff( 2006 ))、交易噪音(Hu、Pan 和 Wang( 2013 ))、VIX、經濟不確定性和風險厭惡指數(Jurado、Ludvigson 和 Ng( 2015 )、Bekaert、Engstrom 和 Xu( 2021 )),以及短期成本(Asness 等人( 2018 ))。與定價過高相對於定價過低的更強的 MCP 一致,我們發現在摩擦加劇期間樣本外市場回報的可預測性顯著增加。

Finally, we provide more direct evidence that the predictive power of long-short anomaly portfolio returns for the market return stems from slow arbitrage, especially with respect to overpricing. Following Chen, Da, and Huang (2019), we construct net arbitrage positions in the overall market by aggregating the value-weighted trades of hedge funds and short sellers. We find that an increase in long-short anomaly portfolio returns predicts a statistically and economically significant decrease in net arbitrage positions in the market, primarily due to increases in short positions. We also examine the tone of public news about U.S. financial markets extracted from news articles from Thomson Reuters by Calomiris and Mamaysky (2019). We find that an increase in long-short anomaly returns leads to more negative market news tone.
最後,我們提供了更直接的證據,表明多空異常投資組合收益對市場收益的預測能力源於緩慢套利,特別是在定價過高方面。繼Chen、Da和Huang( 2019 )之後,我們透過匯總對沖基金和賣空者的價值加權交易來建構整個市場的淨套利部位。我們發現,多空異常投資組合收益的增加預示著市場中淨套利頭寸在統計和經濟上的顯著下降,這主要是由於空頭頭寸的增加。我們也研究了從 Calomiris 和 Mamaysky 的湯森路透新聞文章中提取的有關美國金融市場的公共新聞的語氣( 2019 年)。我們發現,多空異常報酬的增加會導致更多負面的市場消息基調。

Our study is related to Engelberg et al. (2021), which presently is the only other study to systematically investigate connections between cross-sectional and time-series stock return predictability. Engelberg et al. (2021) follow the existing approach in the time-series literature by using the (equal- or value-weighted) average of firm-level values for a given characteristic as a predictor of the market return. Taking into account a large number of anomaly characteristics from the cross-sectional literature, they find little evidence that characteristics on average have out-of-sample time-series predictive ability for the market return. In contrast, we introduce a new approach that relies on long-short anomaly portfolio returns—a bedrock of the cross-sectional literature—to predict the market return, and we find strong evidence that the aggregate information in long-short anomaly returns is valuable for forecasting the market return. The two papers complement each other by shedding light on the relationship between cross-sectional and time-series stock return predictability.4
我們的研究與 Engelberg 等人相關。 ( 2021 ),這是目前唯一一項系統性研究橫斷面股票報酬可預測性和時間序列股票報酬可預測性之間關聯的研究。恩格伯格等人。 ( 2021 )遵循時間序列文獻中的現有方法,使用給定特徵的公司層面價值的(等權或價值加權)平均值作為市場回報的預測因子。考慮到橫斷面文獻中的大量異常特徵,他們發現幾乎沒有證據表明特徵平均對市場回報具有樣本外時間序列預測能力。相較之下,我們引入了一種依賴多空異常投資組合回報(橫斷面文獻的基石)來預測市場回報的新方法,並且我們發現強有力的證據表明多空異常回報中的匯總資訊是有價值的用於預測市場回報。這兩篇論文透過闡明橫斷面股票收益預測性和時間序列股票收益預測性之間的關係來相輔相成。 4

The remainder of the paper is organized as follows. Section I provides intuition on the predictive power of long-short anomaly portfolio returns for the market return. Section II describes the construction and evaluation of the out-of-sample forecasts. Section III presents the data, while Section IV reports forecast performance results. Section V provides additional results pertaining to asymmetric limits of arbitrage. Section VI concludes.
本文的其餘部分組織如下。第一節提供了多空異常投資組合報酬對市場報酬的預測能力的直覺。第二節描述了樣本外預測的建構和評估。第三部分介紹了數據,第四部分報告了預測績效結果。第五節提供了與不對稱套利限制有關的其他結果。第六節總結。

I. Intuition on Predictability
I. 對可預測性的直覺

In this section, we use a stylized data-generating process to provide intuition on the predictive ability of anomaly portfolio returns for the market return.5
在本節中,我們使用程式化的資料產生流程來直觀地了解異常投資組合回報對市場回報的預測能力。 5

A. Data-Generating Process
A. 資料產生過程

Assume that the prices for the long and short legs of an anomaly portfolio contain a common martingale component with period
假設異常投資組合的多頭和空頭價格包含一個共同的鞅成分,週期為
increment  增量, while the long-leg (short-leg) price contains a stationary component
,而長邊(短邊)價格包含固定成分
() reflecting the level of underpricing (overpricing), which is uncorrelated with the common component. Intuitively, mispricing implies the presence of a stationary component in the price: Under the assumption that the mispricing eventually corrects, any mispricing shock affects the price only temporarily, although the correction process can last for multiple periods. The log return (in terms of price changes) in each leg is then given by
)反映了定價過低(定價過高)的水平,與共同成分不相關。直觀上,錯誤定價意味著價格中存在固定成分:假設錯誤定價最終會糾正,任何錯誤定價衝擊只會暫時影響價格,儘管糾正過程可能會持續多個時期。每條腿的對數回報(以價格變動而言)由下式給出
(1)
where  在哪裡 () is the long-leg (short-leg) return,
) 是長腿(短腿)回報,
is the change in mispricing, and
是錯誤定價的變化,並且
. Using equation (1), the long-short anomaly portfolio return
。使用方程式( 1 ),多空異常投資組合收益
can be expressed as  可以表示為
(2)
For expositional ease, assume that the long and short legs together comprise the market. Then
為了方便說明,假設多頭和空頭共同構成市場。然後
(3)
where  在哪裡 is the (equal-weighted) market return.6
是(等權重)市場回報。 6
According to the Wold representation theorem, the stationary component in each leg related to mispricing (i.e., the pricing error) can be expressed as
(4)
where , () is a serially uncorrelated underpricing (overpricing) shock, , (square summability), and for (to ensure that and ). For simplicity, we assume that and are uncorrelated. When , there is no mispricing and hence the market return in equation (3) reduces to .

Equation (4) provides a general representation for the mispricing component in each leg, as any stationary autoregressive moving-average (ARMA) process can be expressed via the infinite-order moving-average (MA) process. The equation expresses the current-period level of mispricing as a function of current and past mispricing shocks. The MA process can be interpreted as an impulse-response function: is the response (ceteris paribus) of for to a period unit mispricing shock. For example, consider a unit overpricing shock and suppose that , , and for . The pricing error is at the end of the period immediately after the shock, so that , that is, 10% of the overpricing shock is corrected during the first period after the shock. The pricing error is at the end of the second period after the shock, which means that an additional 45% () of the overpricing shock is corrected. Finally, the remaining 45% of the overpricing shock is corrected during the third period after the shock ().

Taking the first difference of equation (4), we obtain the expression for the change in mispricing,
(5)
where and for . To simplify the exposition, we consider the mispricing correction assumption
(6)
which assumes that arbitrage is sufficiently active to ensure that the mispricing associated with a period mispricing shock is not exacerbated in any subsequent period.7

B. Mispricing Correction Persistence

Consider a predictive regression relating the long- or short-leg return of the anomaly portfolio to next period's market return:
(7)
where is a zero-mean, serially uncorrelated disturbance term. Using equations (1) and (3), the standardized slope coefficient in equation (7) is given by8
(8)
Equation (8) indicates that the predictive ability of the long- or short-leg return depends on . Based on equation (5), the latter is given by
(9)
Our empirical results generally indicate that , so that , especially for the short leg.
To understand the conditions that produce , we can use equation (5) to write the changes in the level of mispricing for the current and next period as
(10)
(11)
respectively. Equations (10) and (11) reveal that a new (i.e., period ) pricing shock affects the current and future changes in mispricing in opposite directions. Consider an overpricing shock (). According to equation (10), exacerbates the current level of overpricing, corresponding to a positive change in overpricing in period . According to equation (11), the overpricing induced by the period shock is corrected in proportion to in period , corresponding to a nonpositive change in overpricing. Thus, the consecutive changes in mispricing due to a new overpricing shock can generate negative serial dependence in the short-leg return. In contrast, old pricing shocks ( for ) can produce positive serial dependence in the short-leg return. According to equations (10) and (11), the changes in overpricing corresponding to these shocks are nonpositive in consecutive periods; for example, the overpricing associated with is corrected in proportion to in and in .

In sum, two opposing effects determine : (i) the extent to which the overpricing associated with a new shock is corrected in the next period and (ii) the extent to which the overpricing associated with old overpricing shocks is corrected in the current and next periods. These two effects are evident in the expression in brackets in equation (9), which accounts for all of the consecutive pairs of return responses to new and old overpricing shocks. The first term, , is the product of the period and period return responses to a new unit overpricing shock, which captures the potential reversal due to the immediate correction of the overpricing induced by the new shock. The second term, , reflects the potential momentum due to the persistent correction of the overpricing induced by old shocks. For example, is the product of the period and period overpricing corrections corresponding to a period unit overpricing shock, is the product of the period and period overpricing corrections corresponding to a period unit overpricing shock, and so forth.9

For it to be the case that , the return momentum generated by the correction of the overpricing induced by old shocks needs to outweigh the magnitude of the return reversal generated by the immediate correction of the overpricing induced by a new shock. When the momentum effect dominates, MCP is sufficiently strong that in equation (9) and in equation (8). We can express this condition as
(12)
An analogous condition to equation (12) holds for and . Equation (12) is consistent with various theoretical models that can explain strong MCP.10

The previous example (, , and for ) numerically illustrates the intuition. A unit overpricing shock increases both the price and return by one unit at the time of the shock. Since , the price falls in the period after the shock, corresponding to a return of , so that the magnitude of the reversal effect is 0.10. The size of the momentum effect associated with old unit overpricing shocks is given by the second term is brackets, which equals . In this case, the magnitude of the momentum effect outweighs that of the reversal effect, so that in equation (9).11

Next, consider a predictive regression based on the long-short anomaly portfolio return:
(13)
The standardized slope coefficient in equation (13) is given by
(14)
Empirically, we find that , which holds when
(15)
that is, when there is stronger MCP with respect to overpricing vis-à-vis underpricing. Our finding that long-short anomaly portfolio returns negatively predict the market return is consistent with existing theoretical and empirical evidence. Miller (1977) argues that, due to short-sale impediments, overpricing should be more prevalent and persistent than underpricing. Edmans, Goldstein, and Jiang (2015) and Dong, Krystyniak, and Peng (2019) further show that feedback effects and price drops in conjunction with declines in liquidity, respectively, can generate overpricing persistence. In line with the relative importance of overpricing, studies such as Hong, Lim, and Stein (2000); Stambaugh, Yu, and Yuan (2012, 2015); and Avramov et al. (2013) find that the short legs of anomaly portfolios are largely responsible for the profitability of long-short anomaly portfolio returns.

C. Noise Reduction

In our setup, the long-short anomaly portfolio return can produce a better predictive signal for predicting the market return than the short-leg return, which we also find empirically. To see this, for simplicity we assume in this and the next subsection that , which is in line with the weak predictive ability of long-leg returns in the data (as discussed in Section IV.A). In this case, , so equation (14) becomes
(16)
The magnitude of in equation (16) is greater than that of in equation (8) when
(17)
Intuitively, using in lieu of as the predictor in the predictive regression removes the common unpredictable component , thereby filtering noise from the predictor to provide a sharper predictive signal for the market return. In other words, the problem of using the short-leg return alone to predict the market return is akin to the errors-in-variables problem. Consistent with this intuition, we empirically find that is considerably smaller than .
A similar argument explains why the long-short anomaly portfolio return is a better predictor of next month's market return than the market return itself. The standardized slope coefficient for a regression of the market return on its own lag can be expressed as
(18)
Comparing equations (16) and (18), the numerators have the same magnitude. However, the presence of in the denominator of equation (18) creates noise in the predictive signal, so that the magnitude of the slope coefficient in equation (16) is larger than that in equation (18). Consistent with this intuition, in Section IV.C we empirically find that the lagged market return cannot significantly predict the market return on an out-of-sample basis.

D. Extensions

For expositional ease, we assume that the long and short legs cover the market in equation (3). When the long and short legs are the last and first decile portfolios, respectively—as in our empirical application—the same intuition applies. We address this issue empirically in Section V.B, where we show that the predictive ability of long-short anomaly returns extends across multiple market segments. Thus, our intuition behind market return predictability based on long-short anomaly returns extends beyond serial dependence in the long and short legs (especially the latter) and includes the relevance of long-short anomaly returns for signaling mispricing correction more generally across the market.12

Again for expositional ease, we focus on a single long-short anomaly return in the predictive regression in equation (13). In our empirical application, we incorporate the information in a large number of long-short anomaly portfolio returns to predict the market return. In the spirit of Section I.C, aggregating predictors can help reduce the noise in individual predictors to better uncover the predictive signal. To see this, we can extend our framework to multiple observable anomalies, each of which contains an idiosyncratic component:
(19)
where is a serially uncorrelated idiosyncratic shock, , and for . The standardized slope coefficient for a predictive regression relating to is given by
(20)
Next, consider combining the information in the long-short anomaly returns by taking their period cross-sectional average. Based on equation (19), we have
(21)
The standardized slope coefficient for a predictive regression relating to is given by
(22)
where . If , then the slope coefficient in equation (22) will be larger in magnitude than that in equation (20). Intuitively, like portfolio diversification, averaging across predictors can help filter the idiosyncratic noise, thereby providing a stronger signal for predicting the market return.13

II. Methodology

This section describes the construction of the out-of-sample forecasts and their evaluation using both statistical and economic criteria.

A. Forecast Construction

Suppose that we want to forecast the market excess return () and that we have multiple potential predictors; in our context, the predictors are 100 long-short anomaly portfolio returns. We are interested in generating , a forecast of the month market excess return based on information available through month . All of our market excess return forecasts are out of sample, as we only use data available through month to forecast .

The prevailing mean forecast, which implicitly assumes that the market excess return is unpredictable (apart from its mean value), is the most popular benchmark in the literature. The prevailing mean forecast is simply the average of the market excess return observations available at the time of forecast formation. Because the predictable component in the monthly market excess return is inherently small (i.e., return data are quite noisy), the prevailing mean forecast is difficult to beat in practice (e.g., Goyal and Welch (2008)). With this in mind, successful out-of-sample strategies effectively shrink the market excess return forecast toward the prevailing mean benchmark to reduce the likelihood of overreacting to the noise in return data.

We compare the prevailing mean benchmark to the seven forecasts summarized below, each of which incorporates the information in the group of predictors. Section II of the Internet Appendix provides details for the construction of each forecast.
  • Conventional OLS: The conventional OLS forecast is based on a fitted multiple predictive regression that includes all of the lagged predictors as explanatory variables. Although it is straightforward to compute, the conventional OLS forecast is likely to perform poorly in practice, especially when the number of predictors is large. By construction, conventional OLS estimation maximizes the fit of the model (i.e., the in-sample statistic) over the estimation sample, which can lead to in-sample overfitting and poor out-of-sample performance, especially for high-dimensional models. The inherently large unpredictable component in the market excess return exacerbates the overfitting problem.
  • ENet: The ENet forecast is based on the multiple predictive regression fitted via the ENet instead of OLS. The ENet (Zou and Hastie (2005)) relies on penalized regression to guard against overfitting. The ENet penalty term includes both (LASSO) and (ridge; Hoerl and Kennard (1970)) components. The component permits shrinkage to zero, so that the ENet performs variable selection. Based on Flynn, Hurvich, and Simonoff (2013), we use the Hurvich and Tsai (1989) corrected version of the Akaike (1973) information criterion to select the value of the regularization parameter governing the degree of shrinkage. The ENet directly addresses overfitting by shrinking the slope coefficients of the fitted model, which results in shrinking the forecast toward the prevailing mean benchmark.
  • Simple Combination: Instead of OLS estimation of the multiple predictive regression, forecast combination begins by computing a set of forecasts based on OLS estimation of univariate predictive regressions that include each lagged predictor (considered in turn). The simple combination forecast is the arithmetic mean of the individual univariate forecasts. Rapach, Strauss, and Zhou (2010) show that the simple combination forecast exerts a strong shrinkage effect.
  • Combination ENet: When the number of predictors is large, the simple combination forecast can be too conservative in the sense that it “overshrinks” the forecast toward the prevailing mean, thereby neglecting too much of the relevant information in the predictor variables. Using insights from Diebold and Shin (2019), Rapach and Zhou (2020) and Han et al. (2021) employ the ENet to refine the simple combination forecast. Instead of averaging across all of the individual univariate predictive regression forecasts, the combination ENet (C-ENet) forecast takes the average of the individual forecasts selected by the ENet in a Granger and Ramanathan (1984) multiple regression relating the actual market excess return to the individual univariate forecasts.
  • Predictor Average: An alternative strategy for guarding against overfitting is to first combine the predictors themselves into a small number of variables and then use the reduced set of variables as predictors in a low-dimensional predictive regression. Intuitively, as discussed in Section I.D, we consolidate the predictors to filter the noise in the individual predictors. The predictor average forecast is based on OLS estimation of a univariate predictive regression in which the lagged cross-sectional average of the predictors serves as the explanatory variable.
  • Principal Component: We can also combine the predictors by extracting the first principal component from the set of predictors. The principal component forecast uses the lagged principal component as the explanatory variable in a univariate predictive regression estimated via OLS.
  • PLS: The first principal component explains as much variation as possible in the predictors themselves. However, from a forecasting standpoint, we are interested in explaining the target variable. Instead of extracting a factor that explains as much of the variation in the predictors as possible, Kelly and Pruitt (2013, 2015) develop a three-pass regression filter to construct a target-relevant factor from a set of predictors that is maximally correlated with the target variable. The lagged target-relevant factor then serves as the explanatory variable in a univariate predictive regression estimated via OLS. The three-pass regression filter is essentially a version of PLS.

B. Forecast Evaluation—Statistical Accuracy

We first assess market excess return forecasts in terms of statistical accuracy via MSFE. Denote the errors for the prevailing mean benchmark and a competing forecast by
(23)
(24)
respectively, where is the prevailing mean benchmark forecast and generically denotes a competing forecast. The sample MSFE is given by
(25)
where is the number of out-of-sample observations. We test for a difference in the population MSFEs using the Clark and West (2007) procedure, which can be conveniently implemented in a simple regression framework:
(26)
where is the period loss differential. The -statistic corresponding to the OLS estimate of in equation (26) is used to test
(27)
where is the population MSFE for .14 The -statistic is computed using a heteroskedasticity- and autocorrelation-consistent (HAC) standard error (Newey and West (1987)).
When comparing MSFEs for the prevailing mean benchmark and a competing market excess return forecast, it is common to report the Campbell and Thompson (2008) statistic:
(28)
Equation (28) gives the proportional reduction in the sample MSFE for the competing forecast vis-à-vis the prevailing mean benchmark. Using the Clark and West (2007) statistic to test equation (27) is tantamount to testing : against : (in population). Because the predictable component in the monthly market excess return is necessarily limited, the statistic will be small. Nevertheless, based on the market Sharpe ratio, Campbell and Thompson (2008) suggest that a monthly statistic as small as 0.5% can signal economic significance. As described in Section II.C, we also assess the economic significance of market return forecasts more directly by measuring their economic value to an investor.
In addition, we examine whether out-of-sample return predictability (as measured by the statistic) is related to market frictions. To the extent that greater frictions exacerbate limits of arbitrage, we expect anomaly portfolio returns to generate stronger out-of-sample gains during high-friction periods. To test whether out-of-sample return predictability changes with the state of market frictions, we augment the Clark and West (2007) framework in equation (26) as follows:
(29)
where is an indicator variable that equals one (zero) if market frictions are high (low). We use the -statistic corresponding to the OLS estimate of in equation (29) to test
(30)
where () is the value of the statistic during periods of high (low) market frictions. We again compute the -statistic using an HAC standard error.

C. Forecast Evaluation—Economic Value

In addition to statistical accuracy, we compare the benchmark and competing forecasts in terms of their economic value to an investor. Specifically, we consider a mean-variance investor who allocates across equities and risk-free Treasury bills each month. At the end of month , the investor faces the objective function
(31)
where represents the coefficient of relative risk aversion, and are the allocations to the market portfolio and risk-free bills, respectively, in period , is the investor's market excess return forecast, and is the investor's forecast of the variance of the market excess return. The well-known solution to equation (31) takes the form15
(32)
We assume that the investor uses the sample variance computed over a 60-month rolling estimation window to forecast the variance in equation (32). The average utility realized by the investor is given by
(33)
where () and () are the mean and variance, respectively, for the portfolio return over the out-of-sample period when the investor uses the prevailing mean (competing) forecast for in equation (32). Mean-variance utility has the same units as returns, and equation (33) can be interpreted as the certainty-equivalent return. Finally, we compute the average utility gain (or increase in certainty-equivalent return) when the investor uses the competing forecast in lieu of the prevailing mean benchmark in equation (32):
(34)
After multiplying the average utility gain in equation (34) by 12, it can be interpreted as the annualized portfolio management fee (as a proportion of wealth) that the investor would be willing to pay to have access to the information in the competing forecast relative to that in the prevailing mean benchmark. For our empirical analysis in Section IV, we assume that ; the results are similar for reasonable alternative values for .

III. Data

We consider 100 long-short anomaly portfolio returns that are representative of anomalies from the cross-sectional literature and that can be replicated using publicly available data from CRSP, Compustat, and I/B/E/S. The anomalies correspond to numerous categories, such as value versus growth, profitability, investment, issuance activity, momentum, and trading frictions. Table IA.I in the Internet Appendix lists the 100 anomalies.16

For each anomaly, at the beginning of each month we sort stocks into value-weighted decile portfolios based on the relevant characteristic. We consider all stocks traded on the NYSE, AMEX, and NASDAQ, after excluding stocks with a price below $5. The long-short anomaly portfolio goes long (short) the tenth (first) decile portfolio, where the long (short) leg is expected to generate relatively high (low) returns. We exclude anomalies that are interactions of two separate signals (since such anomalies are essentially based on multiple anomalies) and those that are indicator variables (such as IPOs). We also use anomalies that have a nonmissing return starting in 1985.17 The market excess return is the CRSP value-weighted market return minus the risk-free return (also from CRSP).

The sample period spans 1970:01 to 2017:12. Table I reports summary statistics for the anomaly portfolio returns. Of the 100 representative anomalies, 75, 71, 56, and 49 generate alphas with -statistics (in magnitude) above 1.645, 1.96, 2.58, and 3, respectively, in the context of the Fama and French (1993) three-factor model. The average correlation between decile rankings of any two anomalies is 0.05, which is similar to those reported in McLean and Pontiff (2016) and Green, Hand, and Zhang (2017), so that a stock considered overvalued or undervalued by one anomaly has a nearly equal chance of being considered overvalued or undervalued by another anomaly. This is consistent with the argument of Stambaugh, Yu, and Yuan (2012, 2015) and Akbas et al. (2015) that individual anomaly characteristics can be noisy indicators of whether a stock is mispriced. It also suggests that aggregating the information in anomalies can help reduce the noise in individual anomalies.

Table I. Summary Statistics
The table reports summary statistics for monthly anomaly portfolio returns for 100 anomalies. The sample period is 1970:01 to 2017:12. For each anomaly, we sort stocks into value-weighted decile portfolios according to the characteristic underlying the anomaly. The long-short anomaly portfolio goes long (short) the tenth (first) decile portfolio.
Number of anomalies 100
Fama and French (1993) three-factor model alpha
Number of long-short anomaly portfolio returns with 1.645 75
Number of long-short anomaly portfolio returns with 1.96 71
Number of long-short anomaly portfolio returns with 2.58 56
Number of long-short anomaly portfolio returns with 3 49
Average correlation across anomaly decile rankings 0.05
Average correlation across monthly anomaly excess returns
Long leg 0.76
Short leg 0.82
Long-short 0.08
Long-leg anomaly portfolio excess returns
Average of sample means 0.71%
Average of sample standard deviations 5.16%
Short-leg anomaly portfolio excess returns
Average of sample means 0.33%
Average of sample standard deviations 6.20%
Long-short anomaly portfolio returns
Average of sample means 0.38%
Average of sample standard deviations 4.37%

On average, the long- and short-leg portfolio returns of any two anomalies are strongly correlated (0.76 and 0.82, respectively). In the context of Section I, this likely reflects the common component in the long- and short-leg returns unrelated to mispricing (). The long-short returns are relatively weakly correlated on average (0.08). This is consistent with the presence of idiosyncratic components in the individual long-short anomaly returns, as discussed in Section I.D. In support of the condition in equation (17), the average of the sample standard deviations for the long-short portfolio returns is about 70% of that for the short-leg excess returns, indicating that long-short returns provide a sharper predictive signal than short-leg returns, as discussed in Section I.C. Figure IA.1 in the Internet Appendix provides histograms for the sample autocovariances for the long- and short-leg excess returns. Consistent with stronger MCP for overpricing vis-à-vis underpricing in equation (15), the mean of the autocovariances for the short-leg excess returns is considerably larger than that for the long-leg excess returns (4.39 and 2.14, respectively), and the difference in means is significant at the 1% level.18

IV. Out-of-Sample Results

In constructing the out-of-sample forecasts, we use the first 10 years (1970:01 to 1979:12) of the full sample period as the initial in-sample estimation period. The subsequent five years (1980:01 to 1984:12) serve as the initial holdout out-of-sample period for computing the C-ENet forecast, so that 1985:01 to 2017:12 (396 observations) constitutes the out-of-sample period for forecast evaluation. Because the methods in Section II.A require nonmissing predictor data (with the exception of the predictor average), for an anomaly with a missing return in a given month, we fill in the missing return with the cross-sectional average for the available anomaly returns in that month.

A. Forecast Accuracy

Figure 1 depicts monthly market excess return forecasts based on the 100 long-short anomaly portfolio returns and strategies described in Section II.A. The conventional OLS forecast is highly volatile. In fact, the conventional OLS forecast is more than 6% (72% annualized) in magnitude for a number of months; such extreme forecasts point to severe overfitting. The other forecasts are substantially less volatile than the conventional OLS forecast. In other words, as intended, the ENet, simple combination, C-ENet, predictor average, principal component, and PLS strategies shrink the forecasts toward the prevailing mean (while still incorporating information from the 100 long-short anomaly portfolio returns). The simple combination forecast exerts a particularly strong shrinkage effect (as discussed in Section II.A). Excluding the conventional OLS forecast, the PLS forecast demonstrates the most volatility, with the volatilities for the ENet, C-ENet, predictor average, and principal component forecasts falling between those for the simple combination and PLS forecasts. We note that the market excess return forecasts in Figure 1 are typically more volatile around business-cycle recessions, which indicates that the long-short anomaly portfolio returns themselves are more volatile around recessions.

Details are in the caption following the image
Market excess return forecasts based on 100 long-short anomaly portfolio returns. Each panel depicts the competing market excess return forecast in the panel heading and the prevailing mean benchmark forecast. Vertical bars indicate business-cycle recessions as dated by the National Bureau of Economic Research. [Color figure can be viewed at wileyonlinelibrary.com]

Table II reports statistics for monthly market excess return forecasts based on the 100 long-short anomaly portfolio returns. The conventional OLS forecast produces a negative statistic that is extremely large in magnitude, confirming that the forecast is plagued by overfitting.19 In sharp contrast, the other forecasts all generate positive statistics, so that they are more accurate than the prevailing mean benchmark over the 1985:01 to 2017:12 out-of-sample period. Using the Campbell and Thompson (2008) threshold of 0.5%, the monthly statistics for all six of the forecasts designed to guard against overfitting are economically significant; indeed, the statistics for the ENet, C-ENet, predictor average, and PLS forecasts are quite sizable (2.03%, 2.81%, 1.89%, and 2.06%, respectively), making them among the highest in the literature to date. Furthermore, according to the Clark and West (2007) statistics, all six of the forecasts designed to guard against overfitting provide statistically significant improvements in MSFE vis-à-vis the prevailing mean benchmark.

Table II. Statistics
The table reports Campbell and Thompson (2008) out-of-sample () statistics in percent for market excess return forecasts based on 100 anomaly portfolio returns. The out-of-sample period is 1985:01 to 2017:12. The OLS (ENet) forecast is based on ordinary least squares (elastic net) estimation of a multiple predictive regression that includes all 100 of the anomaly portfolio returns. Combine is the arithmetic mean of univariate predictive regression forecasts based on the 100 individual anomaly portfolio returns (considered in turn). C-ENet is the arithmetic mean of the univariate predictive regression forecasts selected by the elastic net in a Granger and Ramanathan (1984) regression. Avg is a univariate predictive regression forecast based on the cross-sectional average of the 100 anomaly portfolio returns. PC (PLS) is a univariate predictive regression forecast based on the first principal component (target-relevant factor) extracted from the 100 anomaly portfolio returns. Based on the Clark and West (2007) test, *, **, and *** indicate significance at the 10%, 5%, and 1% level, respectively, for the positive statistics.
(1) (2) (3) (4) (5) (6) (7) (8)
Anomaly
Portfolio OLS ENet Combine C-ENet Avg PC PLS
Long-short −2,513.86 2.03** 0.89*** 2.81*** 1.89** 1.25** 2.06***
return
Long-leg −344,960.22 −0.90 0.29 −0.68 0.26 0.24 0.41
excess return
Short-leg −13,284.68 1.81* 0.72* 0.39* 0.75* 0.74* 0.84*
excess return

Overall, we find that the information in the group of 100 long-short anomaly portfolio returns is quite useful for predicting the monthly market excess return, provided that we employ forecasting strategies that guard against overfitting. The fact that the ENet, simple combination, C-ENet, predictor average, principal component, and PLS forecasts all generate statistically and economically significant improvements in out-of-sample accuracy indicates that our results are robust and not overly reliant on a particular method for circumventing overfitting.

Table II also reports results for monthly market excess return forecasts based on long- and short-leg anomaly portfolio excess returns. A clear pattern emerges: The short-leg excess returns provide some evidence of statistically and economically significant predictive ability, while there is no statistically or economically significant evidence of predictive ability for the long-leg excess returns. As discussed in Section I.B, this pattern is consistent with asymmetric limits of arbitrage and stronger MCP for overpricing vis-à-vis underpricing.

Comparing the results in Table II for the long-short and short-leg excess returns, the former perform better than the latter for forecasting the monthly market excess return. As explained in Section I.C (and supported by the average volatilities reported in Table I), the long-short return can provide a sharper predictive signal when the long and short legs contain a sizable common component that is unrelated to the future market excess return, as the long-short return filters the common component to produce a less-diluted signal.

B. Economic Value

As described in Section II, we also measure the marginal economic benefit of the predictive ability of long-short anomaly portfolio returns for a mean-variance investor with a relative risk aversion coefficient of three who allocates between the market portfolio and risk-free Treasury bills. Figure 2 plots log cumulative excess returns for portfolios using market excess return forecasts based on the 100 long-short anomaly returns. The figure also depicts the log cumulative excess return for the portfolio based on the prevailing mean benchmark forecast, as well as that for the market portfolio. Figure 2 reveals that portfolios that incorporate the information in the 100 long-short anomaly returns generally exhibit superior performance compared to both the portfolio based on the prevailing mean benchmark forecast and the market portfolio.

Details are in the caption following the image
Log cumulative excess returns for portfolios constructed using market excess return forecasts based on 100 long-short anomaly returns. Each panel depicts the log cumulative excess return for a portfolio constructed using the market excess return forecast in the panel heading and the prevailing mean benchmark forecast, as well as the log cumulative excess return for the CRSP value-weighted market portfolio. Each panel also reports the annualized average utility gain when the investor uses the competing forecast in the panel heading in lieu of the prevailing mean benchmark forecast. Vertical segments indicate business-cycle recessions as dated by the National Bureau of Economic Research. [Color figure can be viewed at wileyonlinelibrary.com]

Figure 2 also shows annualized average utility gains when the investor uses a competing forecast of the market excess return in lieu of the prevailing mean benchmark. For the conventional OLS forecast based on the 100 long-short anomaly returns (not shown in the figure), there is a large loss of −4.97%, which further manifests the overfitting problem for the conventional approach. In contrast, the forecasts designed to guard against overfitting all generate substantial utility gains. The smallest annualized gain obtains for the simple combination forecast (2.59%), which is still economically sizable.20 The annualized gain reaches as high as 6.38% for the PLS forecast, and it is above 600 basis points for the ENet and C-ENet forecasts (6.26% and 6.06%, respectively).

Although largely ignored by the market return predictability literature, it is important to assess the statistical significance of the average utility gains. To do so, we use the moving-block bootstrap procedure recommended by McCracken and Valente (2018) to compute critical values for testing : against : in equation (34). The utility gains for the portfolios based on the ENet, predictor average, and principal component (simple combination, C-ENet, and PLS) forecasts are significant at the 5% (1%) level. Overall, the utility gains provide further evidence that long-short anomaly portfolio returns contain valuable information for predicting the market excess return.21

For the portfolios based on the ENet, C-ENet, predictor average, principal component, and PLS forecasts, the information in the long-short anomaly returns appears especially valuable for improving performance during the latter half of the Great Recession. It is worth noting that most of the 100 anomalies were published before that time, suggesting that the utility gains accruing to the information in the group of anomalies are not limited to the anomalies' prepublication periods.22

C. Additional Results

Because the long- and short-leg anomaly returns are components of the market return, the lagged market return itself potentially contains relevant information for forecasting the monthly market return. For the 1970:01 to 2017:12 sample period, the autocorrelation for the market excess return is 0.08 (significant at the 10% level). However, the lagged market excess return does not evince significant out-of-sample predictive ability for the 1985:01 to 2017:12 out-of-sample period. As explained in Section I.C, this is not surprising, since the market return is a noisier predictor than the long-short anomaly portfolio return.

Long-short anomaly portfolio returns also show stronger predictive power for the monthly market excess return than popular predictors from the literature for the 1985:01 to 2017:12 out-of-sample period. Table IA.VI in the Internet Appendix reports statistics for univariate predictive regression forecasts based on 16 popular predictors, including a variety of economic variables (e.g., valuation ratios, interest rates, interest rate spreads, and inflation) from Goyal and Welch (2008) and technical indicators from Neely et al. (2014).23 Among the 16 predictors, three produce a positive statistic: the 10-year Treasury bond yield (0.54%, significant at the 10% level) and two of the technical indicators, MA(1,12) and MOM(6) (0.45% and 0.19%, respectively). When we aggregate the information in the 16 predictors using the methods in Section II.A, only the simple combination forecast outperforms the prevailing mean benchmark ().24 Hence, these predictors appear less useful than long-short anomaly returns for predicting the market excess return.

To directly compare the information in the popular predictors to that in the long-short anomaly returns, Table IA.VIII in the Internet Appendix reports Harvey, Leybourne, and Newbold (1998) forecast encompassing test results for monthly market excess return forecasts based on the two sets of predictors. Forecasts based on the popular predictors do not encompass those based on the long-short anomaly returns, and thus the anomaly returns contain information useful for forecasting the monthly market excess return beyond that contained in the popular predictors. The information content of long-short anomaly returns therefore appears quite different from that of popular predictors from the literature.25 Moreover, as shown in Table II and Figure 2, the differential information in long-short anomaly returns delivers statistically and economically significant out-of-sample gains.

The out-of-sample predictive ability of long-short anomaly portfolio returns at longer horizons is also of interest. Table IA.X in the Internet Appendix reports statistics for market excess return forecasts based on the 100 anomalies for horizons of two, three, six, and 12 months.26 Predictability tends to fall as the horizon increases, and we find no significant evidence of predictability at the 12-month horizon. The results are consistent with our economic explanation, as we expect the mispricing related to anomalies to correct reasonably quickly, so that long-short anomaly returns are primarily short-horizon predictors.

As an external validity test, we examine the out-of-sample predictive ability of long-short anomaly portfolio returns for industry excess returns.27 Specifically, we consider five industry portfolios (spanning the market) from Kenneth French's Data Library: Consumer, Manufacturing, Hi-Tech, Health, and Other.28 Table IA.XI in the Internet Appendix reports statistics for the five industry excess returns. The conventional OLS forecast continues to perform poorly, as the statistics are negative and large in magnitude for all of the industries. As with the market excess return, the forecasting strategies designed to guard against overfitting perform substantially better, producing positive statistics for three to five of the industries. For these strategies, 21 of the 30 statistics are significant at conventional levels. Out-of-sample return predictability is especially strong for Consumer, Manufacturing, and Other, and a number of the statistics are above 3% for these industries.29 The results for the industry excess returns provide further evidence that long-short anomaly portfolio returns are genuine predictors of the market excess return.

As a final assessment of out-of-sample return predictability in this section, we explore the effects of the selection of the long-short portfolio returns used to forecast the market excess return.30 We begin with a set of more than 18,000 long-short portfolio returns from Yan and Zheng (2017), which are constructed by sorting on individual firm characteristics from Compustat.31 From the complete set of long-short portfolio returns, we form two subsets with insignificant and significant alphas.32 Based on the evidence in Yan and Zheng (2017), long-short portfolio returns with highly significant alphas are likely to be meaningful anomalies and not the result of data mining. We randomly draw two groups of 100 long-short portfolio returns from the respective subsets. For each group, we generate out-of-sample market excess return forecasts beginning in 1985:01 using the predictor average and compute the statistics. Repeating this process 1,000 times generates empirical distributions for the statistics for market excess return forecasts based on randomly selected groups of 100 insignificant and significant long-short portfolio returns.

Figure 3 presents histograms for the statistics for the two subsets of long-short portfolio returns. The statistics corresponding to the significant long-short portfolio returns are clustered to the right of those for the insignificant long-short portfolio returns. The difference between the two distributions' means (1.91) is significant at the 1% level. The histograms in Figure 3 indicate that groups of significant long-short portfolio returns generally improve out-of-sample market excess return forecasts. Overall, groups of long-short portfolio returns that are significantly more likely to constitute anomalies appear more informative for predicting the market excess return, providing further evidence for the importance of anomalies.

Details are in the caption following the image
Histograms for statistics. The figure depicts histograms for Campbell and Thompson (2008) out-of-sample () statistics in percent for market excess return forecasts based on the predictor average. The forecasts are based on 1,000 randomly selected groups of 100 long-short portfolio returns from Yan and Zheng (2017) with insignificant (Panel (a)) and significant (Panel (b)) Fama and French (1993) three-factor model alphas. The vertical dashed line delineates the statistic for the predictor average forecast based on the long-short anomaly portfolio returns in Table II. [Color figure can be viewed at wileyonlinelibrary.com]

V. Asymmetric Limits of Arbitrage

In this section, we further examine the relevance of asymmetric limits of arbitrage for market excess return predictability based on long-short anomaly portfolio returns.

A. Slope Coefficients

We first examine recursive estimates of the slope coefficients in the predictive regressions underlying the predictor average, principal component, and PLS forecasts. These predictive regressions provide a convenient means for capturing the information in all 100 of the long-short anomaly portfolio returns in a single slope coefficient.

The predictive regression underlying the predictor average forecast is given by
(35)
where and is the long-short portfolio return for the th anomaly for (). For the principal component forecast, we replace in equation (35) with , the first principal component extracted from the 100 long-short anomaly portfolio returns. We set the sign of the principal component so that it is positively correlated with , meaning that an increase in can be interpreted as a general increase in the long-short anomaly portfolio returns. For the PLS forecast, we replace in equation (35) with , the first target-relevant factor extracted from the 100 long-short anomaly portfolio returns, and we again set the sign of so that it is positively correlated with . When computing the recursive slope coefficients estimates based on and , the principal components and target-relevant factors are computed using data available at the time of forecast formation, so that there is no look-ahead bias in the recursive estimates.

Panels (a) to (c) of Figure 4 depict recursive estimates of the standardized slope coefficients in equation (35) and their 90% confidence intervals when the explanatory variable is , , and , respectively.33 The recursive estimates are always negative, consistent with stronger MCP for overpricing vis-à-vis underpricing. As expected, the 90% confidence intervals tend to narrow as the estimation sample lengthens. For the predictor average (principal component) regression, the recursive estimates become significant starting in the mid-to-late 1990s (early 2000s) and remain significant thereafter; for the PLS regression, the recursive estimates are always significant. The recursive estimates are quite stable in all three panels of Figure 4 from the early to late 2000s. They then become larger in magnitude around the Global Financial Crisis and concomitant Great Recession and remain relatively stable thereafter, suggesting that asymmetric limits of arbitrage have not substantially diminished in importance since the early 2000s.34

Details are in the caption following the image
Recursive slope coefficient estimates. Solid lines depict standardized recursive slope coefficient estimates used to compute the predictor average, principal component, and partial least squares forecasts of the market excess return based on 100 long-short anomaly portfolio returns. Dashed lines denote 90% confidence intervals. Vertical bars indicate business-cycle recessions as dated by the National Bureau of Economic Research. [Color figure can be viewed at wileyonlinelibrary.com]

B. Market Segments

According to the intuition in Section I, an important part of the predictive ability of long-short anomaly portfolio returns stems from their lead-lag covariances with segments of the market return. To obtain insight into this mechanism, we examine the cross-autocovariances between long-short anomaly portfolio returns and decile excess returns. This allows us to investigate the relevance of the information contained in long-short anomaly returns for different segments of the market related to the anomalies.

To incorporate information from the entire group of 100 anomalies and construct market segments without overlapping stocks, we proceed as follows.35 For a given month, we sort stocks according to each anomaly characteristic (considered in turn). For each stock, we then take the average of its ranks across the anomalies; we take the average of the ranks to account for the fact that some stocks have missing observations for some characteristics in a given month. We then sort stocks into value-weighted decile portfolios according to their average ranks, with the first (tenth) decile constituting the most overvalued (undervalued) stocks. The long-short portfolio again goes long (short) the tenth (first) decile portfolio.

Panel (a) of Figure 5 plots estimates of for the decile excess returns, indexed by . The cross-autocovariances are all negative. The first decile excess return () displays the largest covariance in magnitude with , so that MCP appears most relevant in the short leg. Nevertheless, the covariances of many of the other decile excess returns with the lagged long-short anomaly return are often significant at conventional levels, so that the mispricing detected in the long-short anomaly return has important implications for the market more broadly. The magnitudes of the cross-autocovariances tend to decrease from the first to the tenth deciles, and the cross-autocovariance for the tenth decile excess return () is the smallest in magnitude (and statistically insignificant). Lagged long-short anomaly returns thus appear more relevant for segments of the market that contain stocks that are relatively more overpriced on average according to the anomalies, consistent with asymmetric limits of arbitrage.36

Details are in the caption following the image
Cross-autocovariances for long-short anomaly and market segment returns. Each panel depicts estimates of cross-autocovariances for the month decile excess return and month long-short anomaly return. The deciles are constructed by sorting stocks according to their average rank across 100 anomaly characteristics. The decile excess returns in Panel (b) are weighted by the deciles' market capitalization shares. *, **, and *** indicate significance at the 10%, 5%, and 1% level, respectively. [Color figure can be viewed at wileyonlinelibrary.com]
For the deciles constructed based on the average of anomaly ranks, and in the spirit of Lo and MacKinlay (1990) and Lewellen (2002), we can obtain an exact decomposition of the covariance between the long-short anomaly return and next month's market excess return as follows:
(36)
where is the th decile's market capitalization share. Panel (b) of Figure 5 plots estimates of the cross-autocovariances on the right-hand side of equation (36), which weight the decile returns by their market capitalization shares.37 The results are similar to those in Panel (a) in that the cross-autocovariances are all negative, the cross-autocovariance for the first decile is the largest in magnitude, the cross-autcovariances are significant for many market segments, and the cross-autocovariances tend to decrease in magnitude from the first to the tenth deciles.

To further explore the relevance of overpricing correction persistence, we examine return predictability for portfolios of stocks that are most overvalued according to the predictor average.38 The predictor average (i.e., the cross-sectional average of the long-short anomaly portfolio returns) for a given month is effectively a portfolio of all of the individual available stocks for the month (although some stocks may receive zero weights). Each month, we back out the portfolio weights corresponding to the predictor average and sort stocks according to the predictor average weights. We then form a portfolio (NEG10-PA) that consists of the 10% of stocks with the largest negative weights (in magnitude) in the predictor average portfolio; the weights for the stocks in the NEG10-PA portfolio are proportional to the absolute values of their weights in the predictor average portfolio. Based on a univariate predictive regression, we then use the excess return on the NEG10-PA portfolio to predict the excess return on a portfolio of the same stocks with the same weights in the next month. For the 1985:01 to 2017:12 out-of-sample period, the statistic for the predictive regression forecast is 1.87%, which is significant at the 5% level. We then repeat the same analysis using the same 10% of stocks each month, but instead use portfolio weights proportional to market capitalization. The statistic in this case is nearly a third smaller (0.65%) and insignificant at conventional levels. In sum, we find stronger out-of-sample return predictability for a portfolio of overvalued stocks (as detected by the predictor average) when we weight the stocks according to their relative importance in the predictor average portfolio rather than simply by their market capitalization. This result indicates that the information in the long-short anomaly portfolio returns is linked to overpricing correction persistence.

C. Subgroups

In this section, we generate out-of-sample forecasts based on subgroups of anomalies formed according to proxies for asymmetric limits of arbitrage.39 We form subgroups based on three proxies: bid-ask spread (BA), idiosyncratic volatility (IDIO), and market capitalization (SIZE).40 For a given month and anomaly characteristic, we first sort stocks into deciles as described in Section III and compute the average values for a given proxy for the stocks in the long and short legs. We then compute the long-leg average value for the proxy minus the short-leg average value for the proxy for each month. Finally, we compute the time-series average of the differences for 1970:01 to 1984:12. We denote the time-series averages for the differences for the bid-ask spread, idiosyncratic volatility, and size proxies by DTSA-BA, DTSA-IDIO, and DTSA-SIZE, respectively.

We form the subgroups BA-NEG, IDIO-NEG, and SIZE-NEG (BA-POS, IDIO-POS, and SIZE-POS), which are the subgroups of anomalies for which DTSA-BA, DTSA-IDIO, and DTSA-SIZE, respectively, are negative (positive). In line with our out-of-sample focus, we exclude data from the forecast evaluation period when determining the subgroups.41 Stocks with larger bid-ask spreads, greater idiosyncratic volatility, and smaller market capitalization are typically viewed as having stronger limits of arbitrage. We therefore expect greater market return predictability based on BA-NEG, IDIO-NEG, and SIZE-POS compared to BA-POS, IDIO-POS, and SIZE-NEG, respectively, as the former three subgroups represent anomalies with asymmetrically strong limits of arbitrage in their short legs vis-à-vis their long legs. To combine the information in the different proxies, we also form a subgroup that is the union of the anomalies in BA-NEG, IDIO-NEG, and SIZE-POS, as well as a subgroup that comprises the complement of the union. We expect the union subgroup to have stronger predictive ability relative to its complement.

Table III reports statistics for the different subgroups using the strategies designed to guard against overfitting.42 Although the bid-ask spread, idiosyncratic volatility, and size are quite noisy proxies for limits of arbitrage, the results generally support the relevance of asymmetric limits of arbitrage and stronger MCP for overpricing vis-à-vis underpricing. For example, Panel A reports results for the two subgroups formed using the bid-ask spread. The statistics for the BA-NEG subgroup are all positive and sizable in magnitude, well above the Campbell and Thompson (2008) threshold for economic significance. The statistics for the ENet, C-ENet, and PLS forecasts are particularly large (3.57%, 3.65%, and 2.94%, respectively). All six of the forecasts based on the BA-NEG subgroup are significant at conventional levels according to the Clark and West (2007) statistics. For each forecasting strategy, the statistic for the BA-POS subgroup is always lower than the corresponding statistic for the BA-NEG subgroup. Two of the statistics for the BA-POS subgroup are negative, and only two are above the 0.5% threshold. The results in Panel A align with the intuition in Section I, with relatively stronger limits of arbitrage in the short leg producing greater market return predictability. The results for the subgroups based on idiosyncratic volatility and size in Panels B and C, respectively, deliver similar messages as those for the subgroups based on the bid-ask spread in Panel A. Panel D of Table III reports statistics for the union of the BA-NEG, IDIO-NEG, and SIZE-POS subgroups. All of the statistics for the union subgroup are statistically and economically significant. For the complement subgroup, five of the statistics are negative, and the positive statistic is neither statistically nor economically significant. Continuing the pattern in Panels A to C, the statistics for the subgroup that we expect to have stronger predictive ability (union) are always larger than the corresponding statistics for the subgroup that we expect to have weaker predictive ability (complement).

Table III. Statistics for Subgroups
The table reports Campbell and Thompson (2008) out-of-sample () statistics in percent for market excess return forecasts based on long-short anomaly portfolio returns for the subgroup in the first column. The out-of-sample period is 1985:01 to 2017:12. The ENet forecast is based on elastic net estimation of a multiple predictive regression that includes all of the long-short anomaly portfolio returns in the subgroup. Combine is the arithmetic mean of univariate predictive regression forecasts based on the individual long-sort anomaly portfolio returns in the subgroup (considered in turn). C-ENet is the arithmetic mean of the univariate predictive regression forecasts selected by the elastic net in a Granger and Ramanathan (1984) regression. Avg is a univariate predictive regression forecast based on the cross-sectional average of the long-short anomaly portfolio returns in the subgroup. PC (PLS) is a univariate predictive regression forecast based on the first principal component (target-relevant factor) extracted from the long-short anomaly portfolio returns in the subgroup. Based on the Clark and West (2007) test, *, **, and *** indicate significance at the 10%, 5%, and 1% level, respectively, for the positive statistics.
(1) (2) (3) (4) (5) (6) (7)
Subgroup ENet Combine C-ENet Avg PC PLS
Panel A: Bid-Ask Spread
BA-NEG 3.57*** 1.13*** 3.65*** 1.97** 1.39** 2.94***
BA-POS −0.28 0.48** −1.41 0.97** 0.85* 0.29**
Panel B: Idiosyncratic Volatility
IDIO-NEG 1.63*** 0.97*** 2.43*** 1.53** 1.24** 1.85***
IDIO-POS 1.17* 0.42** −0.43 −0.33 0.12 −0.14
Panel C: Market Capitalization
SIZE-NEG 0.39 0.40** −1.01 0.43 0.19 −1.20
SIZE-POS 2.39*** 1.04** 2.91*** 1.60** 1.15** 1.59***
Panel D: Bid-Ask Spread, Idiosyncratic Volatility, Market Capitalization
Union of BA-NEG, 2.67*** 0.98*** 3.10*** 1.93** 1.23** 2.12***
IDIO-NEG,
SIZE-NEG
Complement of union 0.37 0.05 −0.09 −0.26 −0.31 −0.83

We also form subgroups via a double sort to identify subgroups of anomalies with both high limits of arbitrage overall and high asymmetric limits of arbitrage with respect to their short legs. To do so, for BA, we first identify anomalies with relatively low and high average values for the proxy for the stocks in their long and short legs (BA-LOW and BA-HIGH, respectively). For each of the BA-LOW and BA-HIGH subgroups, we then form NEG and POS subgroups as described above, resulting in four subgroups based on BA (BA-LOW-NEG, BA-LOW-POS, BA-HIGH-NEG, and BA-HIGH-POS). For compactness, we take the union of the BA-LOW-NEG, BA-LOW-POS, and BA-HIGH-POS subgroups to form the BA-REST group. We then compare the results between the BA-HIGH-NEG and BA-REST subgroups. In a similar manner, we form IDIO-HIGH-NEG and IDIO-REST (SIZE-LOW-POS and SIZE-REST) subgroups based on IDIO (SIZE). We expect the BA-HIGH-NEG, IDIO-HIGH-NEG, and SIZE-LOW-POS subgroups to exhibit stronger predictive ability compared to the BA-REST, IDIO-REST, and SIZE-REST subgroups, as the former three subgroups are the anomalies with both high limits of arbitrage overall and high asymmetric limits of arbitrage with respect to their short legs. As shown in Table IA.XIV in the Internet Appendix, this is indeed what we find.

D. Market Frictions

As Gârleanu and Pedersen (2013, 2016) and Dong, Kang, and Peress (2020) argue, frictions such as limited risk-bearing capacity and transaction costs induce arbitrageurs to slowly respond to mispricing, resulting in MCP. If our predictability finding is driven by arbitrageurs slowly correcting mispricing in the presence of asymmetric limits of arbitrage and stronger MCP for overpricing vis-à-vis underpricing, then long-short anomaly portfolio returns should contain more relevant information for predicting the market excess return during times of high frictions. We investigate this issue by testing for an increase in the statistic during periods of high frictions using equation (29).

We consider a variety of proxies for market frictions from the literature. We begin with the level of and innovations to aggregate liquidity from Pástor and Stambaugh (2003). We also consider idiosyncratic volatility, which is widely believed to be a major implementation cost of short arbitrage (e.g., Pontiff (2006)). We measure aggregate idiosyncratic risk for a given month by first calculating the idiosyncratic volatility of individual stocks following Ang et al. (2006) and then computing the value-weighted average of the idiosyncratic volatilities for the individual stocks. Furthermore, we use trading noise (Hu, Pan, and Wang (2013)), which tracks the shortage in arbitrage capital via noise in Treasury security prices, and short fees (Asness et al. (2018)), which measure the cost of shorting stocks. For the latter, we again aggregate to the market level by computing the value-weighted average of short fees for individual stocks.43

In addition to the above variables, we consider risk, uncertainty, and risk aversion indices as proxies for the frictions affecting the risk-bearing capacity of arbitrageurs. Along with the VIX, we use macroeconomic, financial, and real uncertainty indices from Jurado, Ludvigson, and Ng (2015), who construct the indices by estimating stochastic volatility series for the residuals from fitted dynamic diffusion-index models based on macroeconomic and financial variables. Finally, we consider the jointly estimated economic uncertainty and risk aversion indices from Bekaert, Engstrom, and Xu (2021). For all of the proxies, we separate high- and low-friction regimes using the sample median.

Table IV reports differences in statistics (in percentage points) between high- and low-friction regimes for market excess return forecasts based on the 100 long-short anomaly portfolio returns. In support of the relevance of asymmetric limits of arbitrage and stronger MCP for overpricing relative to underpricing, the statistics are almost always higher during high-friction periods, and the vast majority of the increases are significant according to the augmented Clark and West (2007) test. Furthermore, the magnitudes of the increases in the statistics are economically sizable, exceeding 10 percentage points for some of the forecasts for frictions proxied by liquidity innovations, trading noise, and short fees.

Table IV. Statistic Differences between High- and Low-Friction Regimes
The table reports percentage-point increases in Campbell and Thompson (2008) out-of-sample () statistics for market excess return forecasts based on 100 long-short anomaly portfolio returns. The out-of-sample period is 1985:01 to 2017:12. The increase is computed between high- and low-friction periods, which are defined using the sample median of the variable in the first column. The ENet forecast is based on elastic net estimation of a multiple predictive regression that includes all 100 of the long-short anomaly portfolio returns. Combine is the arithmetic mean of univariate predictive regression forecasts based on the 100 individual long-short anomaly portfolio returns (considered in turn). C-ENet is the arithmetic mean of the univariate predictive regression forecasts selected by the elastic net in a Granger and Ramanathan (1984) regression. Avg is a univariate predictive regression forecast based on the cross-sectional average of the 100 long-short anomaly portfolio returns. PC (PLS) is a univariate predictive regression forecast based on the first principal component (target-relevant factor) extracted from the 100 long-short anomaly portfolio returns. We use an augmented version of the Clark and West (2007) statistic to test for an increase in the statistic; *, **, and *** indicate significance at the 10%, 5%, and 1% level, respectively.
(1) (2) (3) (4) (5) (6) (7)
Friction ENet Combine C-ENet Avg PC PLS
Aggregate liquidity 4.48** 0.86** 5.20** 3.92** 3.64** 9.23**
Liquidity innovations 6.53** 1.42** 6.56** 7.82*** 6.11*** 13.21***
Idiosyncratic volatility 2.73* 1.06** 4.21** 4.29** 3.57** 2.05**
Trading noise 3.67** 1.49*** 5.11** 8.12*** 5.83*** 12.42***
Short fees 10.34* 2.50** 12.56* 9.63** 8.50** 14.70**
VIX −0.25 1.26** 4.87** 6.06** 6.10** 8.93**
Financial uncertainty −1.28 0.02* 1.18* 3.24** 2.80** 3.94**
Macro uncertainty 1.20 0.95** 3.28* 3.97** 4.09** 7.21**
Real uncertainty 2.02* 1.10** 6.19** 4.47** 4.31** 8.02***
Economic uncertainty 1.29 0.88** 3.93* 3.67** 3.48** 3.59**
Risk aversion uncertainty 3.52** 0.60** 6.36** 4.68** 4.19** 7.38**

E. Arbitrage Trading

Next, we examine whether long-short anomaly portfolio returns predict arbitrageurs' trading in the broad market and the tone of market-wide news. If the overpricing correction signaled by long-short anomaly returns during month is only partial and continues to spread throughout the market in the subsequent month, then we expect an increase in long-short anomaly returns in month to lead to a reduction in arbitrageurs' net positions (i.e., long minus short positions) in the broad market in month , which is likely to be driven primarily by an increase in their short positions; we also expect more negative market-wide news tone in month .

We first examine changes in arbitrageurs' aggregate net position in the broad market, where we follow Chen, Da, and Huang (2019) in constructing the net arbitrage trading measure. Specifically, we compute aggregate value-weighted long positions for hedge funds across all stocks in the market, where hedge funds' long positions in individual stocks are based on quarterly reported holdings in the 13F database. Agarwal et al. (2013) identify hedge funds by combining the information in the 13F database and five hedge fund databases. The 13F holdings data cover the largest number of institutional investors, as all institutional investment managers that have discretion over $100 million or more in Section 13(f) securities need to report their quarter-end holdings in these securities. A 13F-filing institution is classified as a hedge fund if its major business is sponsoring and/or managing hedge funds according to information from various sources (such as institution websites, SEC filings, industry publications, and news articles). Our final sample consists of 1,710 unique hedge funds.44 We construct the value-weighted short-selling positions across all stocks in the market by aggregating short interest in individual stocks. A stock's short interest in a month is the total number of uncovered shares sold short for transactions settled on or before the 15th of the month (from Compustat) divided by the total number of shares outstanding (from CRSP). We use short interest from the last month of each quarter to match the quarterly data frequency for hedge-fund holdings. The sample period covers 1980:1 to 2017:4.

Since hedge funds' aggregate long positions and short interest display trends, we compute deviations in the long positions of hedge funds and short interest from trailing four-quarter MAs. Chen, Da, and Huang (2019) show that the difference between the long and short arbitrage trading measures (i.e., net arbitrage trading) effectively aggregates the actions of arbitrageurs.

Table V reports standardized slope coefficient estimates for predictive regressions that relate quarterly arbitrage trading measures to lagged long-short anomaly portfolio returns, where we use long-short anomaly returns from the last month in the quarter. As in Section V.A, to capture the aggregate information across the 100 anomalies in a single slope coefficient, we estimate predictive regressions with , , and serving as the predictor (considered in turn). The estimates for the net market position regressions in the second column are all significant (at the 5% or 1% level). The estimates are all negative, which implies that an increase in long-short anomaly returns leads arbitrageurs to decrease their net positions in the market. The coefficient estimates are also economically sizable: A one-standard-deviation increase in the predictor corresponds to a 4.81% to 5.81% reduction in the net arbitrage market position (in terms of value-weighted shares outstanding).

Table V. Arbitrage Trading and News Tone
The table reports ordinary least squares estimates of standardized slope coefficient estimates for univariate predictive regressions that use the strategy in the first column to combine the information in 100 long-short anomaly portfolio returns to predict the variable in the column heading. Predictor average uses the the cross-sectional average of the 100 long-short anomaly portfolio returns. Principal component (partial least squares) uses the first principal component (target-relevant factor) extracted from the 100 long-short anomaly portfolio returns. The sample period for the second through fourth columns is 1980:1 to 2017:4; the sample period for the fifth column is 1996:01 to 2017:12. The dependent variable is standardized in the fifth column; *, **, and *** indicate significance at the 10%, 5%, and 1% level, respectively.
(1) (2) (3) (4) (5)
Strategy Arbitrage Net Market Position Arbitrage Long Market Position Arbitrage Short Market Position Financial Market News Tone
Predictor average −5.81*** −1.91 3.89** −0.36***
Principal component −4.81** −1.18 3.63** −0.32***
Partial least squares −5.32*** −1.62 3.70** −0.35***

The third and fourth columns of Table V report results for arbitrageurs' long and short positions, respectively. The coefficient estimates are all negative (positive) for the long (short) trading measure, indicating that arbitrageurs reduce (increase) their long (short) positions in the market in response to an increase in long-short anomaly returns. The estimates are all significant at the 5% level for the regressions for the short market position, while they are all insignificant for the regressions for the long market position. Comparing the results in the third and fourth columns, the coefficient estimates are larger in magnitude for the short side than for the long side, consistent with stronger MCP for overpricing. Decomposing the net change in the second column, approximately one-third comes from declines in long positions and two-thirds from increases in short positions.

Finally, the last column of Table V reports predictive regression results for monthly U.S. financial market news tone. The news tone measure, from Calomiris and Mamaysky (2019), is based on English-language news articles from Thomson Reuters. Monthly news tone is an aggregation of differences in word tone among news articles on the topic of U.S. financial markets. We detrend news tone by computing deviations from a trailing 12-month MA. To incorporate information from the 100 anomalies into a single slope coefficient, we again estimate predictive regressions with , , and serving as the predictor (considered in turn). We standardize both the dependent variable and the predictor before estimating the predictive regressions. The sample period is 1996:01 to 2017:12.45

To the extent that asymmetric limits of arbitrage generate relatively strong overpricing correction persistence, we expect an increase in long-short anomaly returns to lead to more negative market-wide news tone in the next month; in other words, public news is more likely to confirm that shares were broadly overvalued. The results in the last column of Table V support this conjecture. The coefficient estimates are all significantly negative (at the 1% level), and the economic magnitudes are sizable: A one-standard-deviation increase in the predictor results in a 0.32- to 0.36-standard-deviation drop in market news tone. Overall, the results in Table V are consistent with the view that long-short anomaly portfolio returns predict market returns because arbitrageurs slowly correct market-wide overpricing.

VI. Conclusion

Is cross-sectional stock return predictability related to the time-series predictability of the aggregate market excess return? Our paper provides the first positive systematic answer to this question. Specifically, we find that a representative group of 100 long-short anomaly portfolio returns from the cross-sectional literature contains valuable information for predicting the market excess return on an out-of-sample basis, provided that we use forecasting strategies that guard against overfitting the data. We explore economic channels underlying the predictive ability of long-short anomaly portfolio returns, explaining how asymmetric limits of arbitrage—by giving rise to overpricing correction persistence—can account for the patterns of return predictability in the data. We also provide evidence supporting the relevance of asymmetric limits of arbitrage and overpricing correction persistence for explaining our results. In light of our positive findings for the equity market, it would be interesting in future research to examine links between cross-sectional and time-series return predictability in other markets, such as bonds and currencies, by applying our methods for extracting predictive signals from large sets of cross-sectional anomalies.46

Editors: Stefan Nagel, Philip Bond, Amit Seru, and Wei Xiong

  • 1 Rapach and Zhou (2013) provide a survey of the literature on market excess return predictability.
  • 2 We explain our findings in a framework in which anomalies reflect mispricing. Whether our findings can also be explained in a risk-based framework is beyond the scope of the present paper but is an interesting topic for future research.
  • 3 Our study complements recent studies that employ machine-learning techniques to predict stock returns using alternative predictor variables in high-dimensional settings, including Rapach, Strauss, and Zhou (2013); Chinco, Clark-Joseph, and Ye (2019); Rapach et al. (2019); Freyberger, Neuhierl, and Weber (2020); Gu, Kelly, and Xiu (2020); Kozak, Nagel, and Santosh (2020); Avramov, Cheng, and Metzker (2021); Chen, Pelger, and Zhu (2021); Cong et al. (2021); Han et al. (2021); and Liu, Zhou, and Zhu (2021).
  • 4 In a related but different context, Fama and French (2020) compare cross-sectional and time-series factor models.
  • 5 We thank the editor, Stefan Nagel, for suggesting the framework for the data-generating process in this section.
  • 6 Following much of the cross-sectional literature, the anomaly portfolios in Section IV are based on stocks sorted into decile portfolios, and each long-short anomaly portfolio goes long (short) the 10th (first) decile portfolio. We discuss the extension to additional market segments in Section I.D.
  • 7 Relaxing this assumption does not change our main intuition, as discussed in footnote 11.
  • 8 Section I of the Internet Appendix provides details on the derivations of the standardized slope coefficients for the predictive regressions in Sections I.B to I.D. The Internet Appendix may be found in the online version of this article.
  • 9 As decreases, the degree of overpricing correction in the period immediately after the shock increases, so that the magnitude of the reversal effect increases. In the extreme, , which implies that for , so that the overpricing shock fully corrects in one period and . In this case, the second term in brackets is zero and in equation (9).
  • 10 Andrei and Cujean (2017) show that when information about mispricing spreads among traders at an accelerated rate, immediate correction of a current pricing error is dominated by the corrections of previous pricing errors, resulting in return momentum. From a behavioral perspective, Chan, Jegadeesh, and Lakonishok (1996); Daniel, Hirshleifer, and Subrahmanyams (1998); Hong and Stein (1999); and Da, Gurun, and Warachka (2014) explain return momentum as underreaction to news. Gârleanu and Pedersen (2013, 2016) and Dong, Kang, and Peress (2020) show that risk and/or cost considerations lead arbitrageurs to slowly allocate capital to correct mispricing.
  • 11 When the mispricing correction assumption in equation (6) does not hold, mispricing in response to a shock can be exacerbated after anomaly portfolio formation. Equation (9) still applies, so that again requires the expression in brackets—which depends on the successive pairs of return responses to a mispricing shock—to be positive. In this case, we need the momentum effect of either mispricing correction or exacerbation to be stronger than the reversal effect in successive return pairs.
  • 12 In a different context, Ehsani and Linnainmaa (2021) find evidence of positive autocorrelation in factor returns, which they use to explain cross-sectional momentum. In contrast, we focus on asymmetric MCP in the long and short legs of anomaly portfolios and its implications for market return predictability.
  • 13 For simplicity, we assume that the loadings on in equation (19) are equal to one for all of the long-short anomaly returns. When we allow for different loadings across anomalies, the strength of the signal in an anomaly will also depend on the magnitude of its loading. A number of the forecasting strategies in Section II.A can be interpreted as effectively placing greater emphasis on long-short anomalies with stronger signals (in a manner that guards against overfitting).
  • 14 The well-known Diebold and Mariano (1995) and West (1996) (DMW) procedure uses instead of as the dependent variable in equation (26). Clark and McCracken (2001) and McCracken (2007) show that the DMW test tends to be severely undersized when comparing forecasts from nested models (as in our application), in which case it has little power to detect improvements in forecast accuracy. Clark and West (2007) adjust the DMW test statistic so that its asymptotic distribution is well approximated by the standard normal.
  • 15 To keep the equity allocation in a reasonable range, we impose the restriction that .
  • 16 Section III of the Internet Appendix provides detailed definitions and relevant studies for the anomalies.
  • 17 The exception is the change in forecasted earnings per share, which begins in 1989.
  • 18 We also generate a forecast of the short-leg excess return via a univariate predictive regression with the lagged short-leg excess return serving as the predictor. As shown in Table IA.II in the Internet Appendix, there is significant out-of-sample evidence of serial dependence: The statistics are positive for 89 of the 100 anomalies, and 59, 31, and 2 are significant at the 10%, 5%, and 1% level, respectively. Table IA.III in the Internet Appendix presents the full matrix of cross-autocorrelations for the 100 long-short anomaly portfolio returns.
  • 19 In a univariate predictive regression setting, Campbell and Thompson (2008) find that truncating OLS forecasts from below at zero helps to improve out-of-sample performance. In our application, although this approach improves the accuracy of the conventional OLS forecast, it still performs substantially worse than the prevailing mean benchmark (). Thus, in our high-dimensional setting, truncating the OLS forecast appears insufficient for guarding against overfitting.
  • 20 Pástor and Stambaugh (2000) use a 2% annualized utility gain as a threshold for economic significance when comparing factor models that include the market excess return.
  • 21 Table IA.IV in the Internet Appendix reports Sharpe ratios for the portfolios, which follow the same pattern as the utility gains in Figure 2.
  • 22 Table IA.V in the Internet Appendix reports statistics and annualized average utility gains for the 1985:01 to 2001:12 and 2002:01 to 2017:12 subsamples. The results indicate substantial out-of-sample gains across both subsamples, again provided that we use strategies designed to guard against overfitting.
  • 23 Data used to construct the predictors come from Amit Goyal's website (http://www.hec.unil.ch/agoyal/) and Federal Reserve Economic Data (https://fred.stlouisfed.org/). The predictors are the log dividend-price ratio, log earnings-price ratio, return volatility computed using the approach in Mele (2007), three-month Treasury bill yield (deviation from a 12-month MA), 10-year Treasury bond yield (deviation from a 12-month MA), 10-year Treasury bond yield minus the three-month Treasury bill yield, Baa-rated corporate bond yield minus the Aaa-rated corporate bond yield, long-term Treasury bond return, corporate bond return minus the long-term Treasury bond return, net equity expansion, consumer price index inflation, industrial production growth, MA(1,12) (MA(3,12)) an indicator variable that takes a value of one if the S&P 500 index (three-month MA of the S&P 500 index) is greater than the 12-month MA of the S&P 500 index and zero otherwise, and MOM(6) (MOM(12)) an indicator variable that takes a value of one if the S&P 500 index is greater than its value six (12) months ago and zero otherwise.
  • 24 Table IA.VII in the Internet Appendix reports annualized average utility gains for forecasts based on the 16 predictors. Only the simple combination and principal component forecasts generate positive annualized utility gains (0.29% and 1.16%, respectively).
  • 25 Table IA.IX in the Internet Appendix reports statistics for univariate regressions of the ENet, simple combination, C-ENet, predictor average, principal component, and PLS forecasts based on the long-short anomaly returns on forecasts based on the 16 individual popular predictors (considered in turn). Further confirming the weak links between market excess return forecasts based on long-short anomaly returns and those based on popular predictors, only 12 of the 96 statistics are above 5%, and none are above 15%. Among the popular predictors, the forecast based on the corporate bond return minus the long-term Treasury bond return is generally the most strongly related to the forecasts based on the long-short anomaly returns.
  • 26 To compute market excess return forecasts beyond the one-month horizon, we modify the methods in Section II using the multiperiod return, , as the target variable. The HAC standard error used to compute the -statistic for the intercept in equation (26) accounts for the autocorrelation induced in the forecast errors by the overlap in the multiperiod return observations.
  • 27 We thank a referee for suggesting this external validity test.
  • 28 Available at https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html.
  • 29 Out-of-sample return predictability is the weakest for Health. This is likely due in part to the fact that the Health industry is a relatively small segment of the market. For 1970:01 to 2017:12, the average market capitalization share for Health is 9%, while the shares range from 19% to 30% for the other four industries.
  • 30 We thank a referee for suggesting this insightful analysis.
  • 31 We thank Xuemin Yan and Lingling Zheng for providing the data from Yan and Zheng (2017).
  • 32 For the purpose of forming the subsets, insignificant (significant) alphas are those with -statistics below one (above 2.58) in magnitude in the Fama and French (1993) three-factor model.
  • 33 The horizontal axes in Figure 4 correspond to the forecast month, so that the month estimate is based on data available through month . Confidence intervals for the slope coefficient estimates for the predictive regressions with and are based on the formulas for the standard errors in Bai and Ng (2006) and Kelly and Pruitt (2015), respectively.
  • 34 Table IA.XII in the Internet Appendix reports standardized OLS slope coefficient estimates for univariate predictive regressions based on the 100 individual long-short anomaly portfolio returns (considered in turn) for the 1970:01 to 2017:12 sample period. Further supporting stronger MCP for overpricing vis-à-vis underpricing, the vast majority (78) of the slope coefficient estimates are negative, and 41, 32, and 12 are significant at the 10%, 5%, and 1% level, respectively.
  • 35 We thank a referee for suggesting this informative procedure.
  • 36 The long-short portfolio return constructed from average ranks also demonstrates out-of-sample predictive ability for the market excess return. The statistic is 1.29%, which is significant at the 5% level according to the Clark and West (2007) test.
  • 37 We scale the cross-autocovariances in Figure 5 to makes them comparable across Panels (a) and (b).
  • 38 We thank a referee for suggesting this instructive analysis.
  • 39 We thank a referee for recommending the formation of informative subgroups along these lines.
  • 40 Idiosyncratic volatility is the standard deviation of the residuals for a regression of the daily excess return on the three Fama and French (1993) factors for the previous month.
  • 41 Table IA.XIII in the Internet Appendix lists the anomalies included in the subgroups.
  • 42 The conventional OLS forecasts based on the subgroups are always negative and large in magnitude, and thus overfitting remains a serious problem.
  • 43 The short fee is based on the cost-to-borrow score from Markit (https://ihsmarkit.com/index.html). Short fee data are available beginning in 2002:01.
  • 44 We thank Vikas Agarwal, Wei Jiang, Yuehua Tang, and Baozhong Yang for providing information for identifying hedge funds.
  • 45 We thank Charles Calomiris and Harry Mamaysky for providing updated data through 2017.
  • 46 Another important question is how anomalies are related across countries and impact global equity markets collectively. Dong et al. (2021) pursue this line of research.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.