Anomalies and the Expected Market Return
異常現象和預期市場回報
Xi Dong is at the City University of New York Baruch College. Yan Li is at the Southwestern University of Finance and Economics. David E. Rapach is at Saint Louis University. Guofu Zhou is at Washington University in St. Louis. Rapach was a visiting professor at Washington University in St. Louis when this project was undertaken. We thank Daniel Andrei, Svetlana Bryzgalova, John Cochrane, Andrew Detzel, Victor DeMiguel, Michael Hasler, Dashan Huang, Lawrence Jin, Sicong (Allen) Li, Lin Peng, Joel Peress, Kelly Shue, Jack Strauss, Yajun Wang, Xuemin (Sterling) Yan, and Lu Zhang; as well as conference and seminar participants at the 2020 FinTech Conference in China, Chinese University of Hong Kong-Shenzhen, City University of New York Baruch College, London Business School, Syracuse University, University of Bath, University of North Carolina at Charlotte, University of Liverpool, and Washington University in St. Louis, for helpful comments. We are especially grateful to the editor (Stefan Nagel), associate editor, and two referees for insightful and detailed comments that substantially improved the paper. We have read The Journal of Finance disclosure policy and have no conflicts of interest to disclose.
董奚就讀於紐約市立大學巴魯克學院。李岩就讀於西南財經大學。 David E. Rapach 在聖路易大學。週國富就讀聖路易華盛頓大學。當這個計畫進行時,拉帕奇是聖路易斯華盛頓大學的客座教授。我們感謝Daniel Andrei、Svetlana Bryzgalova、John Cochrane、Andrew Detzel、Victor DeMiguel、Michael Hasler、Dashan Huang、Lawrence Jin、Sicong (Allen) Li、Lin Peng、Joel Peress、Kelly Shue、Jack Strauss、Yajun Wang、Xuemin (Sterling ) ) 嚴和張璐;以及2020年中國金融科技大會的會議和研討會參與者、香港中文大學(深圳)、紐約市立大學巴魯克學院、倫敦商學院、雪城大學、巴斯大學、北卡羅萊納大學夏洛特分校、大學利物浦大學和聖路易斯華盛頓大學的有用評論。我們特別感謝編輯(Stefan Nagel)、副主編和兩位審稿人富有洞察力和詳細的評論,這些評論極大地改進了論文。我們已閱讀《金融雜誌》的揭露政策,沒有需要揭露的利益衝突。
ABSTRACT 抽象的
We provide the first systematic evidence on the link between long-short anomaly portfolio returns—a cornerstone of the cross-sectional literature—and the time-series predictability of the aggregate market excess return. Using 100 representative anomalies from the literature, we employ a variety of shrinkage techniques (including machine learning, forecast combination, and dimension reduction) to efficiently extract predictive signals in a high-dimensional setting. We find that long-short anomaly portfolio returns evince statistically and economically significant out-of-sample predictive ability for the market excess return. The predictive ability of anomaly portfolio returns appears to stem from asymmetric limits of arbitrage and overpricing correction persistence.
我們提供了關於多空異常投資組合回報(橫斷面文獻的基石)與市場總超額回報的時間序列可預測性之間聯繫的第一個系統證據。利用文獻中的 100 個代表性異常,我們採用各種收縮技術(包括機器學習、預測組合和降維)來有效提取高維度環境中的預測訊號。我們發現,多空異常投資組合報酬率表現出對市場超額報酬的統計和經濟顯著的樣本外預測能力。異常投資組合報酬的預測能力似乎源自於套利的不對稱限制和定價過高修正的持續性。
Stock return predictability is a fundamental topic in finance. Two major—and voluminous—lines of research focus on this subject. The first examines whether firm characteristics can predict the cross-sectional dispersion in stock returns. These studies identify numerous equity market anomalies (e.g., Fama and French (2015), Harvey, Liu, and Zhu (2016), McLean and Pontiff (2016), Hou, Xue, and Zhang (2020)). The second line of research investigates the time-series predictability of the aggregate market excess return based on a variety of economic and financial variables, such as valuation ratios, interest rates, and inflation (e.g., Nelson (1976), Campbell (1987), Fama and French (1988, 1989), Pástor and Stambaugh (2009)). Studies in this vein attempt to shed light on the variables that affect the equity risk premium.1
股票回報的可預測性是金融領域的一個基本議題。有兩個主要且大量的研究領域集中在這個主題上。第一項研究檢視公司特徵是否可以預測股票報酬的橫斷面離散度。這些研究發現了許多股票市場異常現象(例如,Fama 和 French ( 2015 )、Harvey、Liu 和 Zhu ( 2016 )、McLean 和 Pontiff ( 2016 )、Hou、Xue 和 Zhu ( 2020 ))。第二條研究線是基於各種經濟和金融變量,例如估值比率、利率和通貨膨脹,調查總市場超額收益的時間序列可預測性(例如,Nelson( 1976 ),Campbell( 1987 ), Fama 和French ( 1988 , 1989 ), Pástor 和 Stambaugh ( 2009 ))。這方面的研究試圖揭示影響股票風險溢酬的變數。 1
In this paper, we investigate whether these two leading lines of the finance literature are linked. Specifically, we analyze the ability of long-short anomaly portfolio returns from the cross-sectional literature to predict the market excess return. Our investigation has a number of key features. First, we focus on a cornerstone of the cross-sectional literature, namely, long-short anomaly portfolio returns, as the literature often uses such returns as evidence of cross-sectional mispricing. Second, we employ out-of-sample tests, since such tests provide the most rigorous and relevant evidence on stock return predictability (e.g., Goyal and Welch (2008), Martin and Nagel (2021)). Third, we examine the predictive ability of a large number (100) of long-short anomaly portfolio returns representative of those from the cross-sectional literature and simultaneously aggregate the information in the set of anomaly portfolio returns. Because accommodating a large number of anomaly portfolio returns poses problems for the conventional multiple predictive regression approach, we apply a variety of shrinkage techniques—including machine learning, forecast combination, and dimension reduction—to guard against overfitting the data in a high-dimensional setting. Fourth, we explore economic rationales for the ability of long-short anomaly portfolio returns to predict the market return, thereby complementing our statistical findings with economic analysis.
在本文中,我們研究了金融文獻的這兩條主線是否有關聯。具體來說,我們從橫斷面文獻中分析了多空異常投資組合收益預測市場超額收益的能力。我們的調查有許多關鍵特徵。首先,我們關注橫斷面文獻的基石,即多空異常投資組合收益,因為文獻經常使用此類收益作為橫斷面錯誤定價的證據。其次,我們採用樣本外測試,因為此類測試提供了關於股票收益可預測性的最嚴格和相關的證據(例如,Goyal 和 Welch ( 2008 )、Martin 和 Nagel ( 2021 ))。第三,我們檢查了代表橫斷面文獻的大量(100)多空異常投資組合收益的預測能力,並同時匯總了異常投資組合收益集中的資訊。由於容納大量異常投資組合回報會為傳統的多重預測迴歸方法帶來問題,因此我們應用了各種收縮技術(包括機器學習、預測組合和降維)來防止高維度設定中的資料過度擬合。第四,我們探討多空異常投資組合報酬預測市場報酬能力的經濟原理,進而以經濟分析補充我們的統計結果。
Empirically, we find that the information in the group of 100 long-short anomaly portfolio returns is indeed useful for forecasting the monthly market excess return on an out-of-sample basis, provided that we rely on strategies that guard against overfitting the data. The out-of-sample
Economically, the predictive power of long-short anomaly portfolio returns for the market return can be explained via asymmetric mispricing correction persistence (MCP), which arises from asymmetric limits of arbitrage (Shleifer and Vishny (1997)). To analyze the implications of MCP for market return predictability, we specify a data-generating process with stationary under- and overpricing components in the prices of the long and short legs, respectively, of the anomaly portfolio.2 Consider, for example, the short leg. We show that when the return momentum effect from the correction of old overpricing shocks dominates the return reversal effect from the immediate correction of a new overpricing shock, the short-leg return in period
從經濟上講,多空異常投資組合收益對市場收益的預測能力可以透過不對稱錯誤定價修正持續性(MCP)來解釋,MCP是由套利的不對稱限制引起的(Shleifer和Vishny( 1997 )) 。為了分析 MCP 對市場回報可預測性的影響,我們指定了一個資料產生過程,其中異常投資組合的多頭和空頭價格分別具有固定的定價過低和過高的成分。 2例如,考慮短腿。我們表明,當舊的定價過高衝擊修正的回報動量效應主導新的定價過高衝擊立即修正的回報反轉效應時,短期回報
與期間市場回報呈正相關
。直觀上看,時期
短腿回報衡量期末異常特徵所識別的定價過高的糾正
(Akbas 等人( 2015 年),Engelberg、McLean 和 Pontiff( 2018 年))。如果定價過高修正過程夠持久,那麼短線報酬將存在正序列依賴性;因為短邊收益是市場收益的一部分,這意味著短邊收益與未來市場收益呈正相關。短邊回報也可能表明短邊以外的細分市場定價過高,從而放大對市場回報的影響。事實上,我們提供的經驗證據表明,短期回報包含了大部分市場的相關定價過高資訊。類似的推理解釋了長腿回報如何與未來市場回報呈正相關。
Our empirical results indicate that long- and short-leg returns are positively related to the future market return. However, the out-of-sample predictive ability of long-leg returns is weak, indicating that MCP is considerably stronger for overpricing vis-à-vis underpricing. Consequently, long-short anomaly returns strongly negatively predict the market return. Via recursive regressions, we find evidence that such negative relationships remain important over time. Stronger MCP for overpricing relative to underpricing is consistent with arbitrage with respect to overpriced shares being less aggressive, due to factors such as short-sale constraints (Miller (1977)), feedback effects (Edmans, Goldstein, and Jiang (2015)), and price drops coupled with declines in liquidity (Dong, Krystyniak, and Peng (2019)). In line with relatively strong overpricing correction persistence, evidence suggests that overvaluation, as reflected in the short legs of anomaly portfolios, drives the profitability of many anomaly returns (e.g., Hong, Lim, and Stein (2000), Stambaugh, Yu, and Yuan (2012, 2015), Avramov et al. (2013)).
我們的實證結果表明,多頭和空頭回報與未來市場回報呈正相關。然而,長腿回報的樣本外預測能力較弱,這表明 MCP 對於定價過高相對於定價過低的預測能力要強得多。因此,多空異常報酬對市場報酬有強烈的負面預測。透過遞歸回歸,我們發現證據表明這種負面關係隨著時間的推移仍然很重要。由於賣空限制(Miller( 1977 ))、反饋效應(Edmans、Goldstein 和 Jiang( 2015 ))等因素,定價過高的MCP 相對於定價過低而言更強,這與定價過高股票的套利行為不那麼激進是一致的。與相對較強的高估修正持續性相一致,有證據表明,高估(反映在異常投資組合的空頭中)推動了許多異常回報的盈利能力(例如,Hong,Lim和Stein( 2000 ),Stambaugh ,Yu和Yuan) ( 2012 年, 2015 年),阿夫拉莫夫等人( 2013 年))。
If MCP is stronger in the short leg compared to the long leg, it may seem that using short-leg anomaly portfolio returns instead of long-short anomaly returns would work better for forecasting the market return. However, we show econometrically that long-short anomaly returns can provide a stronger predictive signal for the market return. Intuitively, the long- and short-leg returns contain a common component unrelated to the future market return. By taking the difference between the long- and short-leg returns, we filter the noise in the predictor variable and thereby provide a sharper signal for anticipating the market return; such filtering is akin to alleviating the errors-in-variables problem. We also show that aggregating information across long-short anomaly returns can further sharpen the predictive signal by filtering the idiosyncratic noise in the individual predictors.
如果短邊的 MCP 比長邊更強,那麼似乎使用短邊異常投資組合收益而不是多空異常收益來預測市場收益會更有效。然而,我們從計量經濟學角度表明,多空異常收益可以為市場收益提供更強的預測訊號。直觀上,多頭和空頭回報包含與未來市場回報無關的共同成分。透過獲取多頭和空頭回報之間的差異,我們過濾了預測變數中的噪聲,從而為預測市場回報提供了更清晰的訊號;這種過濾類似於減輕變數錯誤問題。我們還表明,聚合多空異常收益的資訊可以透過過濾各個預測變數中的特殊雜訊來進一步銳化預測訊號。
Methodologically, the most straightforward approach for incorporating the information in a large set of potential predictors is to specify a multiple predictive regression that includes all of the lagged predictors as explanatory variables. However, conventional ordinary least squares (OLS) estimation of a high-dimensional predictive regression is highly susceptible to overfitting. By construction, OLS maximizes the fit of the model over the in-sample estimation period, which can lead to poor out-of-sample performance; intuitively, OLS is prone to misinterpreting noise in the data for a predictive signal. Because we are interested in forecasting the monthly market excess return, which inherently contains a small predictable component, we are operating in a quite noisy environment that exacerbates the danger of overfitting. As anticipated, we find that the conventional forecast based on OLS estimation of the multiple predictive regression that includes all of the long-short anomaly returns exhibits symptoms of extreme overfitting: The forecast is highly volatile and substantially less accurate than the prevailing mean benchmark.
從方法上來說,將資訊納入大量潛在預測變數的最直接方法是指定多重預測迴歸,其中包括所有滯後預測變數作為解釋變數。然而,高維預測迴歸的傳統普通最小平方法 (OLS) 估計很容易出現過度擬合。透過構造,OLS 最大化模型在樣本內估計期間的適合度,這可能會導致樣本外表現不佳;直觀上,OLS 很容易將資料中的雜訊誤解為預測訊號。因為我們感興趣的是預測每月的市場超額收益,它本身包含一個小的可預測成分,所以我們在一個相當嘈雜的環境中操作,這加劇了過度擬合的危險。正如預期的那樣,我們發現基於包含所有多空異常收益的多重預測回歸的 OLS 估計的傳統預測表現出極端過度擬合的症狀:預測波動性很大,並且遠低於普遍的平均基準準確度。
We use a variety of forecasting strategies to guard against overfitting high-dimensional predictive regressions, all of which essentially rely on shrinkage. The first employs the elastic net (ENet; Zou and Hastie (2005)), a refinement of the well-known least absolute shrinkage and selection operator (LASSO; Tibshirani (1996)), to estimate the forecasting model. The LASSO and ENet are machine-learning techniques that use penalized regression to directly shrink the parameter estimates and thereby avoid overfitting the data.3 We also consider forecast combination (Bates and Granger (1969)). We use a simple combination approach that takes the arithmetic mean of univariate predictive regression forecasts based on the individual predictors (Rapach, Strauss, and Zhou (2010)), as well as a refinement due to Rapach and Zhou (2020) and Han et al. (2021) that incorporates insights from Diebold and Shin (2019) by using machine-learning techniques to select the individual forecasts to include in the combination forecast. Finally, we employ dimension-reduction techniques to combine the predictors into a single variable, which subsequently serves as the explanatory variable in a univariate predictive regression. We consider three dimension-reduction techniques: The first takes the cross-sectional average of the individual predictors, the second extracts the first principal component from the set of predictors (Ludvigson and Ng (2007, 2009)), and the third, an implementation of partial least squares (PLS; Wold (1966)), extracts the first target-relevant factor from the set of predictors (Kelly and Pruitt (2013, 2015), Huang et al. (2015)). As indicated previously, the forecasts based on strategies designed to circumvent overfitting all produce statistically and economically significant
我們使用各種預測策略來防止過度擬合高維預測迴歸,所有這些策略本質上都依賴收縮。第一個採用彈性網絡(ENet;Zou 和 Hastie( 2005 )),這是對眾所周知的最小絕對收縮和選擇算子(LASSO;Tibshirani( 1996 ))的改進,來估計預測模型。 LASSO 和 ENet 是機器學習技術,它們使用懲罰回歸來直接縮小參數估計,從而避免過度擬合資料。 3我們也考慮了預測組合(Bates 和 Granger ( 1969 ))。我們使用簡單的組合方法,該方法採用基於各個預測變數的單變量預測回歸預測的算術平均值(Rapach、Strauss 和 Zhou( 2010 )),以及 Rapach 和 Zhou( 2020 )以及 Han 等人的改進。 ( 2021 ) 結合了 Diebold 和 Shin ( 2019 ) 的見解,透過使用機器學習技術來選擇要包含在組合預測中的單一預測。最後,我們採用降維技術將預測變數組合成單一變量,該變數隨後充當單變量預測迴歸中的解釋變數。 我們考慮三種降維技術:第一個採用各個預測變數的橫斷面平均值,第二個從預測變數集中提取第一個主成分(Ludvigson 和 Ng ( 2007 , 2009 )),第三個是實現偏最小平方法(PLS;Wold( 1966 )),從預測變數集中提取第一個目標相關因子(Kelly 和Pruitt ( 2013,2015 ),Huang 等人( 2015 ))。如前所述,基於旨在避免過度擬合的策略的預測都產生了統計和經濟上的顯著影響
To further assess the relevance of asymmetric limits of arbitrage for generating our findings, we form subgroups of anomalies using three proxies for limits of arbitrage: bid-ask spread, idiosyncratic volatility, and market capitalization. Specifically, we use the proxies to construct subgroups of anomalies characterized by more stringent limits of arbitrage in their short legs vis-à-vis their long legs. To the extent that the proxies capture limits of arbitrage, we expect the subgroups of long-short anomaly portfolio returns that consist of anomalies with relatively strong (weak) limits of arbitrage in their short legs to display more (less) out-of-sample predictive ability for the market return. This is what we find.
為了進一步評估不對稱套利限制與得出我們的研究結果的相關性,我們使用三個套利限制代理形成異常子組:買賣價差、特殊波動性和市值。具體來說,我們使用代理來建立異常子組,其特徵是短腿相對於長腿的套利限制更嚴格。就代理捕獲套利限製而言,我們預期多空異常投資組合收益的子組(由短邊套利限制相對強(弱)的異常組成)將顯示更多(更少)樣本外對市場回報的預測能力。這就是我們發現的。
Limits of arbitrage emerge from a variety of frictions (Gromb and Vayanos (2010)). In the models of Gârleanu and Pedersen (2013, 2016) and Dong, Kang, and Peress (2020), frictions such as limited risk-bearing capacity and transactions costs induce arbitrageurs to correct mispricing only slowly, resulting in MCP. To examine the relevance of limits of arbitrage, we analyze how various frictions are related to the out-of-sample predictive ability of long-short anomaly portfolio returns. Intuitively, we expect arbitrage activity to be more constrained during periods when frictions are more acute, implying stronger out-of-sample predictive power for long-short anomaly returns during such periods. We consider a number of variables as proxies for frictions that can affect MCP in anomaly portfolios (especially in the short legs), including aggregate liquidity (Pástor and Stambaugh (2003)), idiosyncratic risk (Ang et al. (2006), Pontiff (2006)), trading noise (Hu, Pan, and Wang (2013)), the VIX, economic uncertainty and risk aversion indices (Jurado, Ludvigson, and Ng (2015), Bekaert, Engstrom, and Xu (2021)), and short costs (Asness et al. (2018)). Consistent with stronger MCP for overpricing vis-à-vis underpricing, we detect significant increases in out-of-sample market return predictability during periods of heightened frictions.
套利的限制源自於各種摩擦(Gromb 和 Vayanos ( 2010 ))。在Gârleanu和Pedersen( 2013,2016 )以及Dong、Kang和Peress( 2020 )的模型中,有限的風險承受能力和交易成本等摩擦導致套利者只能緩慢地糾正錯誤定價,從而導致MCP。為了檢驗套利限制的相關性,我們分析了各種摩擦與多空異常投資組合報酬的樣本外預測能力之間的關係。直觀上,我們預計,在摩擦更加嚴重的時期,套利活動會受到更多限制,這意味著在該時期,多空異常回報的樣本外預測能力更強。我們考慮了許多變數作為摩擦的代理,這些摩擦可能影響異常投資組合中的 MCP(尤其是短邊),包括總流動性(Pástor 和 Stambaugh( 2003 ))、特殊風險(Ang 等人( 2006 )、 Pontiff( 2006 ))、交易噪音(Hu、Pan 和 Wang( 2013 ))、VIX、經濟不確定性和風險厭惡指數(Jurado、Ludvigson 和 Ng( 2015 )、Bekaert、Engstrom 和 Xu( 2021 )),以及短期成本(Asness 等人( 2018 ))。與定價過高相對於定價過低的更強的 MCP 一致,我們發現在摩擦加劇期間樣本外市場回報的可預測性顯著增加。
Finally, we provide more direct evidence that the predictive power of long-short anomaly portfolio returns for the market return stems from slow arbitrage, especially with respect to overpricing. Following Chen, Da, and Huang (2019), we construct net arbitrage positions in the overall market by aggregating the value-weighted trades of hedge funds and short sellers. We find that an increase in long-short anomaly portfolio returns predicts a statistically and economically significant decrease in net arbitrage positions in the market, primarily due to increases in short positions. We also examine the tone of public news about U.S. financial markets extracted from news articles from Thomson Reuters by Calomiris and Mamaysky (2019). We find that an increase in long-short anomaly returns leads to more negative market news tone.
最後,我們提供了更直接的證據,表明多空異常投資組合收益對市場收益的預測能力源於緩慢套利,特別是在定價過高方面。繼Chen、Da和Huang( 2019 )之後,我們透過匯總對沖基金和賣空者的價值加權交易來建構整個市場的淨套利部位。我們發現,多空異常投資組合收益的增加預示著市場中淨套利頭寸在統計和經濟上的顯著下降,這主要是由於空頭頭寸的增加。我們也研究了從 Calomiris 和 Mamaysky 的湯森路透新聞文章中提取的有關美國金融市場的公共新聞的語氣( 2019 年)。我們發現,多空異常報酬的增加會導致更多負面的市場消息基調。
Our study is related to Engelberg et al. (2021), which presently is the only other study to systematically investigate connections between cross-sectional and time-series stock return predictability. Engelberg et al. (2021) follow the existing approach in the time-series literature by using the (equal- or value-weighted) average of firm-level values for a given characteristic as a predictor of the market return. Taking into account a large number of anomaly characteristics from the cross-sectional literature, they find little evidence that characteristics on average have out-of-sample time-series predictive ability for the market return. In contrast, we introduce a new approach that relies on long-short anomaly portfolio returns—a bedrock of the cross-sectional literature—to predict the market return, and we find strong evidence that the aggregate information in long-short anomaly returns is valuable for forecasting the market return. The two papers complement each other by shedding light on the relationship between cross-sectional and time-series stock return predictability.4
我們的研究與 Engelberg 等人相關。 ( 2021 ),這是目前唯一一項系統性研究橫斷面股票報酬可預測性和時間序列股票報酬可預測性之間關聯的研究。恩格伯格等人。 ( 2021 )遵循時間序列文獻中的現有方法,使用給定特徵的公司層面價值的(等權或價值加權)平均值作為市場回報的預測因子。考慮到橫斷面文獻中的大量異常特徵,他們發現幾乎沒有證據表明特徵平均對市場回報具有樣本外時間序列預測能力。相較之下,我們引入了一種依賴多空異常投資組合回報(橫斷面文獻的基石)來預測市場回報的新方法,並且我們發現強有力的證據表明多空異常回報中的匯總資訊是有價值的用於預測市場回報。這兩篇論文透過闡明橫斷面股票收益預測性和時間序列股票收益預測性之間的關係來相輔相成。 4
The remainder of the paper is organized as follows. Section I provides intuition on the predictive power of long-short anomaly portfolio returns for the market return. Section II describes the construction and evaluation of the out-of-sample forecasts. Section III presents the data, while Section IV reports forecast performance results. Section V provides additional results pertaining to asymmetric limits of arbitrage. Section VI concludes.
本文的其餘部分組織如下。第一節提供了多空異常投資組合報酬對市場報酬的預測能力的直覺。第二節描述了樣本外預測的建構和評估。第三部分介紹了數據,第四部分報告了預測績效結果。第五節提供了與不對稱套利限制有關的其他結果。第六節總結。
I. Intuition on Predictability
I. 對可預測性的直覺
In this section, we use a stylized data-generating process to provide intuition on the predictive ability of anomaly portfolio returns for the market return.5
在本節中,我們使用程式化的資料產生流程來直觀地了解異常投資組合回報對市場回報的預測能力。 5
A. Data-Generating Process
A. 資料產生過程
假設異常投資組合的多頭和空頭價格包含一個共同的鞅成分,週期為
,而長邊(短邊)價格包含固定成分
)反映了定價過低(定價過高)的水平,與共同成分不相關。直觀上,錯誤定價意味著價格中存在固定成分:假設錯誤定價最終會糾正,任何錯誤定價衝擊只會暫時影響價格,儘管糾正過程可能會持續多個時期。每條腿的對數回報(以價格變動而言)由下式給出
) 是長腿(短腿)回報,
是錯誤定價的變化,並且
。使用方程式( 1 ),多空異常投資組合收益
為了方便說明,假設多頭和空頭共同構成市場。然後
是(等權重)市場回報。 6
Equation (4) provides a general representation for the mispricing component in each leg, as any stationary autoregressive moving-average (ARMA) process can be expressed via the infinite-order moving-average (MA) process. The equation expresses the current-period level of mispricing as a function of current and past mispricing shocks. The MA process can be interpreted as an impulse-response function:
B. Mispricing Correction Persistence
In sum, two opposing effects determine
The previous example (
C. Noise Reduction
D. Extensions
For expositional ease, we assume that the long and short legs cover the market in equation (3). When the long and short legs are the last and first decile portfolios, respectively—as in our empirical application—the same intuition applies. We address this issue empirically in Section V.B, where we show that the predictive ability of long-short anomaly returns extends across multiple market segments. Thus, our intuition behind market return predictability based on long-short anomaly returns extends beyond serial dependence in the long and short legs (especially the latter) and includes the relevance of long-short anomaly returns for signaling mispricing correction more generally across the market.12
II. Methodology
This section describes the construction of the out-of-sample forecasts and their evaluation using both statistical and economic criteria.
A. Forecast Construction
Suppose that we want to forecast the market excess return (
The prevailing mean forecast, which implicitly assumes that the market excess return is unpredictable (apart from its mean value), is the most popular benchmark in the literature. The prevailing mean forecast is simply the average of the market excess return observations available at the time of forecast formation. Because the predictable component in the monthly market excess return is inherently small (i.e., return data are quite noisy), the prevailing mean forecast is difficult to beat in practice (e.g., Goyal and Welch (2008)). With this in mind, successful out-of-sample strategies effectively shrink the market excess return forecast toward the prevailing mean benchmark to reduce the likelihood of overreacting to the noise in return data.
- Conventional OLS: The conventional OLS forecast is based on a fitted multiple predictive regression that includes all of the lagged predictors as explanatory variables. Although it is straightforward to compute, the conventional OLS forecast is likely to perform poorly in practice, especially when the number of predictors is large. By construction, conventional OLS estimation maximizes the fit of the model (i.e., the in-sample
statistic) over the estimation sample, which can lead to in-sample overfitting and poor out-of-sample performance, especially for high-dimensional models. The inherently large unpredictable component in the market excess return exacerbates the overfitting problem. - ENet: The ENet forecast is based on the multiple predictive regression fitted via the ENet instead of OLS. The ENet (Zou and Hastie (2005)) relies on penalized regression to guard against overfitting. The ENet penalty term includes both
(LASSO) and (ridge; Hoerl and Kennard (1970)) components. The component permits shrinkage to zero, so that the ENet performs variable selection. Based on Flynn, Hurvich, and Simonoff (2013), we use the Hurvich and Tsai (1989) corrected version of the Akaike (1973) information criterion to select the value of the regularization parameter governing the degree of shrinkage. The ENet directly addresses overfitting by shrinking the slope coefficients of the fitted model, which results in shrinking the forecast toward the prevailing mean benchmark. - Simple Combination: Instead of OLS estimation of the multiple predictive regression, forecast combination begins by computing a set of forecasts based on OLS estimation of univariate predictive regressions that include each lagged predictor (considered in turn). The simple combination forecast is the arithmetic mean of the individual univariate forecasts. Rapach, Strauss, and Zhou (2010) show that the simple combination forecast exerts a strong shrinkage effect.
- Combination ENet: When the number of predictors is large, the simple combination forecast can be too conservative in the sense that it “overshrinks” the forecast toward the prevailing mean, thereby neglecting too much of the relevant information in the predictor variables. Using insights from Diebold and Shin (2019), Rapach and Zhou (2020) and Han et al. (2021) employ the ENet to refine the simple combination forecast. Instead of averaging across all of the individual univariate predictive regression forecasts, the combination ENet (C-ENet) forecast takes the average of the individual forecasts selected by the ENet in a Granger and Ramanathan (1984) multiple regression relating the actual market excess return to the individual univariate forecasts.
- Predictor Average: An alternative strategy for guarding against overfitting is to first combine the predictors themselves into a small number of variables and then use the reduced set of variables as predictors in a low-dimensional predictive regression. Intuitively, as discussed in Section I.D, we consolidate the predictors to filter the noise in the individual predictors. The predictor average forecast is based on OLS estimation of a univariate predictive regression in which the lagged cross-sectional average of the predictors serves as the explanatory variable.
- Principal Component: We can also combine the predictors by extracting the first principal component from the set of predictors. The principal component forecast uses the lagged principal component as the explanatory variable in a univariate predictive regression estimated via OLS.
- PLS: The first principal component explains as much variation as possible in the predictors themselves. However, from a forecasting standpoint, we are interested in explaining the target variable. Instead of extracting a factor that explains as much of the variation in the predictors as possible, Kelly and Pruitt (2013, 2015) develop a three-pass regression filter to construct a target-relevant factor from a set of predictors that is maximally correlated with the target variable. The lagged target-relevant factor then serves as the explanatory variable in a univariate predictive regression estimated via OLS. The three-pass regression filter is essentially a version of PLS.
B. Forecast Evaluation—Statistical Accuracy
C. Forecast Evaluation—Economic Value
III. Data
We consider 100 long-short anomaly portfolio returns that are representative of anomalies from the cross-sectional literature and that can be replicated using publicly available data from CRSP, Compustat, and I/B/E/S. The anomalies correspond to numerous categories, such as value versus growth, profitability, investment, issuance activity, momentum, and trading frictions. Table IA.I in the Internet Appendix lists the 100 anomalies.16
For each anomaly, at the beginning of each month we sort stocks into value-weighted decile portfolios based on the relevant characteristic. We consider all stocks traded on the NYSE, AMEX, and NASDAQ, after excluding stocks with a price below $5. The long-short anomaly portfolio goes long (short) the tenth (first) decile portfolio, where the long (short) leg is expected to generate relatively high (low) returns. We exclude anomalies that are interactions of two separate signals (since such anomalies are essentially based on multiple anomalies) and those that are indicator variables (such as IPOs). We also use anomalies that have a nonmissing return starting in 1985.17 The market excess return is the CRSP value-weighted market return minus the risk-free return (also from CRSP).
The sample period spans 1970:01 to 2017:12. Table I reports summary statistics for the anomaly portfolio returns. Of the 100 representative anomalies, 75, 71, 56, and 49 generate alphas with
The table reports summary statistics for monthly anomaly portfolio returns for 100 anomalies. The sample period is 1970:01 to 2017:12. For each anomaly, we sort stocks into value-weighted decile portfolios according to the characteristic underlying the anomaly. The long-short anomaly portfolio goes long (short) the tenth (first) decile portfolio. | |
---|---|
Number of anomalies | 100 |
Fama and French (1993) three-factor model alpha | |
Number of long-short anomaly portfolio returns with |
75 |
Number of long-short anomaly portfolio returns with |
71 |
Number of long-short anomaly portfolio returns with |
56 |
Number of long-short anomaly portfolio returns with |
49 |
Average correlation across anomaly decile rankings | 0.05 |
Average correlation across monthly anomaly excess returns | |
Long leg | 0.76 |
Short leg | 0.82 |
Long-short | 0.08 |
Long-leg anomaly portfolio excess returns | |
Average of sample means | 0.71% |
Average of sample standard deviations | 5.16% |
Short-leg anomaly portfolio excess returns | |
Average of sample means | 0.33% |
Average of sample standard deviations | 6.20% |
Long-short anomaly portfolio returns | |
Average of sample means | 0.38% |
Average of sample standard deviations | 4.37% |
On average, the long- and short-leg portfolio returns of any two anomalies are strongly correlated (0.76 and 0.82, respectively). In the context of Section I, this likely reflects the common component in the long- and short-leg returns unrelated to mispricing (
IV. Out-of-Sample Results
In constructing the out-of-sample forecasts, we use the first 10 years (1970:01 to 1979:12) of the full sample period as the initial in-sample estimation period. The subsequent five years (1980:01 to 1984:12) serve as the initial holdout out-of-sample period for computing the C-ENet forecast, so that 1985:01 to 2017:12 (396 observations) constitutes the out-of-sample period for forecast evaluation. Because the methods in Section II.A require nonmissing predictor data (with the exception of the predictor average), for an anomaly with a missing return in a given month, we fill in the missing return with the cross-sectional average for the available anomaly returns in that month.
A. Forecast Accuracy
Figure 1 depicts monthly market excess return forecasts based on the 100 long-short anomaly portfolio returns and strategies described in Section II.A. The conventional OLS forecast is highly volatile. In fact, the conventional OLS forecast is more than 6% (72% annualized) in magnitude for a number of months; such extreme forecasts point to severe overfitting. The other forecasts are substantially less volatile than the conventional OLS forecast. In other words, as intended, the ENet, simple combination, C-ENet, predictor average, principal component, and PLS strategies shrink the forecasts toward the prevailing mean (while still incorporating information from the 100 long-short anomaly portfolio returns). The simple combination forecast exerts a particularly strong shrinkage effect (as discussed in Section II.A). Excluding the conventional OLS forecast, the PLS forecast demonstrates the most volatility, with the volatilities for the ENet, C-ENet, predictor average, and principal component forecasts falling between those for the simple combination and PLS forecasts. We note that the market excess return forecasts in Figure 1 are typically more volatile around business-cycle recessions, which indicates that the long-short anomaly portfolio returns themselves are more volatile around recessions.

Table II reports
The table reports Campbell and Thompson (2008) out-of-sample |
|||||||
---|---|---|---|---|---|---|---|
(1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) |
Anomaly | |||||||
Portfolio | OLS | ENet | Combine | C-ENet | Avg | PC | PLS |
Long-short | −2,513.86 | 2.03** | 0.89*** | 2.81*** | 1.89** | 1.25** | 2.06*** |
return | |||||||
Long-leg | −344,960.22 | −0.90 | 0.29 | −0.68 | 0.26 | 0.24 | 0.41 |
excess return | |||||||
Short-leg | −13,284.68 | 1.81* | 0.72* | 0.39* | 0.75* | 0.74* | 0.84* |
excess return |
Overall, we find that the information in the group of 100 long-short anomaly portfolio returns is quite useful for predicting the monthly market excess return, provided that we employ forecasting strategies that guard against overfitting. The fact that the ENet, simple combination, C-ENet, predictor average, principal component, and PLS forecasts all generate statistically and economically significant improvements in out-of-sample accuracy indicates that our results are robust and not overly reliant on a particular method for circumventing overfitting.
Table II also reports results for monthly market excess return forecasts based on long- and short-leg anomaly portfolio excess returns. A clear pattern emerges: The short-leg excess returns provide some evidence of statistically and economically significant predictive ability, while there is no statistically or economically significant evidence of predictive ability for the long-leg excess returns. As discussed in Section I.B, this pattern is consistent with asymmetric limits of arbitrage and stronger MCP for overpricing vis-à-vis underpricing.
Comparing the results in Table II for the long-short and short-leg excess returns, the former perform better than the latter for forecasting the monthly market excess return. As explained in Section I.C (and supported by the average volatilities reported in Table I), the long-short return can provide a sharper predictive signal when the long and short legs contain a sizable common component that is unrelated to the future market excess return, as the long-short return filters the common component to produce a less-diluted signal.
B. Economic Value
As described in Section II, we also measure the marginal economic benefit of the predictive ability of long-short anomaly portfolio returns for a mean-variance investor with a relative risk aversion coefficient of three who allocates between the market portfolio and risk-free Treasury bills. Figure 2 plots log cumulative excess returns for portfolios using market excess return forecasts based on the 100 long-short anomaly returns. The figure also depicts the log cumulative excess return for the portfolio based on the prevailing mean benchmark forecast, as well as that for the market portfolio. Figure 2 reveals that portfolios that incorporate the information in the 100 long-short anomaly returns generally exhibit superior performance compared to both the portfolio based on the prevailing mean benchmark forecast and the market portfolio.

Figure 2 also shows annualized average utility gains when the investor uses a competing forecast of the market excess return in lieu of the prevailing mean benchmark. For the conventional OLS forecast based on the 100 long-short anomaly returns (not shown in the figure), there is a large loss of −4.97%, which further manifests the overfitting problem for the conventional approach. In contrast, the forecasts designed to guard against overfitting all generate substantial utility gains. The smallest annualized gain obtains for the simple combination forecast (2.59%), which is still economically sizable.20 The annualized gain reaches as high as 6.38% for the PLS forecast, and it is above 600 basis points for the ENet and C-ENet forecasts (6.26% and 6.06%, respectively).
Although largely ignored by the market return predictability literature, it is important to assess the statistical significance of the average utility gains. To do so, we use the moving-block bootstrap procedure recommended by McCracken and Valente (2018) to compute critical values for testing
For the portfolios based on the ENet, C-ENet, predictor average, principal component, and PLS forecasts, the information in the long-short anomaly returns appears especially valuable for improving performance during the latter half of the Great Recession. It is worth noting that most of the 100 anomalies were published before that time, suggesting that the utility gains accruing to the information in the group of anomalies are not limited to the anomalies' prepublication periods.22
C. Additional Results
Because the long- and short-leg anomaly returns are components of the market return, the lagged market return itself potentially contains relevant information for forecasting the monthly market return. For the 1970:01 to 2017:12 sample period, the autocorrelation for the market excess return is 0.08 (significant at the 10% level). However, the lagged market excess return does not evince significant out-of-sample predictive ability for the 1985:01 to 2017:12 out-of-sample period. As explained in Section I.C, this is not surprising, since the market return is a noisier predictor than the long-short anomaly portfolio return.
Long-short anomaly portfolio returns also show stronger predictive power for the monthly market excess return than popular predictors from the literature for the 1985:01 to 2017:12 out-of-sample period. Table IA.VI in the Internet Appendix reports
To directly compare the information in the popular predictors to that in the long-short anomaly returns, Table IA.VIII in the Internet Appendix reports Harvey, Leybourne, and Newbold (1998) forecast encompassing test results for monthly market excess return forecasts based on the two sets of predictors. Forecasts based on the popular predictors do not encompass those based on the long-short anomaly returns, and thus the anomaly returns contain information useful for forecasting the monthly market excess return beyond that contained in the popular predictors. The information content of long-short anomaly returns therefore appears quite different from that of popular predictors from the literature.25 Moreover, as shown in Table II and Figure 2, the differential information in long-short anomaly returns delivers statistically and economically significant out-of-sample gains.
The out-of-sample predictive ability of long-short anomaly portfolio returns at longer horizons is also of interest. Table IA.X in the Internet Appendix reports
As an external validity test, we examine the out-of-sample predictive ability of long-short anomaly portfolio returns for industry excess returns.27 Specifically, we consider five industry portfolios (spanning the market) from Kenneth French's Data Library: Consumer, Manufacturing, Hi-Tech, Health, and Other.28 Table IA.XI in the Internet Appendix reports
As a final assessment of out-of-sample return predictability in this section, we explore the effects of the selection of the long-short portfolio returns used to forecast the market excess return.30 We begin with a set of more than 18,000 long-short portfolio returns from Yan and Zheng (2017), which are constructed by sorting on individual firm characteristics from Compustat.31 From the complete set of long-short portfolio returns, we form two subsets with insignificant and significant alphas.32 Based on the evidence in Yan and Zheng (2017), long-short portfolio returns with highly significant alphas are likely to be meaningful anomalies and not the result of data mining. We randomly draw two groups of 100 long-short portfolio returns from the respective subsets. For each group, we generate out-of-sample market excess return forecasts beginning in 1985:01 using the predictor average and compute the
Figure 3 presents histograms for the

V. Asymmetric Limits of Arbitrage
In this section, we further examine the relevance of asymmetric limits of arbitrage for market excess return predictability based on long-short anomaly portfolio returns.
A. Slope Coefficients
We first examine recursive estimates of the slope coefficients in the predictive regressions underlying the predictor average, principal component, and PLS forecasts. These predictive regressions provide a convenient means for capturing the information in all 100 of the long-short anomaly portfolio returns in a single slope coefficient.
Panels (a) to (c) of Figure 4 depict recursive estimates of the standardized slope coefficients in equation (35) and their 90% confidence intervals when the explanatory variable is

B. Market Segments
According to the intuition in Section I, an important part of the predictive ability of long-short anomaly portfolio returns stems from their lead-lag covariances with segments of the market return. To obtain insight into this mechanism, we examine the cross-autocovariances between long-short anomaly portfolio returns and decile excess returns. This allows us to investigate the relevance of the information contained in long-short anomaly returns for different segments of the market related to the anomalies.
To incorporate information from the entire group of 100 anomalies and construct market segments without overlapping stocks, we proceed as follows.35 For a given month, we sort stocks according to each anomaly characteristic (considered in turn). For each stock, we then take the average of its ranks across the anomalies; we take the average of the ranks to account for the fact that some stocks have missing observations for some characteristics in a given month. We then sort stocks into value-weighted decile portfolios according to their average ranks, with the first (tenth) decile constituting the most overvalued (undervalued) stocks. The long-short portfolio again goes long (short) the tenth (first) decile portfolio.
Panel (a) of Figure 5 plots estimates of

To further explore the relevance of overpricing correction persistence, we examine return predictability for portfolios of stocks that are most overvalued according to the predictor average.38 The predictor average (i.e., the cross-sectional average of the long-short anomaly portfolio returns) for a given month is effectively a portfolio of all of the individual available stocks for the month (although some stocks may receive zero weights). Each month, we back out the portfolio weights corresponding to the predictor average and sort stocks according to the predictor average weights. We then form a portfolio (NEG10-PA) that consists of the 10% of stocks with the largest negative weights (in magnitude) in the predictor average portfolio; the weights for the stocks in the NEG10-PA portfolio are proportional to the absolute values of their weights in the predictor average portfolio. Based on a univariate predictive regression, we then use the excess return on the NEG10-PA portfolio to predict the excess return on a portfolio of the same stocks with the same weights in the next month. For the 1985:01 to 2017:12 out-of-sample period, the
C. Subgroups
In this section, we generate out-of-sample forecasts based on subgroups of anomalies formed according to proxies for asymmetric limits of arbitrage.39 We form subgroups based on three proxies: bid-ask spread (BA), idiosyncratic volatility (IDIO), and market capitalization (SIZE).40 For a given month and anomaly characteristic, we first sort stocks into deciles as described in Section III and compute the average values for a given proxy for the stocks in the long and short legs. We then compute the long-leg average value for the proxy minus the short-leg average value for the proxy for each month. Finally, we compute the time-series average of the differences for 1970:01 to 1984:12. We denote the time-series averages for the differences for the bid-ask spread, idiosyncratic volatility, and size proxies by DTSA-BA, DTSA-IDIO, and DTSA-SIZE, respectively.
We form the subgroups BA-NEG, IDIO-NEG, and SIZE-NEG (BA-POS, IDIO-POS, and SIZE-POS), which are the subgroups of anomalies for which DTSA-BA, DTSA-IDIO, and DTSA-SIZE, respectively, are negative (positive). In line with our out-of-sample focus, we exclude data from the forecast evaluation period when determining the subgroups.41 Stocks with larger bid-ask spreads, greater idiosyncratic volatility, and smaller market capitalization are typically viewed as having stronger limits of arbitrage. We therefore expect greater market return predictability based on BA-NEG, IDIO-NEG, and SIZE-POS compared to BA-POS, IDIO-POS, and SIZE-NEG, respectively, as the former three subgroups represent anomalies with asymmetrically strong limits of arbitrage in their short legs vis-à-vis their long legs. To combine the information in the different proxies, we also form a subgroup that is the union of the anomalies in BA-NEG, IDIO-NEG, and SIZE-POS, as well as a subgroup that comprises the complement of the union. We expect the union subgroup to have stronger predictive ability relative to its complement.
Table III reports
The table reports Campbell and Thompson (2008) out-of-sample |
||||||
---|---|---|---|---|---|---|
(1) | (2) | (3) | (4) | (5) | (6) | (7) |
Subgroup | ENet | Combine | C-ENet | Avg | PC | PLS |
Panel A: Bid-Ask Spread | ||||||
BA-NEG | 3.57*** | 1.13*** | 3.65*** | 1.97** | 1.39** | 2.94*** |
BA-POS | −0.28 | 0.48** | −1.41 | 0.97** | 0.85* | 0.29** |
Panel B: Idiosyncratic Volatility | ||||||
IDIO-NEG | 1.63*** | 0.97*** | 2.43*** | 1.53** | 1.24** | 1.85*** |
IDIO-POS | 1.17* | 0.42** | −0.43 | −0.33 | 0.12 | −0.14 |
Panel C: Market Capitalization | ||||||
SIZE-NEG | 0.39 | 0.40** | −1.01 | 0.43 | 0.19 | −1.20 |
SIZE-POS | 2.39*** | 1.04** | 2.91*** | 1.60** | 1.15** | 1.59*** |
Panel D: Bid-Ask Spread, Idiosyncratic Volatility, Market Capitalization | ||||||
Union of BA-NEG, | 2.67*** | 0.98*** | 3.10*** | 1.93** | 1.23** | 2.12*** |
IDIO-NEG, | ||||||
SIZE-NEG | ||||||
Complement of union | 0.37 | 0.05 | −0.09 | −0.26 | −0.31 | −0.83 |
We also form subgroups via a double sort to identify subgroups of anomalies with both high limits of arbitrage overall and high asymmetric limits of arbitrage with respect to their short legs. To do so, for BA, we first identify anomalies with relatively low and high average values for the proxy for the stocks in their long and short legs (BA-LOW and BA-HIGH, respectively). For each of the BA-LOW and BA-HIGH subgroups, we then form NEG and POS subgroups as described above, resulting in four subgroups based on BA (BA-LOW-NEG, BA-LOW-POS, BA-HIGH-NEG, and BA-HIGH-POS). For compactness, we take the union of the BA-LOW-NEG, BA-LOW-POS, and BA-HIGH-POS subgroups to form the BA-REST group. We then compare the results between the BA-HIGH-NEG and BA-REST subgroups. In a similar manner, we form IDIO-HIGH-NEG and IDIO-REST (SIZE-LOW-POS and SIZE-REST) subgroups based on IDIO (SIZE). We expect the BA-HIGH-NEG, IDIO-HIGH-NEG, and SIZE-LOW-POS subgroups to exhibit stronger predictive ability compared to the BA-REST, IDIO-REST, and SIZE-REST subgroups, as the former three subgroups are the anomalies with both high limits of arbitrage overall and high asymmetric limits of arbitrage with respect to their short legs. As shown in Table IA.XIV in the Internet Appendix, this is indeed what we find.
D. Market Frictions
As Gârleanu and Pedersen (2013, 2016) and Dong, Kang, and Peress (2020) argue, frictions such as limited risk-bearing capacity and transaction costs induce arbitrageurs to slowly respond to mispricing, resulting in MCP. If our predictability finding is driven by arbitrageurs slowly correcting mispricing in the presence of asymmetric limits of arbitrage and stronger MCP for overpricing vis-à-vis underpricing, then long-short anomaly portfolio returns should contain more relevant information for predicting the market excess return during times of high frictions. We investigate this issue by testing for an increase in the
We consider a variety of proxies for market frictions from the literature. We begin with the level of and innovations to aggregate liquidity from Pástor and Stambaugh (2003). We also consider idiosyncratic volatility, which is widely believed to be a major implementation cost of short arbitrage (e.g., Pontiff (2006)). We measure aggregate idiosyncratic risk for a given month by first calculating the idiosyncratic volatility of individual stocks following Ang et al. (2006) and then computing the value-weighted average of the idiosyncratic volatilities for the individual stocks. Furthermore, we use trading noise (Hu, Pan, and Wang (2013)), which tracks the shortage in arbitrage capital via noise in Treasury security prices, and short fees (Asness et al. (2018)), which measure the cost of shorting stocks. For the latter, we again aggregate to the market level by computing the value-weighted average of short fees for individual stocks.43
In addition to the above variables, we consider risk, uncertainty, and risk aversion indices as proxies for the frictions affecting the risk-bearing capacity of arbitrageurs. Along with the VIX, we use macroeconomic, financial, and real uncertainty indices from Jurado, Ludvigson, and Ng (2015), who construct the indices by estimating stochastic volatility series for the residuals from fitted dynamic diffusion-index models based on macroeconomic and financial variables. Finally, we consider the jointly estimated economic uncertainty and risk aversion indices from Bekaert, Engstrom, and Xu (2021). For all of the proxies, we separate high- and low-friction regimes using the sample median.
Table IV reports differences in
The table reports percentage-point increases in Campbell and Thompson (2008) out-of-sample |
||||||
---|---|---|---|---|---|---|
(1) | (2) | (3) | (4) | (5) | (6) | (7) |
Friction | ENet | Combine | C-ENet | Avg | PC | PLS |
Aggregate liquidity | 4.48** | 0.86** | 5.20** | 3.92** | 3.64** | 9.23** |
Liquidity innovations | 6.53** | 1.42** | 6.56** | 7.82*** | 6.11*** | 13.21*** |
Idiosyncratic volatility | 2.73* | 1.06** | 4.21** | 4.29** | 3.57** | 2.05** |
Trading noise | 3.67** | 1.49*** | 5.11** | 8.12*** | 5.83*** | 12.42*** |
Short fees | 10.34* | 2.50** | 12.56* | 9.63** | 8.50** | 14.70** |
VIX | −0.25 | 1.26** | 4.87** | 6.06** | 6.10** | 8.93** |
Financial uncertainty | −1.28 | 0.02* | 1.18* | 3.24** | 2.80** | 3.94** |
Macro uncertainty | 1.20 | 0.95** | 3.28* | 3.97** | 4.09** | 7.21** |
Real uncertainty | 2.02* | 1.10** | 6.19** | 4.47** | 4.31** | 8.02*** |
Economic uncertainty | 1.29 | 0.88** | 3.93* | 3.67** | 3.48** | 3.59** |
Risk aversion uncertainty | 3.52** | 0.60** | 6.36** | 4.68** | 4.19** | 7.38** |
E. Arbitrage Trading
Next, we examine whether long-short anomaly portfolio returns predict arbitrageurs' trading in the broad market and the tone of market-wide news. If the overpricing correction signaled by long-short anomaly returns during month
We first examine changes in arbitrageurs' aggregate net position in the broad market, where we follow Chen, Da, and Huang (2019) in constructing the net arbitrage trading measure. Specifically, we compute aggregate value-weighted long positions for hedge funds across all stocks in the market, where hedge funds' long positions in individual stocks are based on quarterly reported holdings in the 13F database. Agarwal et al. (2013) identify hedge funds by combining the information in the 13F database and five hedge fund databases. The 13F holdings data cover the largest number of institutional investors, as all institutional investment managers that have discretion over $100 million or more in Section 13(f) securities need to report their quarter-end holdings in these securities. A 13F-filing institution is classified as a hedge fund if its major business is sponsoring and/or managing hedge funds according to information from various sources (such as institution websites, SEC filings, industry publications, and news articles). Our final sample consists of 1,710 unique hedge funds.44 We construct the value-weighted short-selling positions across all stocks in the market by aggregating short interest in individual stocks. A stock's short interest in a month is the total number of uncovered shares sold short for transactions settled on or before the 15th of the month (from Compustat) divided by the total number of shares outstanding (from CRSP). We use short interest from the last month of each quarter to match the quarterly data frequency for hedge-fund holdings. The sample period covers 1980:1 to 2017:4.
Since hedge funds' aggregate long positions and short interest display trends, we compute deviations in the long positions of hedge funds and short interest from trailing four-quarter MAs. Chen, Da, and Huang (2019) show that the difference between the long and short arbitrage trading measures (i.e., net arbitrage trading) effectively aggregates the actions of arbitrageurs.
Table V reports standardized slope coefficient estimates for predictive regressions that relate quarterly arbitrage trading measures to lagged long-short anomaly portfolio returns, where we use long-short anomaly returns from the last month in the quarter. As in Section V.A, to capture the aggregate information across the 100 anomalies in a single slope coefficient, we estimate predictive regressions with
The table reports ordinary least squares estimates of standardized slope coefficient estimates for univariate predictive regressions that use the strategy in the first column to combine the information in 100 long-short anomaly portfolio returns to predict the variable in the column heading. Predictor average uses the the cross-sectional average of the 100 long-short anomaly portfolio returns. Principal component (partial least squares) uses the first principal component (target-relevant factor) extracted from the 100 long-short anomaly portfolio returns. The sample period for the second through fourth columns is 1980:1 to 2017:4; the sample period for the fifth column is 1996:01 to 2017:12. The dependent variable is standardized in the fifth column; *, **, and *** indicate significance at the 10%, 5%, and 1% level, respectively. | ||||
---|---|---|---|---|
(1) | (2) | (3) | (4) | (5) |
Strategy | Arbitrage Net Market Position | Arbitrage Long Market Position | Arbitrage Short Market Position | Financial Market News Tone |
Predictor average | −5.81*** | −1.91 | 3.89** | −0.36*** |
Principal component | −4.81** | −1.18 | 3.63** | −0.32*** |
Partial least squares | −5.32*** | −1.62 | 3.70** | −0.35*** |
The third and fourth columns of Table V report results for arbitrageurs' long and short positions, respectively. The coefficient estimates are all negative (positive) for the long (short) trading measure, indicating that arbitrageurs reduce (increase) their long (short) positions in the market in response to an increase in long-short anomaly returns. The estimates are all significant at the 5% level for the regressions for the short market position, while they are all insignificant for the regressions for the long market position. Comparing the results in the third and fourth columns, the coefficient estimates are larger in magnitude for the short side than for the long side, consistent with stronger MCP for overpricing. Decomposing the net change in the second column, approximately one-third comes from declines in long positions and two-thirds from increases in short positions.
Finally, the last column of Table V reports predictive regression results for monthly U.S. financial market news tone. The news tone measure, from Calomiris and Mamaysky (2019), is based on English-language news articles from Thomson Reuters. Monthly news tone is an aggregation of differences in word tone among news articles on the topic of U.S. financial markets. We detrend news tone by computing deviations from a trailing 12-month MA. To incorporate information from the 100 anomalies into a single slope coefficient, we again estimate predictive regressions with
To the extent that asymmetric limits of arbitrage generate relatively strong overpricing correction persistence, we expect an increase in long-short anomaly returns to lead to more negative market-wide news tone in the next month; in other words, public news is more likely to confirm that shares were broadly overvalued. The results in the last column of Table V support this conjecture. The coefficient estimates are all significantly negative (at the 1% level), and the economic magnitudes are sizable: A one-standard-deviation increase in the predictor results in a 0.32- to 0.36-standard-deviation drop in market news tone. Overall, the results in Table V are consistent with the view that long-short anomaly portfolio returns predict market returns because arbitrageurs slowly correct market-wide overpricing.
VI. Conclusion
Is cross-sectional stock return predictability related to the time-series predictability of the aggregate market excess return? Our paper provides the first positive systematic answer to this question. Specifically, we find that a representative group of 100 long-short anomaly portfolio returns from the cross-sectional literature contains valuable information for predicting the market excess return on an out-of-sample basis, provided that we use forecasting strategies that guard against overfitting the data. We explore economic channels underlying the predictive ability of long-short anomaly portfolio returns, explaining how asymmetric limits of arbitrage—by giving rise to overpricing correction persistence—can account for the patterns of return predictability in the data. We also provide evidence supporting the relevance of asymmetric limits of arbitrage and overpricing correction persistence for explaining our results. In light of our positive findings for the equity market, it would be interesting in future research to examine links between cross-sectional and time-series return predictability in other markets, such as bonds and currencies, by applying our methods for extracting predictive signals from large sets of cross-sectional anomalies.46
Editors: Stefan Nagel, Philip Bond, Amit Seru, and Wei Xiong