IDEAS home Printed from https://ideas.repec.org/p/cpr/ceprdp/19314.html
   My bibliography  Save this paper

Portfolio management with big data

Author

Listed:
  • Penaranda, Francisco
  • Sentana, Enrique

Abstract

The purpose of this survey is to summarize the academic literature that studies some of the ways in which portfolio management has been affected in recent years by the availability of big datasets: many assets, many characteristics for each of them, many macro predictors, and various sources of unstructured data. Thus, we deliberately focus on applications rather than methods. We also include brief reviews of the financial theories underlying asset management, which provide the relevant background to assess the plethora of recent contributions to such an active research field.

Suggested Citation

  • Penaranda, Francisco & Sentana, Enrique, 2024. "Portfolio management with big data," CEPR Discussion Papers 19314, C.E.P.R. Discussion Papers.
  • Handle: RePEc:cpr:ceprdp:19314
    as

    Download full text from publisher

    File URL: https://cepr.org/publications/DP19314
    Download Restriction: CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    2. Stefano Giglio & Bryan Kelly & Dacheng Xiu, 2022. "Factor Models, Machine Learning, and Asset Pricing," Annual Review of Financial Economics, Annual Reviews, vol. 14(1), pages 337-368, November.
    3. Jagannathan, Ravi & Wang, Zhenyu, 1996. "The Conditional CAPM and the Cross-Section of Expected Returns," Journal of Finance, American Finance Association, vol. 51(1), pages 3-53, March.
    4. Lo, Andrew W. & Mackinlay, A. Craig, 1997. "Maximizing Predictability In The Stock And Bond Markets," Macroeconomic Dynamics, Cambridge University Press, vol. 1(1), pages 102-134, January.
    5. Francisco Peñaranda & Enrique Sentana, 2015. "A Unifying Approach to the Empirical Evaluation of Asset Pricing Models," The Review of Economics and Statistics, MIT Press, vol. 97(2), pages 412-435, May.
    6. George Chacko & Luis M. Viceira, 2005. "Dynamic Consumption and Portfolio Choice with Stochastic Volatility in Incomplete Markets," The Review of Financial Studies, Society for Financial Studies, vol. 18(4), pages 1369-1402.
    7. William F. Sharpe, 1963. "A Simplified Model for Portfolio Analysis," Management Science, INFORMS, vol. 9(2), pages 277-293, January.
    8. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    9. Leippold, Markus & Wang, Qian & Zhou, Wenyu, 2022. "Machine learning in the Chinese stock market," Journal of Financial Economics, Elsevier, vol. 145(2), pages 64-82.
    10. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    11. Stefano Giglio & Yuan Liao & Dacheng Xiu, 2021. "Thousands of Alpha Tests," NBER Chapters, in: Big Data: Long-Term Implications for Financial Markets and Firms, pages 3456, National Bureau of Economic Research, Inc.
    12. Enrique Sentana, 2005. "Least Squares Predictions and Mean-Variance Analysis," Journal of Financial Econometrics, Oxford University Press, vol. 3(1), pages 56-78.
    13. repec:dau:papers:123456789/4688 is not listed on IDEAS
    14. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    15. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    16. Frank Fabozzi & Dashan Huang & Guofu Zhou, 2010. "Robust portfolios: contributions from operations research and finance," Annals of Operations Research, Springer, vol. 176(1), pages 191-220, April.
    17. Alejandro Lopez-Lira & Yuehua Tang, 2023. "Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models," Papers 2304.07619, arXiv.org, revised Sep 2024.
    18. Connor, Gregory & Korajczyk, Robert A, 1993. "A Test for the Number of Factors in an Approximate Factor Model," Journal of Finance, American Finance Association, vol. 48(4), pages 1263-1291, September.
    19. Laurent Barras & Olivier Scaillet & Russ Wermers, 2010. "False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas," Journal of Finance, American Finance Association, vol. 65(1), pages 179-216, February.
    20. Peñaranda, Francisco & Sentana, Enrique, 2016. "Duality in mean-variance frontiers with conditioning information," Journal of Empirical Finance, Elsevier, vol. 38(PB), pages 762-785.
    21. Kozak, Serhiy & Nagel, Stefan & Santosh, Shrihari, 2020. "Shrinking the cross-section," Journal of Financial Economics, Elsevier, vol. 135(2), pages 271-292.
    22. Ravi Jagannathan & Tongshu Ma, 2003. "Risk Reduction in Large Portfolios: Why Imposing the Wrong Constraints Helps," Journal of Finance, American Finance Association, vol. 58(4), pages 1651-1683, August.
    23. Nicolae Gârleanu & Lasse Heje Pedersen, 2013. "Dynamic Trading with Predictable Returns and Transaction Costs," Journal of Finance, American Finance Association, vol. 68(6), pages 2309-2340, December.
    24. Bryan Kelly & Semyon Malamud & Lasse Heje Pedersen, 2023. "Principal Portfolios," Journal of Finance, American Finance Association, vol. 78(1), pages 347-387, February.
    25. Hansen, Lars Peter & Jagannathan, Ravi, 1991. "Implications of Security Market Data for Models of Dynamic Economies," Journal of Political Economy, University of Chicago Press, vol. 99(2), pages 225-262, April.
    26. Lin William Cong & Ke Tang & Jingyuan Wang & Yang Zhang, 2021. "Deep Sequence Modeling: Development and Applications in Asset Pricing," Papers 2108.08999, arXiv.org.
    27. J. Tobin, 1958. "Liquidity Preference as Behavior Towards Risk," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 25(2), pages 65-86.
    28. LuisM. Viceira & John Y. Campbell, 2001. "Who Should Buy Long-Term Bonds?," American Economic Review, American Economic Association, vol. 91(1), pages 99-127, March.
    29. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    30. Brennan, Michael J. & Schwartz, Eduardo S. & Lagnado, Ronald, 1997. "Strategic asset allocation," Journal of Economic Dynamics and Control, Elsevier, vol. 21(8-9), pages 1377-1403, June.
    31. Kan, Raymond & Zhou, Guofu, 2007. "Optimal Portfolio Choice with Parameter Uncertainty," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 42(3), pages 621-656, September.
    32. Carhart, Mark M, 1997. "On Persistence in Mutual Fund Performance," Journal of Finance, American Finance Association, vol. 52(1), pages 57-82, March.
    33. Michael W. Brandt & Pedro Santa-Clara & Rossen Valkanov, 2009. "Parametric Portfolio Policies: Exploiting Characteristics in the Cross-Section of Equity Returns," The Review of Financial Studies, Society for Financial Studies, vol. 22(9), pages 3411-3447, September.
    34. Keywan Christian Rasekhschaffe & Robert C. Jones, 2019. "Machine Learning for Stock Selection," Financial Analysts Journal, Taylor & Francis Journals, vol. 75(3), pages 70-88, July.
    35. Merton, Robert C., 1971. "Optimum consumption and portfolio rules in a continuous-time model," Journal of Economic Theory, Elsevier, vol. 3(4), pages 373-413, December.
    36. Lehmann, Bruce N. & Modest, David M., 1988. "The empirical foundations of the arbitrage pricing theory," Journal of Financial Economics, Elsevier, vol. 21(2), pages 213-254, September.
    37. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    38. Mykola Babiak & Jozef Barunik, 2020. "Deep Learning, Predictability, and Optimal Portfolio Returns," CERGE-EI Working Papers wp677, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    39. Banz, Rolf W., 1981. "The relationship between return and market value of common stocks," Journal of Financial Economics, Elsevier, vol. 9(1), pages 3-18, March.
    40. Jonathan Ingersoll & Ivo Welch, 2007. "Portfolio Performance Manipulation and Manipulation-proof Performance Measures," The Review of Financial Studies, Society for Financial Studies, vol. 20(5), pages 1503-1546, 2007 17.
    41. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    42. Michael W. Brandt & Pedro Santa‐Clara, 2006. "Dynamic Portfolio Selection by Augmenting the Asset Space," Journal of Finance, American Finance Association, vol. 61(5), pages 2187-2217, October.
    43. Ledoit, Olivier & Wolf, Michael, 2003. "Improved estimation of the covariance matrix of stock returns with an application to portfolio selection," Journal of Empirical Finance, Elsevier, vol. 10(5), pages 603-621, December.
    44. John Y. Campbell & Luis M. Viceira, 1999. "Consumption and Portfolio Decisions when Expected Returns are Time Varying," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(2), pages 433-495.
    45. Jorion, Philippe, 1986. "Bayes-Stein Estimation for Portfolio Analysis," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 21(3), pages 279-292, September.
    46. Merton, Robert C, 1973. "An Intertemporal Capital Asset Pricing Model," Econometrica, Econometric Society, vol. 41(5), pages 867-887, September.
    47. Harry Markowitz, 1952. "Portfolio Selection," Journal of Finance, American Finance Association, vol. 7(1), pages 77-91, March.
    48. Andrew Y. Chen & Tom Zimmermann, 2022. "Open Source Cross-Sectional Asset Pricing," Critical Finance Review, now publishers, vol. 11(2), pages 207-264, May.
    49. Doron Avramov & Guofu Zhou, 2010. "Bayesian Portfolio Analysis," Annual Review of Financial Economics, Annual Reviews, vol. 2(1), pages 25-47, December.
    50. Francisco Peñaranda & Liuren Wu, 2022. "Targets, Predictability, and Performance," Management Science, INFORMS, vol. 68(2), pages 1537-1555, February.
    51. Ayman Chaouki & Stephen Hardiman & Christian Schmidt & Emmanuel S'eri'e & Joachim de Lataillade, 2020. "Deep Deterministic Portfolio Optimization," Papers 2003.06497, arXiv.org, revised Apr 2020.
    52. Pedro M. Mirete-Ferrer & Alberto Garcia-Garcia & Juan Samuel Baixauli-Soler & Maria A. Prats, 2022. "A Review on Machine Learning for Asset Management," Risks, MDPI, vol. 10(4), pages 1-46, April.
    53. Tobek, Ondrej & Hronec, Martin, 2021. "Does it pay to follow anomalies research? Machine learning approach with international evidence," Journal of Financial Markets, Elsevier, vol. 56(C).
    54. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    55. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    56. Heston, Steven L. & Sadka, Ronnie, 2008. "Seasonality in the cross-section of stock returns," Journal of Financial Economics, Elsevier, vol. 87(2), pages 418-445, February.
    57. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    58. Soohun Kim & Robert A Korajczyk & Andreas Neuhierl & Wei JiangEditor, 2021. "Arbitrage Portfolios," The Review of Financial Studies, Society for Financial Studies, vol. 34(6), pages 2813-2856.
    59. Clifford S. Asness & Tobias J. Moskowitz & Lasse Heje Pedersen, 2013. "Value and Momentum Everywhere," Journal of Finance, American Finance Association, vol. 68(3), pages 929-985, June.
    60. Michael Johannes & Arthur Korteweg & Nicholas Polson, 2014. "Sequential Learning, Predictability, and Optimal Portfolio Returns," Journal of Finance, American Finance Association, vol. 69(2), pages 611-644, April.
    61. Elton, Edwin J & Gruber, Martin J, 1973. "Estimating the Dependence Structure of Share Prices-Implications for Portfolio Selection," Journal of Finance, American Finance Association, vol. 28(5), pages 1203-1232, December.
    62. Frost, Peter A. & Savarino, James E., 1986. "An Empirical Bayes Approach to Efficient Portfolio Selection," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 21(3), pages 293-305, September.
    63. Chamberlain, Gary, 1983. "Funds, Factors, and Diversification in Arbitrage Pricing Models," Econometrica, Econometric Society, vol. 51(5), pages 1305-1323, September.
    64. Lily Fang & Joel Peress, 2009. "Media Coverage and the Cross‐section of Stock Returns," Journal of Finance, American Finance Association, vol. 64(5), pages 2023-2052, October.
    65. Matthew Spiegel, 2008. "Forecasting the Equity Premium: Where We Stand Today," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1453-1454, July.
    66. Obaid, Khaled & Pukthuanthong, Kuntara, 2022. "A picture is worth a thousand words: Measuring investor sentiment by combining machine learning and photos from news," Journal of Financial Economics, Elsevier, vol. 144(1), pages 273-297.
    67. Fama, Eugene F & MacBeth, James D, 1973. "Risk, Return, and Equilibrium: Empirical Tests," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 607-636, May-June.
    68. repec:bla:jfinan:v:58:y:2003:i:4:p:1651-1684 is not listed on IDEAS
    69. Gallant, A. Ronald & Hansen, Lars Peter & Tauchen, George, 1990. "Using conditional moments of asset payoffs to infer the volatility of intertemporal marginal rates of substitution," Journal of Econometrics, Elsevier, vol. 45(1-2), pages 141-179.
    70. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    71. Merton, Robert C., 1980. "On estimating the expected return on the market : An exploratory investigation," Journal of Financial Economics, Elsevier, vol. 8(4), pages 323-361, December.
    72. Fan, Jianqing & Fan, Yingying & Lv, Jinchi, 2008. "High dimensional covariance matrix estimation using a factor model," Journal of Econometrics, Elsevier, vol. 147(1), pages 186-197, November.
    73. Jegadeesh, Narasimhan & Titman, Sheridan, 1993. "Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency," Journal of Finance, American Finance Association, vol. 48(1), pages 65-91, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zura Kakushadze & Willie Yu, 2016. "Multifactor Risk Models and Heterotic CAPM," Papers 1602.04902, arXiv.org, revised Mar 2016.
    2. John Y. Campbell, 2000. "Asset Pricing at the Millennium," Journal of Finance, American Finance Association, vol. 55(4), pages 1515-1567, August.
    3. Gregory Connor & Lisa R. Goldberg & Robert A. Korajczyk, 2010. "Portfolio Risk Analysis," Economics Books, Princeton University Press, edition 1, number 9224.
    4. Zura Kakushadze & Willie Yu, 2016. "Statistical Risk Models," Papers 1602.08070, arXiv.org, revised Jan 2017.
    5. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    6. Zura Kakushadze, 2015. "Heterotic Risk Models," Papers 1508.04883, arXiv.org, revised Jan 2016.
    7. Penaranda, Francisco, 2007. "Portfolio choice beyond the traditional approach," LSE Research Online Documents on Economics 24481, London School of Economics and Political Science, LSE Library.
    8. Constantinos Kardaras & Hyeng Keun Koo & Johannes Ruf, 2022. "Estimation of growth in fund models," Papers 2208.02573, arXiv.org.
    9. Behr, Patrick & Guettler, Andre & Truebenbach, Fabian, 2012. "Using industry momentum to improve portfolio performance," Journal of Banking & Finance, Elsevier, vol. 36(5), pages 1414-1423.
    10. repec:gnv:wpaper:unige:76321 is not listed on IDEAS
    11. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
    12. Ni, Xuanming & Zheng, Tiantian & Zhao, Huimin & Zhu, Shushang, 2023. "High-dimensional portfolio optimization based on tree-structured factor model," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    13. Patrick Gagliardini & Elisa Ossola & Olivier Scaillet, 2016. "Time‐Varying Risk Premium in Large Cross‐Sectional Equity Data Sets," Econometrica, Econometric Society, vol. 84, pages 985-1046, May.
    14. Guillaume Chevalier & Guillaume Coqueret & Thomas Raffinot, 2022. "Supervised portfolios," Post-Print hal-04144588, HAL.
    15. Amit Goyal, 2012. "Empirical cross-sectional asset pricing: a survey," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 26(1), pages 3-38, March.
    16. Cakici, Nusret & Fieberg, Christian & Metko, Daniel & Zaremba, Adam, 2023. "Machine learning goes global: Cross-sectional return predictability in international stock markets," Journal of Economic Dynamics and Control, Elsevier, vol. 155(C).
    17. Thomas Conlon & John Cotter & Iason Kynigakis, 2021. "Machine Learning and Factor-Based Portfolio Optimization," Papers 2107.13866, arXiv.org.
    18. Hsu, Po-Hsuan & Han, Qiheng & Wu, Wensheng & Cao, Zhiguang, 2018. "Asset allocation strategies, data snooping, and the 1 / N rule," Journal of Banking & Finance, Elsevier, vol. 97(C), pages 257-269.
    19. Sainan Jin & Liangjun Su & Yonghui Zhang, 2015. "Nonparametric testing for anomaly effects in empirical asset pricing models," Empirical Economics, Springer, vol. 48(1), pages 9-36, February.
    20. Bui, Dien Giau & Kong, De-Rong & Lin, Chih-Yung & Lin, Tse-Chun, 2023. "Momentum in machine learning: Evidence from the Taiwan stock market," Pacific-Basin Finance Journal, Elsevier, vol. 82(C).
    21. De Nard, Gianluca & Zhao, Zhao, 2023. "Using, taming or avoiding the factor zoo? A double-shrinkage estimator for covariance matrices," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 23-35.

    More about this item

    Keywords

    Machine learning; Mean-variance analysis; Stochastic discount factors;
    All these keywords.

    JEL classification:

    • G11 - Financial Economics - - General Financial Markets - - - Portfolio Choice; Investment Decisions
    • G12 - Financial Economics - - General Financial Markets - - - Asset Pricing; Trading Volume; Bond Interest Rates
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:19314. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://www.cepr.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.