IDEAS home Printed from https://ideas.repec.org/a/eee/beexfi/v32y2021ics2214635021001210.html
   My bibliography  Save this article

Artificial intelligence and machine learning in finance: Identifying foundations, themes, and research clusters from bibliometric analysis

Author

Listed:
  • Goodell, John W.
  • Kumar, Satish
  • Lim, Weng Marc
  • Pattnaik, Debidutta

Abstract

Artificial intelligence (AI) and machine learning (ML) are two related technologies that are emergent in financial scholarship. However, no review, to date, has offered a wholistic retrospection of this research. To address this gap, we provide an overview of AI and ML research in finance. Using both co-citation and bibliometric-coupling analyses, we infer the thematic structure of AI and ML research in finance for 1986–April 2021. By uncovering nine (co-citation) and eight (bibliometric coupling) specific clusters of finance that apply AI and ML, we further identify three overarching groups of finance scholarship that are roughly equivalent for both forms of analysis: (1) portfolio construction, valuation, and investor behavior; (2) financial fraud and distress; and (3) sentiment inference, forecasting, and planning. Additionally, using co-occurrence and confluence analyses, we highlight trends and research directions regarding AI and ML in finance research. Our results provide assessment of AI and ML in finance research.

Suggested Citation

  • Goodell, John W. & Kumar, Satish & Lim, Weng Marc & Pattnaik, Debidutta, 2021. "Artificial intelligence and machine learning in finance: Identifying foundations, themes, and research clusters from bibliometric analysis," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
  • Handle: RePEc:eee:beexfi:v:32:y:2021:i:c:s2214635021001210
    DOI: 10.1016/j.jbef.2021.100577
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S2214635021001210
    Download Restriction: no

    File URL: https://libkey.io/10.1016/j.jbef.2021.100577?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Paul C. Tetlock & Maytal Saar‐Tsechansky & Sofus Macskassy, 2008. "More Than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, American Finance Association, vol. 63(3), pages 1437-1467, June.
    2. Donthu, Naveen & Kumar, Satish & Mukherjee, Debmalya & Pandey, Nitesh & Lim, Weng Marc, 2021. "How to conduct a bibliometric analysis: An overview and guidelines," Journal of Business Research, Elsevier, vol. 133(C), pages 285-296.
    3. Soh Young In & Dane Rook & Ashby Monk, 2019. "Integrating Alternative Data (Also Known as ESG Data) in Investment Decision Making," Global Economic Review, Taylor & Francis Journals, vol. 48(3), pages 237-260, July.
    4. Graham Elliott & Allan Timmermann, 2016. "Forecasting in Economics and Finance," Annual Review of Economics, Annual Reviews, vol. 8(1), pages 81-110, October.
    5. Flood, M. D. & Jagadish, H. V. & Raschid, L., 2016. "Big data challenges and opportunities in financial stability monitoring," Financial Stability Review, Banque de France, issue 20, pages 129-142, April.
    6. M. M. Kessler, 1963. "Bibliographic coupling between scientific papers," American Documentation, Wiley Blackwell, vol. 14(1), pages 10-25, January.
    7. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    8. Beynon, Malcolm J. & Peel, Michael J., 2001. "Variable precision rough set theory and data discretisation: an application to corporate failure prediction," Omega, Elsevier, vol. 29(6), pages 561-576, December.
    9. Y Liu & M Schumann, 2005. "Data mining feature selection for credit scoring models," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1099-1108, September.
    10. Li, Xiao, 2020. "When financial literacy meets textual analysis: A conceptual review," Journal of Behavioral and Experimental Finance, Elsevier, vol. 28(C).
    11. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    12. Syoiti Ninomiya & Nicolas Victoir, 2008. "Weak Approximation of Stochastic Differential Equations and Application to Derivative Pricing," Applied Mathematical Finance, Taylor & Francis Journals, vol. 15(2), pages 107-121.
    13. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    14. Pattnaik, Debidutta & Hassan, Mohammad Kabir & Kumar, Satish & Paul, Justin, 2020. "Trade credit research before and after the global financial crisis of 2008 – A bibliometric overview," Research in International Business and Finance, Elsevier, vol. 54(C).
    15. Justin A. Sirignano, 2019. "Deep learning for limit order books," Quantitative Finance, Taylor & Francis Journals, vol. 19(4), pages 549-570, April.
    16. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    17. Fabian Muniesa, 2007. "Market technologies and the pragmatics of prices," Post-Print halshs-00160893, HAL.
    18. Javier Arroyo & Rosa Espínola & Carlos Maté, 2011. "Different Approaches to Forecast Interval Time Series: A Comparison in Finance," Computational Economics, Springer;Society for Computational Economics, vol. 37(2), pages 169-191, February.
    19. Mustak, Mekhail & Salminen, Joni & Plé, Loïc & Wirtz, Jochen, 2021. "Artificial intelligence in marketing: Topic modeling, scientometric analysis, and research agenda," Journal of Business Research, Elsevier, vol. 124(C), pages 389-404.
    20. Boehmer, Ekkehart & Grammig, Joachim & Theissen, Erik, 2007. "Estimating the probability of informed trading--does trade misclassification matter?," Journal of Financial Markets, Elsevier, vol. 10(1), pages 26-47, February.
    21. Kevin W. Boyack & Richard Klavans, 2010. "Co‐citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    22. José Willer Prado & Valderí Castro Alcântara & Francisval Melo Carvalho & Kelly Carvalho Vieira & Luiz Kennedy Cruz Machado & Dany Flávio Tonelli, 2016. "Multivariate analysis of credit risk and bankruptcy research data: a bibliometric study involving different knowledge fields (1968–2014)," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1007-1029, March.
    23. Jan De Spiegeleer & Dilip B. Madan & Sofie Reyners & Wim Schoutens, 2018. "Machine learning for quantitative finance: fast derivative pricing, hedging and fitting," Quantitative Finance, Taylor & Francis Journals, vol. 18(10), pages 1635-1643, October.
    24. Alexandra Künzi-Bay & János Mayer, 2006. "Computational aspects of minimizing conditional value-at-risk," Computational Management Science, Springer, vol. 3(1), pages 3-27, January.
    25. Patrick Houlihan & Germán G. Creamer, 2021. "Leveraging Social Media to Predict Continuation and Reversal in Asset Prices," Computational Economics, Springer;Society for Computational Economics, vol. 57(2), pages 433-453, February.
    26. Vikas Sangwan & Harshita & Puneet Prakash & Shveta Singh, 2019. "Financial technology: a review of extant literature," Studies in Economics and Finance, Emerald Group Publishing Limited, vol. 37(1), pages 71-88, November.
    27. Paul C. Tetlock, 2007. "Giving Content to Investor Sentiment: The Role of Media in the Stock Market," Journal of Finance, American Finance Association, vol. 62(3), pages 1139-1168, June.
    28. Mohammed Mubashir Ali & Ashraf Elazouni, 2009. "Finance-based CPM/LOB scheduling of projects with repetitive non-serial activities," Construction Management and Economics, Taylor & Francis Journals, vol. 27(9), pages 839-856.
    29. Mekhail Mustak & Joni Salminen & Loïc Plé & Jochen Wirtz, 2021. "Artificial intelligence in marketing: Topic modeling, scientometric analysis, and research agenda," Post-Print hal-03269994, HAL.
    30. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    31. Tim Loughran & Bill McDonald, 2020. "Textual Analysis in Finance," Annual Review of Financial Economics, Annual Reviews, vol. 12(1), pages 357-375, December.
    32. Debidutta Pattnaik & Satish Kumar & Bruce Burton, 2021. "Thirty Years of The Australian Accounting Review: A Bibliometric Analysis," Australian Accounting Review, CPA Australia, vol. 31(2), pages 150-164, June.
    33. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    34. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    35. Aggarwal, Divya & Chandrasekaran, Shabana & Annamalai, Balamurugan, 2020. "A complete empirical ensemble mode decomposition and support vector machine-based approach to predict Bitcoin prices," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    36. Toorajipour, Reza & Sohrabpour, Vahid & Nazarpour, Ali & Oghazi, Pejvak & Fischl, Maria, 2021. "Artificial intelligence in supply chain management: A systematic literature review," Journal of Business Research, Elsevier, vol. 122(C), pages 502-517.
    37. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    38. Heston, Steven L, 1993. "A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options," The Review of Financial Studies, Society for Financial Studies, vol. 6(2), pages 327-343.
    39. Kim, Soon-Ho & Kim, Dongcheol, 2014. "Investor sentiment from internet message postings and the predictability of stock returns," Journal of Economic Behavior & Organization, Elsevier, vol. 107(PB), pages 708-729.
    40. Mark Broadie & Menghui Cao, 2008. "Improved lower and upper bound algorithms for pricing American options by simulation," Quantitative Finance, Taylor & Francis Journals, vol. 8(8), pages 845-861.
    41. Bhatia, Ankita & Chandani, Arti & Chhateja, Jagriti, 2020. "Robo advisory and its potential in addressing the behavioral biases of investors — A qualitative study in Indian context," Journal of Behavioral and Experimental Finance, Elsevier, vol. 25(C).
    42. Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
    43. Carhart, Mark M, 1997. "On Persistence in Mutual Fund Performance," Journal of Finance, American Finance Association, vol. 52(1), pages 57-82, March.
    44. Francesco Ciampi & Alessandro Giannozzi & Giacomo Marzi & Edward I. Altman, 2021. "Rethinking SME default prediction: a systematic literature review and future perspectives," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 2141-2188, March.
    45. Dong Yang & Pu Chen & Fuyuan Shi & Chenggong Wen, 2018. "Internet Finance: Its Uncertain Legal Foundations and the Role of Big Data in Its Development," Emerging Markets Finance and Trade, Taylor & Francis Journals, vol. 54(4), pages 721-732, March.
    46. M. Bee & J. Hambuckers & L. Trapin, 2021. "Estimating large losses in insurance analytics and operational risk using the g-and-h distribution," Quantitative Finance, Taylor & Francis Journals, vol. 21(7), pages 1207-1221, July.
    47. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    48. Kevin W. Boyack & Richard Klavans, 2010. "Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    49. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    50. Lily Fang & Joel Peress, 2009. "Media Coverage and the Cross‐section of Stock Returns," Journal of Finance, American Finance Association, vol. 64(5), pages 2023-2052, October.
    51. Raúl Gómez Martínez & Miguel Prado Román & Paola Plaza Casado, 2019. "Big Data Algorithmic Trading Systems Based on Investors’ Mood," Journal of Behavioral Finance, Taylor & Francis Journals, vol. 20(2), pages 227-238, April.
    52. Das, Sanjiv Ranjan, 2014. "Text and Context: Language Analytics in Finance," Foundations and Trends(R) in Finance, now publishers, vol. 8(3), pages 145-261, November.
    53. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    54. Tat Lung (Ron) Chan & Nicholas Hale, 2020. "Pricing European-type, early-exercise and discrete barrier options using an algorithm for the convolution of Legendre series," Quantitative Finance, Taylor & Francis Journals, vol. 20(8), pages 1307-1324, August.
    55. Lee, Charles M. C. & Radhakrishna, Balkrishna, 2000. "Inferring investor behavior: Evidence from TORQ data," Journal of Financial Markets, Elsevier, vol. 3(2), pages 83-111, May.
    56. Odders-White, Elizabeth R., 2000. "On the occurrence and consequences of inaccurate trade classification," Journal of Financial Markets, Elsevier, vol. 3(3), pages 259-286, August.
    57. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    58. Makarius, Erin E. & Mukherjee, Debmalya & Fox, Joseph D. & Fox, Alexa K., 2020. "Rising with the machines: A sociotechnical framework for bringing artificial intelligence into the organization," Journal of Business Research, Elsevier, vol. 120(C), pages 262-273.
    59. Craig Lewis & Steven Young, 2019. "Fad or future? Automated analysis of financial text and its implications for corporate reporting," Accounting and Business Research, Taylor & Francis Journals, vol. 49(5), pages 587-615, July.
    60. Satish Kumar & Weng Marc Lim & Nitesh Pandey & J. Christopher Westland, 2021. "20 years of Electronic Commerce Research," Electronic Commerce Research, Springer, vol. 21(1), pages 1-40, March.
    61. Xiaojun Li & Pan Tang, 2020. "Stock index prediction based on wavelet transform and FCD‐MLGRU," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(8), pages 1229-1237, December.
    62. Xiaobo Tang & Shixuan Li & Mingliang Tan & Wenxuan Shi, 2020. "Incorporating textual and management factors into financial distress prediction: A comparative study of machine learning methods," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(5), pages 769-787, August.
    63. Waltman, Ludo & van Eck, Nees Jan & Noyons, Ed C.M., 2010. "A unified approach to mapping and clustering of bibliometric networks," Journal of Informetrics, Elsevier, vol. 4(4), pages 629-635.
    64. H. Kent Baker & Satish Kumar & Debidutta Pattnaik, 2021. "Research constituents, intellectual structure, and collaboration pattern in the Journal of Forecasting: A bibliometric analysis," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(4), pages 577-602, July.
    65. Nag, Ashok K & Mitra, Amit, 2002. "Forecasting Daily Foreign Exchange Rates Using Genetically Optimized Neural Networks," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 21(7), pages 501-511, November.
    66. Wall, Larry D., 2018. "Some financial regulatory implications of artificial intelligence," Journal of Economics and Business, Elsevier, vol. 100(C), pages 55-63.
    67. Daniela Gabor & Sally Brooks, 2017. "The digital revolution in financial inclusion: international development in the fintech era," New Political Economy, Taylor & Francis Journals, vol. 22(4), pages 423-436, July.
    68. Craja, Patricia & Kim, Alisa & Lessmann, Stefan, 2020. "Deep Learning application for fraud detection in financial statements," IRTG 1792 Discussion Papers 2020-007, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Goodell, John W. & Kumar, Satish & Li, Xiao & Pattnaik, Debidutta & Sharma, Anuj, 2022. "Foundations and research clusters in investor attention: Evidence from bibliometric and topic modelling analysis," International Review of Economics & Finance, Elsevier, vol. 82(C), pages 511-529.
    2. Ahmad, Khurshid & Han, JingGuang & Hutson, Elaine & Kearney, Colm & Liu, Sha, 2016. "Media-expressed negative tone and firm-level stock returns," Journal of Corporate Finance, Elsevier, vol. 37(C), pages 152-172.
    3. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    4. Manuel Ammann & Nic Schaub, 2021. "Do Individual Investors Trade on Investment-Related Internet Postings?," Management Science, INFORMS, vol. 67(9), pages 5679-5702, September.
    5. Daniele Ballinari & Simon Behrendt, 2021. "How to gauge investor behavior? A comparison of online investor sentiment measures," Digital Finance, Springer, vol. 3(2), pages 169-204, June.
    6. Agarwal, Shweta & Kumar, Shailendra & Goel, Utkarsh, 2019. "Stock market response to information diffusion through internet sources: A literature review," International Journal of Information Management, Elsevier, vol. 45(C), pages 118-131.
    7. Avramov, Doron & Li, Minwen & Wang, Hao, 2021. "Predicting corporate policies using downside risk: A machine learning approach," Journal of Empirical Finance, Elsevier, vol. 63(C), pages 1-26.
    8. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    9. Miwa, Kotaro, 2023. "Divergent opinions on social media," International Review of Economics & Finance, Elsevier, vol. 86(C), pages 182-196.
    10. Abdi, Farshid & Kormanyos, Emily & Pelizzon, Loriana & Getmansky, Mila & Simon, Zorka, 2021. "Market impact of government communication: The case of presidential tweets," SAFE Working Paper Series 314, Leibniz Institute for Financial Research SAFE, revised 2021.
    11. Mao, Huina & Counts, Scott & Bollen, Johan, 2015. "Quantifying the effects of online bullishness on international financial markets," Statistics Paper Series 09, European Central Bank.
    12. Patrick Houlihan & Germán G. Creamer, 2021. "Leveraging Social Media to Predict Continuation and Reversal in Asset Prices," Computational Economics, Springer;Society for Computational Economics, vol. 57(2), pages 433-453, February.
    13. Frank, Murray Z. & Sanati, Ali, 2018. "How does the stock market absorb shocks?," Journal of Financial Economics, Elsevier, vol. 129(1), pages 136-153.
    14. Zongwu Cai & Pixiong Chen, 2022. "New Online Investor Sentiment and Asset Returns," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 202216, University of Kansas, Department of Economics, revised Nov 2022.
    15. Eierle, Brigitte & Klamer, Sebastian & Muck, Matthias, 2022. "Does it really pay off for investors to consider information from social media?," International Review of Financial Analysis, Elsevier, vol. 81(C).
    16. Prajwal Eachempati & Praveen Ranjan Srivastava, 2021. "Accounting for unadjusted news sentiment for asset pricing," Qualitative Research in Financial Markets, Emerald Group Publishing Limited, vol. 13(3), pages 383-422, May.
    17. Karapandza, Rasa, 2016. "Stock returns and future tense language in 10-K reports," Journal of Banking & Finance, Elsevier, vol. 71(C), pages 50-61.
    18. Buehlmaier, Matthias M. M. & Zechner, Josef, 2016. "Financial media, price discovery, and merger arbitrage," CFS Working Paper Series 551, Center for Financial Studies (CFS).
    19. Tim Loughran & Bill Mcdonald, 2016. "Textual Analysis in Accounting and Finance: A Survey," Journal of Accounting Research, Wiley Blackwell, vol. 54(4), pages 1187-1230, September.
    20. Chen, Cathy Yi-Hsuan & Després, Roméo & Guo, Li & Renault, Thomas, 2019. "What makes cryptocurrencies special? Investor sentiment and return predictability during the bubble," IRTG 1792 Discussion Papers 2019-016, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".

    More about this item

    Keywords

    Artificial intelligence; Bibliometric analysis; Finance; Machine learning; Review;
    All these keywords.

    JEL classification:

    • B16 - Schools of Economic Thought and Methodology - - History of Economic Thought through 1925 - - - Quantitative and Mathematical
    • B41 - Schools of Economic Thought and Methodology - - Economic Methodology - - - Economic Methodology
    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C40 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - General
    • C44 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Operations Research; Statistical Decision Theory

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:beexfi:v:32:y:2021:i:c:s2214635021001210. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/journal-of-behavioral-and-experimental-finance .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.