IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2308.06882.html
   My bibliography  Save this paper

Quantifying Outlierness of Funds from their Categories using Supervised Similarity

Author

Listed:
  • Dhruv Desai
  • Ashmita Dhiman
  • Tushar Sharma
  • Deepika Sharma
  • Dhagash Mehta
  • Stefano Pasquali

Abstract

Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. Here, we aim to quantify the effect of miscategorization of funds utilizing a machine learning based approach. We formulate the problem of miscategorization of funds as a distance-based outlier detection problem, where the outliers are the data-points that are far from the rest of the data-points in the given feature space. We implement and employ a Random Forest (RF) based method of distance metric learning, and compute the so-called class-wise outlier measures for each data-point to identify outliers in the data. We test our implementation on various publicly available data sets, and then apply it to mutual fund data. We show that there is a strong relationship between the outlier measures of the funds and their future returns and discuss the implications of our findings.

Suggested Citation

  • Dhruv Desai & Ashmita Dhiman & Tushar Sharma & Deepika Sharma & Dhagash Mehta & Stefano Pasquali, 2023. "Quantifying Outlierness of Funds from their Categories using Supervised Similarity," Papers 2308.06882, arXiv.org.
  • Handle: RePEc:arx:papers:2308.06882
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2308.06882
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Edwin J. Elton & Martin J. Gruber & Christopher R. Blake, 2003. "Incentive Fees and Mutual Funds," Journal of Finance, American Finance Association, vol. 58(2), pages 779-804, April.
    2. Athanasios Orphanides, "undated". "Compensation Incentives and Risk Taking Behavior: Evidence from Mutual Funds," Finance and Economics Discussion Series 1996-21, Board of Governors of the Federal Reserve System (U.S.), revised 10 Dec 2019.
    3. Moreno, David & Marco, Paulina & Olmeda, Ignacio, 2006. "Self-organizing maps could improve the classification of Spanish mutual funds," European Journal of Operational Research, Elsevier, vol. 174(2), pages 1039-1054, October.
    4. Kim, Moon & Shukla, Ravi & Tomas, Michael, 2000. "Mutual fund objective misclassification," Journal of Economics and Business, Elsevier, vol. 52(4), pages 309-323.
    5. Vipul Satone & Dhruv Desai & Dhagash Mehta, 2021. "Fund2Vec: Mutual Funds Similarity using Graph Learning," Papers 2106.12987, arXiv.org.
    6. Jerinsh Jeyapaulraj & Dhruv Desai & Peter Chu & Dhagash Mehta & Stefano Pasquali & Philip Sommer, 2022. "Supervised similarity learning for corporate bonds using Random Forest proximities," Papers 2207.04368, arXiv.org, revised Oct 2022.
    7. Dimitrios Vamvourellis & Mate Attila Toth & Dhruv Desai & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Mutual Fund Categorization using Natural Language Processing," Papers 2207.04959, arXiv.org.
    8. Lin, Yi & Jeon, Yongho, 2006. "Random Forests and Adaptive Nearest Neighbors," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 578-590, June.
    9. Michael J. Cooper & Huseyin Gulen & P. Raghavendra Rau, 2005. "Changing Names with Style: Mutual Fund Name Changes and Their Effects on Fund Flows," Journal of Finance, American Finance Association, vol. 60(6), pages 2825-2858, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gregory Yampolsky & Dhruv Desai & Mingshu Li & Stefano Pasquali & Dhagash Mehta, 2024. "Case-based Explainability for Random Forest: Prototypes, Critics, Counter-factuals and Semi-factuals," Papers 2408.06679, arXiv.org.
    2. Nathalia Castellanos & Dhruv Desai & Sebastian Frank & Stefano Pasquali & Dhagash Mehta, 2024. "Can an unsupervised clustering algorithm reproduce a categorization system?," Papers 2408.10340, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vipul Satone & Dhruv Desai & Dhagash Mehta, 2021. "Fund2Vec: Mutual Funds Similarity using Graph Learning," Papers 2106.12987, arXiv.org.
    2. Dhagash Mehta & Dhruv Desai & Jithin Pradeep, 2020. "Machine Learning Fund Categorizations," Papers 2006.00123, arXiv.org.
    3. Dimitrios Vamvourellis & Mate Attila Toth & Dhruv Desai & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Mutual Fund Categorization using Natural Language Processing," Papers 2207.04959, arXiv.org.
    4. Cuthbertson, Keith & Nitzsche, Dirk & O'Sullivan, Niall, 2016. "A review of behavioural and management effects in mutual fund performance," International Review of Financial Analysis, Elsevier, vol. 44(C), pages 162-176.
    5. Jerinsh Jeyapaulraj & Dhruv Desai & Peter Chu & Dhagash Mehta & Stefano Pasquali & Philip Sommer, 2022. "Supervised similarity learning for corporate bonds using Random Forest proximities," Papers 2207.04368, arXiv.org, revised Oct 2022.
    6. Sensoy, Berk A., 2009. "Performance evaluation and self-designated benchmark indexes in the mutual fund industry," Journal of Financial Economics, Elsevier, vol. 92(1), pages 25-39, April.
    7. Cumming, Douglas & Johan, Sofia & Zhang, Yelin, 2019. "What is mutual fund flow?," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 62(C), pages 222-251.
    8. Nathalia Castellanos & Dhruv Desai & Sebastian Frank & Stefano Pasquali & Dhagash Mehta, 2024. "Can an unsupervised clustering algorithm reproduce a categorization system?," Papers 2408.10340, arXiv.org.
    9. Irina Bezhentseva Mateus & Cesario Mateus & Natasa Todorovic, 2019. "Benchmark-adjusted performance of US equity mutual funds and the issue of prospectus benchmarks," Journal of Asset Management, Palgrave Macmillan, vol. 20(1), pages 15-30, February.
    10. Fernando Muñoz & María Vargas & Ruth Vicente, 2021. "Style-changing behaviour in the socially responsible mutual fund industry: consequences on financial and sustainable performance," Sustainability Accounting, Management and Policy Journal, Emerald Group Publishing Limited, vol. 12(5), pages 1027-1051, February.
    11. Agarwal, Vikas & Boyson, Nicole M. & Naik, Narayan Y., 2007. "Hedge funds for retail investors? An examination of hedged mutual funds," CFR Working Papers 07-04, University of Cologne, Centre for Financial Research (CFR).
    12. Kurniawan, Meinanda & How, Janice & Verhoeven, Peter, 2016. "Fund governance and style drift," Pacific-Basin Finance Journal, Elsevier, vol. 40(PA), pages 59-72.
    13. Jun, Xiao & Li, Mingsheng & Yugang, Chen, 2017. "Catering to behavioral demand for dividends and its potential agency issue," Pacific-Basin Finance Journal, Elsevier, vol. 46(PB), pages 269-291.
    14. Liu, Jianxiang & Yi, WenYu, 2024. "Does the style drift caused by frequent cross-industry portfolio rebalancing harm fund performance? Evidence from China," Finance Research Letters, Elsevier, vol. 60(C).
    15. Mateus, Irina B. & Mateus, Cesario & Todorovic, Natasa, 2019. "Review of new trends in the literature on factor models and mutual fund performance," International Review of Financial Analysis, Elsevier, vol. 63(C), pages 344-354.
    16. Fraś Alicja, 2018. "Expensive and Cheap Funds – Polish Stock Mutual Fund Fees in 2017," Financial Sciences. Nauki o Finansach, Sciendo, vol. 23(4), pages 38-49, December.
    17. Chua, Angeline Kim Pei & Tam, On Kit, 2020. "The shrouded business of style drift in active mutual funds," Journal of Corporate Finance, Elsevier, vol. 64(C).
    18. Kirchler, Michael & Lindner, Florian & Weitzel, Utz, 2020. "Delegated investment decisions and rankings," Journal of Banking & Finance, Elsevier, vol. 120(C).
    19. Cameron Truong, 2013. "The January effect, does options trading matter?," Australian Journal of Management, Australian School of Business, vol. 38(1), pages 31-48, April.
    20. Chang, Xiaochen & Guo, Songlin & Huang, Junkai, 2022. "Kidnapped mutual funds: Irrational preference of naive investors and fund incentive distortion," International Review of Financial Analysis, Elsevier, vol. 83(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2308.06882. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.