IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v461y2016icp726-746.html
   My bibliography  Save this article

MST Fitness Index and implicit data narratives: A comparative test on alternative unsupervised algorithms

Author

Listed:
  • Buscema, Massimo
  • Sacco, Pier Luigi

Abstract

In this paper, we introduce a new methodology for the evaluation of alternative algorithms in capturing the deep statistical structure of datasets of different types and nature, called MST Fitness, and based on the notion of Minimum Spanning Tree (MST). We test this methodology on six different databases, some of which artificial and widely used in similar experimentations, and some related to real world phenomena. Our test set consists of eight different algorithms, including some widely known and used, such as Principal Component Analysis, Linear Correlation, or Euclidean Distance. We moreover consider more sophisticated Artificial Neural Network based algorithms, such as the Self-Organizing Map (SOM) and a relatively new algorithm called Auto-Contractive Map (AutoCM). We find that, for our benchmark of datasets, AutoCM performs consistently better than all other algorithms for all of the datasets, and that its global performance is superior to that of the others of several orders of magnitude. It is to be checked in future research if AutoCM can be considered a truly general-purpose algorithm for the analysis of heterogeneous categories of datasets.

Suggested Citation

  • Buscema, Massimo & Sacco, Pier Luigi, 2016. "MST Fitness Index and implicit data narratives: A comparative test on alternative unsupervised algorithms," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 461(C), pages 726-746.
  • Handle: RePEc:eee:phsmap:v:461:y:2016:i:c:p:726-746
    DOI: 10.1016/j.physa.2016.05.055
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437116302473
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2016.05.055?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Miccichè, Salvatore & Bonanno, Giovanni & Lillo, Fabrizio & N. Mantegna, Rosario, 2003. "Degree stability of a minimum spanning tree of price return and volatility," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 324(1), pages 66-73.
    2. R. Mantegna, 1999. "Hierarchical structure in financial markets," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 11(1), pages 193-197, September.
    3. Enzo Grossi & Angelo Compare & Massimo Buscema, 2014. "The concept of individual semantic maps in clinical psychology: a feasibility study on a new paradigm," Quality & Quantity: International Journal of Methodology, Springer, vol. 48(1), pages 15-35, January.
    4. Dias, João, 2013. "Spanning trees and the Eurozone crisis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(23), pages 5974-5984.
    5. C. Coronnello & M. Tumminello & F. Lillo & S. Miccich`e & R. N. Mantegna, 2005. "Sector identification in a set of stock return time series traded at the London Stock Exchange," Papers cond-mat/0508122, arXiv.org.
    6. Paolo Massimo Buscema & William J. Tastle, 2015. "Artificial Neural Network What-If Theory," International Journal of Information Systems and Social Change (IJISSC), IGI Global, vol. 6(4), pages 52-81, October.
    7. Enzo Grossi & Giorgio Tavano Blessi & Pier Sacco & Massimo Buscema, 2012. "The Interaction Between Culture, Health and Psychological Well-Being: Data Mining from the Italian Culture and Well-Being Project," Journal of Happiness Studies, Springer, vol. 13(1), pages 129-148, March.
    8. Sensoy, Ahmet & Tabak, Benjamin M., 2014. "Dynamic spanning trees in stock market networks: The case of Asia-Pacific," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 387-402.
    9. Yiting Zhang & Gladys Hui Ting Lee & Jian Cheng Wong & Jun Liang Kok & Manamohan Prusty & Siew Ann Cheong, 2010. "Will the US Economy Recover in 2010? A Minimal Spanning Tree Study," Papers 1009.5800, arXiv.org, revised Dec 2010.
    10. Jang, Wooseok & Lee, Junghoon & Chang, Woojin, 2011. "Currency crises and the evolution of foreign exchange market: Evidence from minimum spanning tree," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(4), pages 707-718.
    11. Kalyagin, V.A. & Koldanov, A.P. & Koldanov, P.A. & Pardalos, P.M. & Zamaraev, V.A., 2014. "Measures of uncertainty in market network analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 413(C), pages 59-70.
    12. Zhang, Yiting & Lee, Gladys Hui Ting & Wong, Jian Cheng & Kok, Jun Liang & Prusty, Manamohan & Cheong, Siew Ann, 2011. "Will the US economy recover in 2010? A minimal spanning tree study," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(11), pages 2020-2050.
    13. Coelho, Ricardo & Gilmore, Claire G. & Lucey, Brian & Richmond, Peter & Hutzler, Stefan, 2007. "The evolution of interdependence in world equity markets—Evidence from minimum spanning trees," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 376(C), pages 455-466.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Paolo Massimo Buscema & Guido Ferilli & Christer Gustafsson & Pier Luigi Sacco, 2020. "The Complex Dynamic Evolution of Cultural Vibrancy in the Region of Halland, Sweden," International Regional Science Review, , vol. 43(3), pages 159-202, May.
    2. Mennini, Francesco Saverio & Gitto, Lara & Russo, Simone & Cicchetti, Americo & Ruggeri, Matteo & Coretti, Silvia & Maurelli, Guido & Buscema, Paolo Massimo, 2017. "Does regional belonging explain the similarities in the expenditure determinants of Italian healthcare deliveries?," Economic Analysis and Policy, Elsevier, vol. 55(C), pages 47-56.
    3. De Carlo, Manuela & Ferilli, Guido & d'Angella, Francesca & Buscema, Massimo, 2021. "Artificial intelligence to design collaborative strategy: An application to urban destinations," Journal of Business Research, Elsevier, vol. 129(C), pages 936-948.
    4. Erspamer, Christopher & Della Torre, Francesca & Massini, Giulia & Ferilli, Guido & Sacco, Pier Luigi & Buscema, Paolo Massimo, 2022. "Global world (dis-)order? Analyzing the dynamic evolution of the micro-structure of multipolarism by means of an unsupervised neural network approach," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    5. Paolo Massimo Buscema & Francesca Della Torre & Giulia Massini & Guido Ferilli & Pier Luigi Sacco, 2023. "Reconstructing the Emergent Organization of Information Flows in International Stock Markets: A Computational Complex Systems Approach," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 49-89, June.
    6. Buscema, Massimo & Ferilli, Guido & Sacco, Pier Luigi, 2017. "What kind of ‘world order’? An artificial neural networks approach to intensive data mining," Technological Forecasting and Social Change, Elsevier, vol. 117(C), pages 46-56.
    7. Buscema, Massimo & Ferilli, Guido & Gustafsson, Christer & Massini, Giulia & Sacco, Pier Luigi, 2022. "A nonlinear, data-driven, ANNs-based approach to culture-led development policies in rural areas: The case of Gjakove and Peć districts, Western Kosovo," Chaos, Solitons & Fractals, Elsevier, vol. 162(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gautier Marti & Frank Nielsen & Miko{l}aj Bi'nkowski & Philippe Donnat, 2017. "A review of two decades of correlations, hierarchies, networks and clustering in financial markets," Papers 1703.00485, arXiv.org, revised Nov 2020.
    2. Deviren, Seyma Akkaya & Deviren, Bayram, 2016. "The relationship between carbon dioxide emission and economic growth: Hierarchical structure methods," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 451(C), pages 429-439.
    3. Djauhari, Maman Abdurachman & Gan, Siew Lee, 2015. "Optimality problem of network topology in stocks market analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 419(C), pages 108-114.
    4. Yao, Hongxing & Memon, Bilal Ahmed, 2019. "Network topology of FTSE 100 Index companies: From the perspective of Brexit," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 1248-1262.
    5. Leonidas Sandoval Junior & Italo De Paula Franca, 2011. "Correlation of financial markets in times of crisis," Papers 1102.1339, arXiv.org, revised Mar 2011.
    6. Sandoval, Leonidas & Franca, Italo De Paula, 2012. "Correlation of financial markets in times of crisis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(1), pages 187-208.
    7. Barbi, A.Q. & Prataviera, G.A., 2019. "Nonlinear dependencies on Brazilian equity network from mutual information minimum spanning trees," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 876-885.
    8. Sandoval, Leonidas, 2014. "To lag or not to lag? How to compare indices of stock markets that operate on different times," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 403(C), pages 227-243.
    9. Cheong, Siew Ann & Fornia, Robert Paulo & Lee, Gladys Hui Ting & Kok, Jun Liang & Yim, Woei Shyr & Xu, Danny Yuan & Zhang, Yiting, 2011. "The Japanese economy in crises: A time series segmentation study," Economics Discussion Papers 2011-24, Kiel Institute for the World Economy (IfW Kiel).
    10. Leonidas Sandoval Junior, 2011. "A Map of the Brazilian Stock Market," Papers 1107.4146, arXiv.org, revised Mar 2013.
    11. Kazemilari, Mansooreh & Mardani, Abbas & Streimikiene, Dalia & Zavadskas, Edmundas Kazimieras, 2017. "An overview of renewable energy companies in stock exchange: Evidence from minimal spanning tree approach," Renewable Energy, Elsevier, vol. 102(PA), pages 107-117.
    12. Sandoval, Leonidas, 2012. "Pruning a minimum spanning tree," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(8), pages 2678-2711.
    13. A. Q. Barbi & G. A. Prataviera, 2017. "Nonlinear dependencies on Brazilian equity network from mutual information minimum spanning trees," Papers 1711.06185, arXiv.org, revised May 2019.
    14. Leonidas Sandoval Junior, 2011. "Cluster formation and evolution in networks of financial market indices," Papers 1111.5069, arXiv.org.
    15. Marcin Wk{a}torek & Stanis{l}aw Dro.zd.z & Jaros{l}aw Kwapie'n & Ludovico Minati & Pawe{l} O'swik{e}cimka & Marek Stanuszek, 2020. "Multiscale characteristics of the emerging global cryptocurrency market," Papers 2010.15403, arXiv.org, revised Mar 2021.
    16. Radhakrishnan, Srinivasan & Duvvuru, Arjun & Sultornsanee, Sivarit & Kamarthi, Sagar, 2016. "Phase synchronization based minimum spanning trees for analysis of financial time series with nonlinear correlations," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 444(C), pages 259-270.
    17. Bilal Ahmed Memon & Hongxing Yao & Rabia Tahir, 2020. "General election effect on the network topology of Pakistan’s stock market: network-based study of a political event," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 6(1), pages 1-14, December.
    18. Sandoval, Leonidas Junior, 2013. "To lag or not to lag? How to compare indices of stock markets that operate at different times," Insper Working Papers wpe_319, Insper Working Paper, Insper Instituto de Ensino e Pesquisa.
    19. Millington, Tristan & Niranjan, Mahesan, 2021. "Construction of minimum spanning trees from financial returns using rank correlation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 566(C).
    20. Cheong, Siew Ann & Fornia, Robert Paulo & Lee, Gladys Hui Ting & Kok, Jun Liang & Yim, Woei Shyr & Xu, Danny Yuan & Zhang, Yiting, 2012. "The Japanese economy in crises: A time series segmentation study," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 6, pages 1-81.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:461:y:2016:i:c:p:726-746. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.