IDEAS home Printed from https://ideas.repec.org/a/hin/complx/2095048.html
   My bibliography  Save this article

Machine Learning to Assess Relatedness: The Advantage of Using Firm-Level Data

Author

Listed:
  • Giambattista Albora
  • Andrea Zaccaria
  • Pierluigi Contucci

Abstract

The relatedness between a country or a firm and a product is a measure of the feasibility of that economic activity. As such, it is a driver for investments at a private and institutional level. Traditionally, relatedness is measured using networks derived by country-level co-occurrences of product pairs, that is counting how many countries export both. In this work, we compare networks and machine learning algorithms trained not only on country-level data, but also on firms, which is something not much studied due to the low availability of firm-level data. We quantitatively compare the different measures of relatedness, by using them to forecast the exports at the country and firm level, assuming that more related products have a higher likelihood to be exported in the future. Our results show that relatedness is scale dependent: the best assessments are obtained by using machine learning on the same typology of data one wants to predict. Moreover, we found that while relatedness measures based on country data are not suitable for firms, firm-level data are very informative also for the development of countries. In this sense, models built on firm data provide a better assessment of relatedness. We also discuss the effect of using parameter optimization and community detection algorithms to identify clusters of related companies and products, finding that a partition into a higher number of blocks decreases the computational time while maintaining a prediction performance well above the network-based benchmarks.

Suggested Citation

  • Giambattista Albora & Andrea Zaccaria & Pierluigi Contucci, 2022. "Machine Learning to Assess Relatedness: The Advantage of Using Firm-Level Data," Complexity, Hindawi, vol. 2022, pages 1-12, July.
  • Handle: RePEc:hin:complx:2095048
    DOI: 10.1155/2022/2095048
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/complexity/2022/2095048.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/complexity/2022/2095048.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2022/2095048?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Giambattista Albora & Matteo Straccamore & Andrea Zaccaria, 2024. "Machine learning-based similarity measure to forecast M&A from patent data," Papers 2404.07179, arXiv.org.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:complx:2095048. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.