IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2412.02342.html
   My bibliography  Save this paper

A Rule-Based Methodology for Company Identification: Application to the Downstream Space Sector

Author

Listed:
  • Kenza Bousedra
  • Pierre Pelletier

Abstract

This paper proposes an original methodology based on Named Entity Recognition (NER) to identify companies involved in downstream space activities, i.e., companies that provide services or products exploiting data and technology from space. Using a rule-based approach, the method leverages a corpus of texts from digitized French press articles to extract company names related to the downstream space segment. This approach allowed the detection of 88 new downstream space companies, enriching the existing database of the sector by 33\%. The paper details the identification process and provides guidelines for future replications, applying the method to other geographic areas, or adapting it to other industries where new entrants are challenging to identify using traditional activity classifications.

Suggested Citation

  • Kenza Bousedra & Pierre Pelletier, 2024. "A Rule-Based Methodology for Company Identification: Application to the Downstream Space Sector," Papers 2412.02342, arXiv.org.
  • Handle: RePEc:arx:papers:2412.02342
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2412.02342
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gregory S. Miller, 2006. "The Press as a Watchdog for Accounting Fraud," Journal of Accounting Research, Wiley Blackwell, vol. 44(5), pages 1001-1033, December.
    2. Leland Bybee & Bryan T. Kelly & Asaf Manela & Dacheng Xiu, 2021. "Business News and Business Cycles," NBER Working Papers 29344, National Bureau of Economic Research, Inc.
    3. repec:bla:jfinan:v:59:y:2004:i:3:p:1259-1294 is not listed on IDEAS
    4. Tobback, Ellen & Naudts, Hans & Daelemans, Walter & Junqué de Fortuny, Enric & Martens, David, 2018. "Belgian economic policy uncertainty index: Improvement through text mining," International Journal of Forecasting, Elsevier, vol. 34(2), pages 355-365.
    5. Sanjay K. Arora & Jan Youtie & Philip Shapira & Lidan Gao & TingTing Ma, 2013. "Entry strategies in an emerging technology: a pilot web-based study of graphene firms," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(3), pages 1189-1207, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrii Babii & Ryan T. Ball & Eric Ghysels & Jonas Striaukas, 2024. "Panel data nowcasting: The case of price–earnings ratios," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 292-307, March.
    2. Sebastian Kaumanns, 2019. "“Some fuzzy math”: relational information on debt value adjustments by managers and the financial press," Business Research, Springer;German Academic Association for Business Research, vol. 12(2), pages 755-794, December.
    3. Li, Xiaoqing & Qiao, Penghua & Zhao, Lin, 2019. "CEO media exposure, political connection and Chinese firms' stock price synchronicity," International Review of Economics & Finance, Elsevier, vol. 63(C), pages 61-75.
    4. Bailliu, Jeannine & Han, Xinfen & Kruger, Mark & Liu, Yu-Hsien & Thanabalasingam, Sri, 2019. "Can media and text analytics provide insights into labour market conditions in China?," International Journal of Forecasting, Elsevier, vol. 35(3), pages 1118-1130.
    5. Wong, Jin Boon & Zhang, Qin, 2022. "Stock market reactions to adverse ESG disclosure via media channels," The British Accounting Review, Elsevier, vol. 54(1).
    6. Kim, Jongkyum & Lim, Jee-Hae & Yoon, Kyunghee, 2022. "How do the content, format, and tone of Twitter-based corporate disclosure vary depending on earnings performance?," International Journal of Accounting Information Systems, Elsevier, vol. 47(C).
    7. Kai Wai Hui & Clive Lennox & Guochang Zhang, 2014. "The Market's Valuation of Fraudulently Reported Earnings," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 41(5-6), pages 627-651, June.
    8. Andrew C. Call & Gerald S. Martin & Nathan Y. Sharp & Jaron H. Wilde, 2018. "Whistleblowers and Outcomes of Financial Misrepresentation Enforcement Actions," Journal of Accounting Research, Wiley Blackwell, vol. 56(1), pages 123-171, March.
    9. Badolato, Patrick G. & Donelson, Dain C. & Ege, Matthew, 2014. "Audit committee financial expertise and earnings management: The role of status," Journal of Accounting and Economics, Elsevier, vol. 58(2), pages 208-230.
    10. Jeffrey Cohen & Yuan Ding & Cédric Lesage & Hervé Stolowy, 2017. "Media Bias and the Persistence of the Expectation Gap: An Analysis of Press Articles on Corporate Fraud," Journal of Business Ethics, Springer, vol. 144(3), pages 637-659, September.
    11. Nicholas Guest & Jaewoo Kim, 2024. "The media response to a loss of analyst coverage," Review of Accounting Studies, Springer, vol. 29(4), pages 3752-3787, December.
    12. James D. Westphal & David L. Deephouse, 2011. "Avoiding Bad Press: Interpersonal Influence in Relations Between CEOs and Journalists and the Consequences for Press Reporting About Firms and Their Leadership," Organization Science, INFORMS, vol. 22(4), pages 1061-1086, August.
    13. Li, Yin & Arora, Sanjay & Youtie, Jan & Shapira, Philip, 2018. "Using web mining to explore Triple Helix influences on growth in small and mid-size firms," Technovation, Elsevier, vol. 76, pages 3-14.
    14. Wu, Chunying & Xiong, Xiong & Gao, Ya, 2022. "The role of different information sources in information spread: Evidence from three media channels in China," International Review of Economics & Finance, Elsevier, vol. 80(C), pages 327-341.
    15. Robert M. Bushman & Christopher D. Williams & Regina Wittenberg‐Moerman, 2017. "The Informational Role of the Media in Private Lending," Journal of Accounting Research, Wiley Blackwell, vol. 55(1), pages 115-152, March.
    16. Hans B. Christensen & Luzi Hail & Christian Leuz, 2021. "Mandatory CSR and sustainability reporting: economic analysis and literature review," Review of Accounting Studies, Springer, vol. 26(3), pages 1176-1248, September.
    17. Xin Xu & Feng Xiong & Zhe An, 2023. "Using Machine Learning to Predict Corporate Fraud: Evidence Based on the GONE Framework," Journal of Business Ethics, Springer, vol. 186(1), pages 137-158, August.
    18. Stella D. Juventia & Sarah K. Jones & Marie-Angélique Laporte & Roseline Remans & Chiara Villani & Natalia Estrada-Carmona, 2020. "Text Mining National Commitments towards Agrobiodiversity Conservation and Use," Sustainability, MDPI, vol. 12(2), pages 1-19, January.
    19. Lu, Chao & Yang, Meng & Xia, Xiaoxue, 2023. "Economic policy uncertainty and default risk: Evidence from China," Economic Analysis and Policy, Elsevier, vol. 79(C), pages 821-836.
    20. An, Zhe & Chen, Chen & Naiker, Vic & Wang, Jun, 2020. "Does media coverage deter firms from withholding bad news? Evidence from stock price crash risk," Journal of Corporate Finance, Elsevier, vol. 64(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2412.02342. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.