IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2010.01996.html
   My bibliography  Save this paper

Evaluation of company investment value based on machine learning

Author

Listed:
  • Junfeng Hu
  • Xiaosa Li
  • Yuru Xu
  • Shaowu Wu
  • Bin Zheng

Abstract

In this paper, company investment value evaluation models are established based on comprehensive company information. After data mining and extracting a set of 436 feature parameters, an optimal subset of features is obtained by dimension reduction through tree-based feature selection, followed by the 5-fold cross-validation using XGBoost and LightGBM models. The results show that the Root-Mean-Square Error (RMSE) reached 3.098 and 3.059, respectively. In order to further improve the stability and generalization capability, Bayesian Ridge Regression has been used to train a stacking model based on the XGBoost and LightGBM models. The corresponding RMSE is up to 3.047. Finally, the importance of different features to the LightGBM model is analysed.

Suggested Citation

  • Junfeng Hu & Xiaosa Li & Yuru Xu & Shaowu Wu & Bin Zheng, 2020. "Evaluation of company investment value based on machine learning," Papers 2010.01996, arXiv.org.
  • Handle: RePEc:arx:papers:2010.01996
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2010.01996
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Urbinati, Andrea & Bogers, Marcel & Chiesa, Vittorio & Frattini, Federico, 2019. "Creating and capturing value from Big Data: A multiple-case study analysis of provider companies," Technovation, Elsevier, vol. 84, pages 21-36.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michele Gorgoglione & Achille Claudio Garavelli & Umberto Panniello & Angelo Natalicchio, 2023. "Information Retrieval Technologies and Big Data Analytics to Analyze Product Innovation in the Music Industry," Sustainability, MDPI, vol. 15(1), pages 1-16, January.
    2. Zhucui Jing & Ying Zheng & Hongli Guo, 2023. "A Study of the Impact of Digital Competence and Organizational Agility on Green Innovation Performance of Manufacturing Firms—The Moderating Effect Based on Knowledge Inertia," Administrative Sciences, MDPI, vol. 13(12), pages 1-17, December.
    3. Maniyassouwe Amana & Pingfeng Liu & Mona Alariqi, 2022. "Value Creation and Capture with Big Data in Smart Phones Companies," Sustainability, MDPI, vol. 14(23), pages 1-22, November.
    4. Huňady Ján & Pisár Peter & Khawaja Sarwar & Qureshi Fayyaz Hussain, 2024. "The Digital Transformation of European Union Countries before and during COVID-19," Business Systems Research, Sciendo, vol. 15(1), pages 22-45.
    5. Sarbu, Miruna, 2022. "The impact of industry 4.0 on innovation performance: Insights from German manufacturing and service firms," Technovation, Elsevier, vol. 113(C).
    6. Francesco Facchini & Joanna Oleśków-Szłapka & Luigi Ranieri & Andrea Urbinati, 2019. "A Maturity Model for Logistics 4.0: An Empirical Analysis and a Roadmap for Future Research," Sustainability, MDPI, vol. 12(1), pages 1-18, December.
    7. Canhoto, Ana Isabel & Clear, Fintan, 2020. "Artificial intelligence and machine learning as business tools: A framework for diagnosing value destruction potential," Business Horizons, Elsevier, vol. 63(2), pages 183-193.
    8. Patrucco, Andrea S. & Marzi, Giacomo & Trabucchi, Daniel, 2023. "The role of absorptive capacity and big data analytics in strategic purchasing and supply chain management decisions," Technovation, Elsevier, vol. 126(C).
    9. Cheng, Cong & Wang, Limin, 2022. "How companies configure digital innovation attributes for business model innovation? A configurational view," Technovation, Elsevier, vol. 112(C).
    10. Yu, Yan & Ibarra, Julio E. & Kumar, Kuldeep & Chergarova, Vasilka, 2021. "Coevolution of cyberinfrastructure development and scientific progress," Technovation, Elsevier, vol. 100(C).
    11. Yuegang Song & Ruibing Wu, 2022. "The Impact of Financial Enterprises’ Excessive Financialization Risk Assessment for Risk Control based on Data Mining and Machine Learning," Computational Economics, Springer;Society for Computational Economics, vol. 60(4), pages 1245-1267, December.
    12. Berndt Jesenko & Christian Schlögl, 2021. "The effect of web of science subject categories on clustering: the case of data-driven methods in business and economic sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6785-6801, August.
    13. Chun-Yan Zhu & Dong-Liang Zhang, 2022. "An Empirical Study on the Mechanism of Dynamic Capacity Formation in the Supply Chain," Sustainability, MDPI, vol. 14(22), pages 1-20, November.
    14. Raphaëlle Barbier & Skander Ben Yahia & Sylvain Lenfle & Benoit Weil, 2024. "Data-push innovation beyond serendipity: The case of a digital platform making Earth Observation data fit into multiple use contexts," Post-Print hal-04611800, HAL.
    15. Ávila-Robinson, Alfonso & Islam, Nazrul & Sengoku, Shintaro, 2022. "Exploring the knowledge base of innovation research: Towards an emerging innovation model," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    16. Barbier, Raphaëlle & Ben Yahia, Skander & Lenfle, Sylvain & Weil, Benoit, 2024. "Data-push innovation beyond serendipity: The case of a digital platform making Earth Observation data fit into multiple use contexts," Technovation, Elsevier, vol. 132(C).
    17. Maryia Zaitsava & Elona Marku & Maria Chiara Guardo & Azar Shahgholian, 2023. "A fine-grained perspective on big data knowledge creation: dimensions, insights, and mechanism from a pilot study," Journal of Management & Governance, Springer;Accademia Italiana di Economia Aziendale (AIDEA), vol. 27(2), pages 547-573, June.
    18. Merín-Rodrigáñez, Joan & Dasí, Àngels & Alegre, Joaquín, 2024. "Digital transformation and firm performance in innovative SMEs: The mediating role of business model innovation," Technovation, Elsevier, vol. 134(C).
    19. Cho, Jaehan & DeStefano, Timothy & Kim, Hanhin & Kim, Inchul & Paik, Jin Hyun, 2023. "What's driving the diffusion of next-generation digital technologies?," Technovation, Elsevier, vol. 119(C).
    20. Shih-Chia Chang & Hsu-Hwa Chang & Ming-Tsang Lu, 2021. "Evaluating Industry 4.0 Technology Application in SMEs: Using a Hybrid MCDM Approach," Mathematics, MDPI, vol. 9(4), pages 1-21, February.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2010.01996. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.