IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v35y2020i2d10.1007_s00180-019-00919-6.html
   My bibliography  Save this article

Real-manufacturing-oriented big data analysis and data value evaluation with domain knowledge

Author

Listed:
  • Weichang Kong

    (Tongji University)

  • Fei Qiao

    (Tongji University)

  • Qidi Wu

    (Tongji University)

Abstract

As one of the most popular topics currently, big data has played an important role in both academic research and practical applications. However, in the manufacturing industry, it is difficult to make full use of the research results for production optimization and/or management due to the low quality of real workshop data. Typical quality problems of real workshop data include the information match degree, missing recessive data, and false error identification. The conventional data analysis methods cannot handle most such issues because these methods fail to consider professional insights into and domain knowledge about the data. The main motivation of this paper is to explore methods for analyzing and evaluating big data with domain knowledge. For this purpose, real production data from a semiconductor manufacturing workshop are adopted as the data object. First, a series of data analysis techniques with domain knowledge are developed for diagnosing the imperfections. Then, corresponding data processing techniques with domain knowledge are proposed for solving those data quality problems according to specific flaws in the data. Furthermore, this paper proposes quantitative calculation methods of data value density to determine the extent to which data quality can be improved by the proposed data processing techniques. Case studies are conducted to demonstrate that data analysis and processing techniques with domain knowledge can effectively handle data quality problems of real workshop data in terms of the information match degree, missing recessive data, and false error identification. The work in this paper has the potential to be further extended and applied to other big data applications beyond the manufacturing industry.

Suggested Citation

  • Weichang Kong & Fei Qiao & Qidi Wu, 2020. "Real-manufacturing-oriented big data analysis and data value evaluation with domain knowledge," Computational Statistics, Springer, vol. 35(2), pages 515-538, June.
  • Handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00919-6
    DOI: 10.1007/s00180-019-00919-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-019-00919-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-019-00919-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhou, Kaile & Fu, Chao & Yang, Shanlin, 2016. "Big data driven smart energy management: From big data to big insights," Renewable and Sustainable Energy Reviews, Elsevier, vol. 56(C), pages 215-225.
    2. Zhong, Ray Y. & Huang, George Q. & Lan, Shulin & Dai, Q.Y. & Chen, Xu & Zhang, T., 2015. "A big data approach for logistics trajectory discovery from RFID-enabled production data," International Journal of Production Economics, Elsevier, vol. 165(C), pages 260-272.
    3. Akter, Shahriar & Wamba, Samuel Fosso & Gunasekaran, Angappa & Dubey, Rameshwar & Childe, Stephen J., 2016. "How to improve firm performance using big data analytics capability and business strategy alignment?," International Journal of Production Economics, Elsevier, vol. 182(C), pages 113-131.
    4. Edwards, Richard E. & New, Joshua & Parker, Lynne E. & Cui, Borui & Dong, Jin, 2017. "Constructing large scale surrogate models from big data and artificial intelligence," Applied Energy, Elsevier, vol. 202(C), pages 685-699.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Desheng Dash Wu & Wolfgang Karl Härdle, 2020. "Service data analytics and business intelligence 2017," Computational Statistics, Springer, vol. 35(2), pages 423-426, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roßmann, Bernhard & Canzaniello, Angelo & von der Gracht, Heiko & Hartmann, Evi, 2018. "The future and social impact of Big Data Analytics in Supply Chain Management: Results from a Delphi study," Technological Forecasting and Social Change, Elsevier, vol. 130(C), pages 135-149.
    2. Wang, Hui & Gong, Qiguo & Wang, Shouyang, 2017. "Information processing structures and decision making delays in MRP and JIT," International Journal of Production Economics, Elsevier, vol. 188(C), pages 41-49.
    3. Purva Grover & Arpan Kumar Kar, 2017. "Big Data Analytics: A Review on Theoretical Contributions and Tools Used in Literature," Global Journal of Flexible Systems Management, Springer;Global Institute of Flexible Systems Management, vol. 18(3), pages 203-229, September.
    4. Sadeghi, M. & Kalantar, M., 2023. "Fully decentralized multi-agent coordination scheme in smart distribution restoration: Multilevel consensus," Applied Energy, Elsevier, vol. 350(C).
    5. Raut, Rakesh D. & Mangla, Sachin Kumar & Narwane, Vaibhav S. & Dora, Manoj & Liu, Mengqi, 2021. "Big Data Analytics as a mediator in Lean, Agile, Resilient, and Green (LARG) practices effects on sustainable supply chains," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 145(C).
    6. Ma, Shuaiyin & Zhang, Yingfeng & Lv, Jingxiang & Ge, Yuntian & Yang, Haidong & Li, Lin, 2020. "Big data driven predictive production planning for energy-intensive manufacturing industries," Energy, Elsevier, vol. 211(C).
    7. Ghasemaghaei, Maryam & Calic, Goran, 2019. "Does big data enhance firm innovation competency? The mediating role of data-driven insights," Journal of Business Research, Elsevier, vol. 104(C), pages 69-84.
    8. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2017. "A multidisciplinary perspective of big data in management research," International Journal of Production Economics, Elsevier, vol. 191(C), pages 97-112.
    9. Acciarini, Chiara & Cappa, Francesco & Boccardelli, Paolo & Oriani, Raffaele, 2023. "How can organizations leverage big data to innovate their business models? A systematic literature review," Technovation, Elsevier, vol. 123(C).
    10. Mariani, Marcello M. & Fosso Wamba, Samuel, 2020. "Exploring how consumer goods companies innovate in the digital age: The role of big data analytics companies," Journal of Business Research, Elsevier, vol. 121(C), pages 338-352.
    11. Ludivine Ravat & Aurélie Hemonnet-Goujot & Sandrine Hollet-Haudebert, 2023. "Data-driven innovation capability of marketing: an exploratory study of its components and underlying processes," Post-Print hal-04151199, HAL.
    12. Miguel Ángel Rodríguez López & Diego Rodríguez Rodríguez, 2024. "La aplicación de datos masivos en economía de la energía: una revisión," Working Papers 2024-08, FEDEA.
    13. Ahmad Ibrahim Aljumah & Mohammed T. Nuseir & Md. Mahmudul Alam, 2021. "Traditional marketing analytics, big data analytics and big data system quality and the success of new product development," Post-Print hal-03538161, HAL.
    14. Cano-Marin, Enrique & Mora-Cantallops, Marçal & Sánchez-Alonso, Salvador, 2023. "Twitter as a predictive system: A systematic literature review," Journal of Business Research, Elsevier, vol. 157(C).
    15. de Camargo Fiorini, Paula & Roman Pais Seles, Bruno Michel & Chiappetta Jabbour, Charbel Jose & Barberio Mariano, Enzo & de Sousa Jabbour, Ana Beatriz Lopes, 2018. "Management theory and big data literature: From a review to a research agenda," International Journal of Information Management, Elsevier, vol. 43(C), pages 112-129.
    16. Mohammad Ali Yamin, 2021. "Investigating the Drivers of Supply Chain Resilience in the Wake of the COVID-19 Pandemic: Empirical Evidence from an Emerging Economy," Sustainability, MDPI, vol. 13(21), pages 1-16, October.
    17. Rosa Lombardi & Raffaele Trequattrini & Federico Schimperna & Myriam Cano-Rubio, 2021. "The Impact of Smart Technologies on theManagement and Strategic Control: A Structured Literature Review," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2021(suppl. 1), pages 11-30.
    18. Luther Yuong Qai Chong & Thien Sang Lim, 2022. "Pull and Push Factors of Data Analytics Adoption and Its Mediating Role on Operational Performance," Sustainability, MDPI, vol. 14(12), pages 1-19, June.
    19. Papanagnou, Christos & Seiler, Andreas & Spanaki, Konstantina & Papadopoulos, Thanos & Bourlakis, Michael, 2022. "Data-driven digital transformation for emergency situations: The case of the UK retail sector," International Journal of Production Economics, Elsevier, vol. 250(C).
    20. Jihoon Moon & Junhong Kim & Pilsung Kang & Eenjun Hwang, 2020. "Solving the Cold-Start Problem in Short-Term Load Forecasting Using Tree-Based Methods," Energies, MDPI, vol. 13(4), pages 1-37, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00919-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.