IDEAS home Printed from https://ideas.repec.org/a/ora/journl/v1y2017i1p453-461.html
   My bibliography  Save this article

Applying Big Data Algorithms For Sales Data Stored In Sap Hana

Author

Listed:
  • OLÃH JUDIT

    (UNIVERSITY OF DEBRECEN)

  • ERDEI EDINA

    (UNIVERSITY OF DEBRECEN)

  • POPP JOZSEF

    (UNIVERSITY OF DEBRECEN)

Abstract

An increasing number of small and medium enterprises operate and supervise their financial, logistics, production, human resource and other activities with ERP (Enterprise Resource Planning) IT systems, which are capable of managing these processes in a uniform framework. It is our research objective to explore analytical and prediction methods which can provide solutions to various financial and logistics problems that are faced by nowadays’ enterprises. In order to achieve this objective, we analysed a real database of an enterprise using SAP. This database covers the period between January 2013 – October 2016 and contains 7 million sales records of 15,674 different products. These products also include currently inactive items, however, they can still be important from the aspect of data analysis in relation to the examined period. The data structure was created using the recently introduced SAP HANA (High Performance Analytic Appliance) database management system which revolutionised data storage with its in-memory and column-oriented features. This new technology makes it possible to execute various transactions more effectively and more quickly. Following the preparation of the schematics of data needed for processing and the calculation of calculation fields, we used the tools provided by SAP Predictive Analytics which was introduced to the market in 2015. After filtering sales data for 15 quarters, we used k-means clustering for each period. After preparing and examining the clusters, we made observations which make it easier to perform stock management, logistics and pricing activities in the future, thereby contributing to the long-term increase of enterprise profit. Clustering was also performed in R programming language, which enabled us to illustrate the clustering results, i.e., each sales record in 3D, colouring them based on their associated cluster label. After inspecting these graphic outputs, it was concluded that certain products should be withdrawn from the market, while others should be either developed or their stock level increased. We used regression analysis on the cluster centroids to predict the movement of each cluster mainly in terms of time. As a result, we provided an estimation as to the direction that the products belonging to the sales records in each cluster will fluctuate to in accordance with the coordinates of the cluster centroids, thereby making recommendations related to the management of each item and group of items. As a next step, we performed a global F test for regression analysis to examine the correctness of our model. As a conclusion, we reject the null hypothesis which stated that our model is basically invalid.

Suggested Citation

  • Olãh Judit & Erdei Edina & Popp Jozsef, 2017. "Applying Big Data Algorithms For Sales Data Stored In Sap Hana," Annals of Faculty of Economics, University of Oradea, Faculty of Economics, vol. 1(1), pages 453-461, July.
  • Handle: RePEc:ora:journl:v:1:y:2017:i:1:p:453-461
    as

    Download full text from publisher

    File URL: http://anale.steconomiceuoradea.ro/volume/2017/n1/44.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Helen Thomas & Anindya Datta, 2001. "A Conceptual Model and Algebra for On-Line Analytical Processing in Decision Support Databases," Information Systems Research, INFORMS, vol. 12(1), pages 83-102, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huang, Chun-Che & (Bill) Tseng, Tzu-Liang & Li, Ming-Zhong & Gung, Roger R., 2006. "Models of multi-dimensional analysis for qualitative data and its application," European Journal of Operational Research, Elsevier, vol. 174(2), pages 983-1008, October.
    2. Gediminas Adomavicius & Alexander Tuzhilin & Rong Zheng, 2011. "REQUEST: A Query Language for Customizing Recommendations," Information Systems Research, INFORMS, vol. 22(1), pages 99-117, March.

    More about this item

    Keywords

    SAP HANA; SAP Predictive Analytics; R programming language; k-means clustering; regression analysis;
    All these keywords.

    JEL classification:

    • F47 - International Economics - - Macroeconomic Aspects of International Trade and Finance - - - Forecasting and Simulation: Models and Applications
    • M15 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Business Administration - - - IT Management
    • M49 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Accounting - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ora:journl:v:1:y:2017:i:1:p:453-461. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catalin ZMOLE (email available below). General contact details of provider: https://edirc.repec.org/data/feoraro.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.