IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i2p292-d1026779.html
   My bibliography  Save this article

Improving Data Sparsity in Recommender Systems Using Matrix Regeneration with Item Features

Author

Listed:
  • Sang-Min Choi

    (Department of Computer Science, Gyeongsang National University, Jinju-si 52828, Republic of Korea)

  • Dongwoo Lee

    (Manager S/W Development Wellxecon Corp., Seoul 06168, Republic of Korea)

  • Kiyoung Jang

    (Department of Computer Science, Yonsei University, Seoul 03722, Republic of Korea)

  • Chihyun Park

    (Department of Computer Science and Engineering, Kangwon National University, Chuncheon 24341, Republic of Korea)

  • Suwon Lee

    (Department of Computer Science, Gyeongsang National University, Jinju-si 52828, Republic of Korea
    Department of AI Convergence Engineering, Gyeongsang National University, Jinju-si 52828, Republic of Korea
    The Research Institute of Natural Science, Gyeongsang National University, Jinju-si 52828, Republic of Korea)

Abstract

With the development of the Web, users spend more time accessing information that they seek. As a result, recommendation systems have emerged to provide users with preferred contents by filtering abundant information, along with providing means of exposing search results to users more effectively. These recommendation systems operate based on the user reactions to items or on the various user or item features. It is known that recommendation results based on sparse datasets are less reliable because recommender systems operate according to user responses. Thus, we propose a method to improve the dataset sparsity and increase the accuracy of the prediction results by using item features with user responses. A method based on the content-based filtering concept is proposed to extract category rates from the user–item matrix according to the user preferences and to organize these into vectors. Thereafter, we present a method to filter the user–item matrix using the extracted vectors and to regenerate the input matrix for collaborative filtering (CF). We compare the prediction results of our approach and conventional CF using the mean absolute error and root mean square error. Moreover, we calculate the sparsity of the regenerated matrix and the existing input matrix, and demonstrate that the regenerated matrix is more dense than the existing one. By computing the Jaccard similarity between the item sets in the regenerated and existing matrices, we verify the matrix distinctions. The results of the proposed methods confirm that if the regenerated matrix is used as the CF input, a denser matrix with higher predictive accuracy can be constructed than when using conventional methods. The validity of the proposed method was verified by analyzing the effect of the input matrix composed of high average ratings on the CF prediction performance. The low sparsity and high prediction accuracy of the proposed method are verified by comparisons with the results by conventional methods. Improvements of approximately 16% based on K-nearest neighbor and 15% based on singular value decomposition, and a three times improvement in the sparsity based on regenerated and original matrices are obtained. We propose a matrix reconstruction method that can improve the performance of recommendations.

Suggested Citation

  • Sang-Min Choi & Dongwoo Lee & Kiyoung Jang & Chihyun Park & Suwon Lee, 2023. "Improving Data Sparsity in Recommender Systems Using Matrix Regeneration with Item Features," Mathematics, MDPI, vol. 11(2), pages 1-26, January.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:2:p:292-:d:1026779
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/2/292/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/2/292/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zhang, Jing & Peng, Qinke & Sun, Shiquan & Liu, Che, 2014. "Collaborative filtering recommendation algorithm based on user preference derived from item domain features," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 396(C), pages 66-76.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yong Eui Kim & Sang-Min Choi & Dongwoo Lee & Yeong Geon Seo & Suwon Lee, 2023. "A Reliable Prediction Algorithm Based on Genre2Vec for Item-Side Cold-Start Problems in Recommender Systems with Smart Contracts," Mathematics, MDPI, vol. 11(13), pages 1-25, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yin, Likang & Deng, Yong, 2018. "Measuring transferring similarity via local information," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 498(C), pages 102-115.
    2. Li, Man & Wen, Luosheng & Chen, Feiyu, 2021. "A novel Collaborative Filtering recommendation approach based on Soft Co-Clustering," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 561(C).
    3. Ramezani, Mohsen & Yaghmaee, Farzin, 2016. "A novel video recommendation system based on efficient retrieval of human actions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 457(C), pages 607-623.
    4. Jun Yao & Jianhui Chen, 2023. "A Study on the Characteristics of Middle-aged Chinese Female Users Based on Clothing Needs," Asian Social Science, Canadian Center of Science and Education, vol. 19(4), pages 1-86, August.
    5. Yong Eui Kim & Sang-Min Choi & Dongwoo Lee & Yeong Geon Seo & Suwon Lee, 2023. "A Reliable Prediction Algorithm Based on Genre2Vec for Item-Side Cold-Start Problems in Recommender Systems with Smart Contracts," Mathematics, MDPI, vol. 11(13), pages 1-25, July.
    6. Moradi, Parham & Ahmadian, Sajad & Akhlaghian, Fardin, 2015. "An effective trust-based recommendation method using a novel graph clustering algorithm," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 436(C), pages 462-481.
    7. Tae-Yeun Kim & Sung Bum Pan & Sung-Hwan Kim, 2020. "Sentiment Digitization Modeling for Recommendation System," Sustainability, MDPI, vol. 12(12), pages 1-27, June.
    8. Liang Xiao & Qibei Lu & Feipeng Guo, 2020. "Mobile Personalized Recommendation Model based on Privacy Concerns and Context Analysis for the Sustainable Development of M-commerce," Sustainability, MDPI, vol. 12(7), pages 1-20, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:2:p:292-:d:1026779. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.