IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i18p2274-d636683.html
   My bibliography  Save this article

3D-DCDAE: Unsupervised Music Latent Representations Learning Method Based on a Deep 3D Convolutional Denoising Autoencoder for Music Genre Classification

Author

Listed:
  • Lvyang Qiu

    (Department of Multimedia Engineering, Dongguk University-Seoul, Seoul 04620, Korea)

  • Shuyu Li

    (Department of Multimedia Engineering, Dongguk University-Seoul, Seoul 04620, Korea)

  • Yunsick Sung

    (Department of Multimedia Engineering, Dongguk University-Seoul, Seoul 04620, Korea)

Abstract

With unlabeled music data widely available, it is necessary to build an unsupervised latent music representation extractor to improve the performance of classification models. This paper proposes an unsupervised latent music representation learning method based on a deep 3D convolutional denoising autoencoder (3D-DCDAE) for music genre classification, which aims to learn common representations from a large amount of unlabeled data to improve the performance of music genre classification. Specifically, unlabeled MIDI files are applied to 3D-DCDAE to extract latent representations by denoising and reconstructing input data. Next, a decoder is utilized to assist the 3D-DCDAE in training. After 3D-DCDAE training, the decoder is replaced by a multilayer perceptron (MLP) classifier for music genre classification. Through the unsupervised latent representations learning method, unlabeled data can be applied to classification tasks so that the problem of limiting classification performance due to insufficient labeled data can be solved. In addition, the unsupervised 3D-DCDAE can consider the musicological structure to expand the understanding of the music field and improve performance in music genre classification. In the experiments, which utilized the Lakh MIDI dataset, a large amount of unlabeled data was utilized to train the 3D-DCDAE, obtaining a denoising and reconstruction accuracy of approximately 98%. A small amount of labeled data was utilized for training a classification model consisting of the trained 3D-DCDAE and the MLP classifier, which achieved a classification accuracy of approximately 88%. The experimental results show that the model achieves state-of-the-art performance and significantly outperforms other methods for music genre classification with only a small amount of labeled data.

Suggested Citation

  • Lvyang Qiu & Shuyu Li & Yunsick Sung, 2021. "3D-DCDAE: Unsupervised Music Latent Representations Learning Method Based on a Deep 3D Convolutional Denoising Autoencoder for Music Genre Classification," Mathematics, MDPI, vol. 9(18), pages 1-17, September.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:18:p:2274-:d:636683
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/18/2274/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/18/2274/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Pieter-Tjerk de Boer & Dirk Kroese & Shie Mannor & Reuven Rubinstein, 2005. "A Tutorial on the Cross-Entropy Method," Annals of Operations Research, Springer, vol. 134(1), pages 19-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhe Jiang & Shuyu Li & Yunsick Sung, 2022. "Enhanced Evaluation Method of Musical Instrument Digital Interface Data based on Random Masking and Seq2Seq Model," Mathematics, MDPI, vol. 10(15), pages 1-17, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xi Chen & Enlu Zhou, 2015. "Population model-based optimization," Journal of Global Optimization, Springer, vol. 63(1), pages 125-148, September.
    2. Akimoto, Youhei & Auger, Anne & Hansen, Nikolaus, 2022. "An ODE method to prove the geometric convergence of adaptive stochastic algorithms," Stochastic Processes and their Applications, Elsevier, vol. 145(C), pages 269-307.
    3. Anastasia Spiliopoulou & Ioannis Papamichail & Markos Papageorgiou & Yannis Tyrinopoulos & John Chrysoulakis, 2017. "Macroscopic traffic flow model calibration using different optimization algorithms," Operational Research, Springer, vol. 17(1), pages 145-164, April.
    4. Deng, Xiangtian & Zhang, Yi & Jiang, Yi & Zhang, Yi & Qi, He, 2024. "A novel operation method for renewable building by combining distributed DC energy system and deep reinforcement learning," Applied Energy, Elsevier, vol. 353(PB).
    5. Zhang, Yali & Shang, Pengjian, 2019. "Multivariate multiscale distribution entropy of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 515(C), pages 72-80.
    6. A. Gouda & T. Szántai, 2008. "Rare event probabilities in stochastic networks," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 16(4), pages 441-461, December.
    7. Liang Huang & Juanjuan Zhu & Mulan Qiu & Xiaoxiang Li & Shasha Zhu, 2022. "CA-BASNet: A Building Extraction Network in High Spatial Resolution Remote Sensing Images," Sustainability, MDPI, vol. 14(18), pages 1-15, September.
    8. Reuven Y. Rubinstein, 2006. "How Many Needles are in a Haystack, or How to Solve #P-Complete Counting Problems Fast," Methodology and Computing in Applied Probability, Springer, vol. 8(1), pages 5-51, March.
    9. Ad Ridder & Bruno Tuffin, 2012. "Probabilistic Bounded Relative Error Property for Learning Rare Event Simulation Techniques," Tinbergen Institute Discussion Papers 12-103/III, Tinbergen Institute.
    10. Wu, Xin & Nie, Lei & Xu, Meng, 2017. "Robust fuzzy quality function deployment based on the mean-end-chain concept: Service station evaluation problem for rail catering services," European Journal of Operational Research, Elsevier, vol. 263(3), pages 974-995.
    11. Thirunavukkarasu, M. & Sawle, Yashwant & Lala, Himadri, 2023. "A comprehensive review on optimization of hybrid renewable energy systems using various optimization techniques," Renewable and Sustainable Energy Reviews, Elsevier, vol. 176(C).
    12. Bohteh Loh, Boris-Edmond & Nfah, Eustace Mbaka, 2024. "Techno – economic and environmental design of a three – phase hybrid renewable energy system for UNVDA Ndop Cameroon using meta-heuristic and analytical approaches," Renewable Energy, Elsevier, vol. 237(PA).
    13. El Masri, Maxime & Morio, Jérôme & Simatos, Florian, 2021. "Improvement of the cross-entropy method in high dimension for failure probability estimation through a one-dimensional projection without gradient estimation," Reliability Engineering and System Safety, Elsevier, vol. 216(C).
    14. Enlu Zhou & Shalabh Bhatnagar, 2018. "Gradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous Space," INFORMS Journal on Computing, INFORMS, vol. 30(1), pages 154-167, February.
    15. Tito Homem-de-Mello, 2007. "A Study on the Cross-Entropy Method for Rare-Event Probability Estimation," INFORMS Journal on Computing, INFORMS, vol. 19(3), pages 381-394, August.
    16. Jiang, Yubo & Zhu, Yunfang & Du, Xin & Jin, Tao, 2019. "The implicit network inferred from users’ residences and workplaces enhancing collaborative recommendation on smartphones," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 535(C).
    17. Bismut, Elizabeth & Straub, Daniel, 2021. "Optimal adaptive inspection and maintenance planning for deteriorating structural systems," Reliability Engineering and System Safety, Elsevier, vol. 215(C).
    18. Marco Caserta & Stefan Voß, 2016. "A corridor method based hybrid algorithm for redundancy allocation," Journal of Heuristics, Springer, vol. 22(4), pages 405-429, August.
    19. Ali Kadhem, Athraa & Abdul Wahab, Noor Izzri & Aris, Ishak & Jasni, Jasronita & Abdalla, Ahmed N., 2017. "Computational techniques for assessing the reliability and sustainability of electrical power systems: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 80(C), pages 1175-1186.
    20. Zheng Peng & Donghua Wu & Quan Zheng, 2013. "A Level-Value Estimation Method and Stochastic Implementation for Global Optimization," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 493-523, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:18:p:2274-:d:636683. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.