Author
Listed:
- Wenting Yi
(Centre for Advances in Reliability and Safety (CAiRS), Hong Kong SAR 999077, China)
- Siqi Bu
(Centre for Advances in Reliability and Safety (CAiRS), Hong Kong SAR 999077, China
Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University, Hong Kong SAR 999077, China)
- Hiu-Hung Lee
(Centre for Advances in Reliability and Safety (CAiRS), Hong Kong SAR 999077, China)
- Chun-Hung Chan
(Centre for Advances in Reliability and Safety (CAiRS), Hong Kong SAR 999077, China)
Abstract
Manifold learning-based approaches have emerged as prominent techniques for dimensionality reduction. Among these methods, t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP) stand out as two of the most widely used and effective approaches. While both methods share similar underlying procedures, empirical observations indicate two distinctive properties: global data structure preservation and computational efficiency. However, the underlying mathematical principles behind these distinctions remain elusive. To address this gap, this study presents a comparative analysis of the subprocesses involved in these methods, aiming to elucidate the mathematical mechanisms underlying the observed distinctions. By meticulously examining the equation formulations, the mathematical mechanisms contributing to global data structure preservation and computational efficiency are elucidated. To validate the theoretical analysis, data are collected through a laboratory experiment, and an open-source dataset is utilized for validation across different datasets. The consistent alignment of results obtained from both balanced and unbalanced datasets robustly confirms the study’s findings. The insights gained from this study provide a deeper understanding of the mathematical underpinnings of t-SNE and UMAP, enabling more informed and effective use of these dimensionality reduction techniques in various applications, such as anomaly detection, natural language processing, and bioinformatics.
Suggested Citation
Wenting Yi & Siqi Bu & Hiu-Hung Lee & Chun-Hung Chan, 2024.
"Comparative Analysis of Manifold Learning-Based Dimension Reduction Methods: A Mathematical Perspective,"
Mathematics, MDPI, vol. 12(15), pages 1-21, July.
Handle:
RePEc:gam:jmathe:v:12:y:2024:i:15:p:2388-:d:1447130
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:15:p:2388-:d:1447130. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.