Subsampling and Aggregation: A Solution to the Scalability Problem in Distance-Based Prediction for Mixed-Type Data
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Grané, Aurea & Salini, Silvia & Verdolini, Elena, 2021. "Robust multivariate analysis for mixed-type data: Novel algorithm and its practical application in socio-economic research," Socio-Economic Planning Sciences, Elsevier, vol. 73(C).
- Eva Boj & Adrià Caballé & Pedro Delicado & Anna Esteve & Josep Fortiana, 2016. "Global and local distance-based generalized linear models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 170-195, March.
- A. R. de Leon & A. Soo & T. Williamson, 2011. "Classification with discrete and continuous variables via general mixed-data models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(5), pages 1021-1032, February.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Alban Mbina Mbina & Guy Martial Nkiet & Fulgence Eyi Obiang, 2019. "Variable selection in discriminant analysis for mixed continuous-binary variables and several groups," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 773-795, September.
- Leila Amiri & Mojtaba Khazaei & Mojtaba Ganjali, 2017. "General location model with factor analyzer covariance matrix structure and its applications," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 593-609, September.
- Miguel Angel Ortíz-Barrios & Matias Garcia-Constantino & Chris Nugent & Isaac Alfaro-Sarmiento, 2022. "A Novel Integration of IF-DEMATEL and TOPSIS for the Classifier Selection Problem in Assistive Technology Adoption for People with Dementia," IJERPH, MDPI, vol. 19(3), pages 1-31, January.
- Peiró-Signes, Ángel & Segarra-Oña, Marival & Trull-Domínguez, Óscar & Sánchez-Planelles, Joaquín, 2022. "Exposing the ideal combination of endogenous–exogenous drivers for companies’ ecoinnovative orientation: Results from machine-learning methods," Socio-Economic Planning Sciences, Elsevier, vol. 79(C).
- Beibei Yuan & Willem Heiser & Mark Rooij, 2019. "The δ-Machine: Classification Based on Distances Towards Prototypes," Journal of Classification, Springer;The Classification Society, vol. 36(3), pages 442-470, October.
- S. Barahona & P. Centella & X. Gual-Arnau & M. V. Ibáñez & A. Simó, 2020. "Supervised classification of geometrical objects by integrating currents and functional data analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 637-660, September.
- Galati, Antonino & Coticchio, Alessandro & Peiró-Signes, Ángel, 2023. "Identifying the factors affecting citizens' willingness to participate in urban forest governance: Evidence from the municipality of Palermo, Italy," Forest Policy and Economics, Elsevier, vol. 155(C).
- Aurea Grané & Alpha A. Sow-Barry, 2021. "Visualizing Profiles of Large Datasets of Weighted and Mixed Data," Mathematics, MDPI, vol. 9(8), pages 1-20, April.
- Bhat, Chandra R., 2015. "A new generalized heterogeneous data model (GHDM) to jointly model mixed types of dependent variables," Transportation Research Part B: Methodological, Elsevier, vol. 79(C), pages 50-77.
More about this item
Keywords
big data; classification; dissimilarities; ensemble; generalized linear model; Gower’s metric; machine learning;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:18:p:2247-:d:634318. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.