Scalable subsampling: computation, aggregation and inference
Author
Abstract
Suggested Citation
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Tao Zou & Xian Li & Xuan Liang & Hansheng Wang, 2021. "On the Subbagging Estimation for Massive Data," Papers 2103.00631, arXiv.org.
- Srijan Sengupta & Stanislav Volgushev & Xiaofeng Shao, 2016. "A Subsampled Double Bootstrap for Massive Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1222-1232, July.
- Ariel Kleiner & Ameet Talwalkar & Purnamrita Sarkar & Michael I. Jordan, 2014. "A scalable bootstrap for massive data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(4), pages 795-816, September.
- Patrice Bertail & Emilie Chautru & Stephan Clémençon, 2017. "Empirical Processes in Survey Sampling with (Conditional) Poisson Designs," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(1), pages 97-111, March.
- Lin, N. & Xi, R., 2010. "Fast surrogates of U-statistics," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 16-24, January.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Guangbao Guo & Yue Sun & Xuejun Jiang, 2020. "A partitioned quasi-likelihood for distributed statistical inference," Computational Statistics, Springer, vol. 35(4), pages 1577-1596, December.
- Xuejun Ma & Shaochen Wang & Wang Zhou, 2022. "Statistical inference in massive datasets by empirical likelihood," Computational Statistics, Springer, vol. 37(3), pages 1143-1164, July.
- Ma, Xuejun & Wang, Shaochen & Zhou, Wang, 2021. "Testing multivariate quantile by empirical likelihood," Journal of Multivariate Analysis, Elsevier, vol. 182(C).
- Xingcai Zhou & Zhaoyang Jing & Chao Huang, 2024. "Distributed Bootstrap Simultaneous Inference for High-Dimensional Quantile Regression," Mathematics, MDPI, vol. 12(5), pages 1-54, February.
- Dean Eckles & Maurits Kaptein, 2019. "Bootstrap Thompson Sampling and Sequential Decision Problems in the Behavioral Sciences," SAGE Open, , vol. 9(2), pages 21582440198, June.
- Badruddoza, Syed & Amin, Modhurima & McCluskey, Jill, 2019. "Assessing the Importance of an Attribute in a Demand SystemStructural Model versus Machine Learning," Working Papers 2019-5, School of Economic Sciences, Washington State University.
- Olhede, Sofia C. & Wolfe, Patrick J., 2018. "The future of statistics and data science," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 46-50.
- Sara Franceschi & Rosa Maria Di Biase & Agnese Marcelli & Lorenzo Fattorini, 2022. "Some Empirical Results on Nearest-Neighbour Pseudo-populations for Resampling from Spatial Populations," Stats, MDPI, vol. 5(2), pages 1-16, April.
- Shi, Chengchun & Lu, Wenbin & Song, Rui, 2018. "A massive data framework for M-estimators with cubic-rate," LSE Research Online Documents on Economics 102111, London School of Economics and Political Science, LSE Library.
- Gérard Biau & Erwan Scornet, 2016. "A random forest guided tour," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(2), pages 197-227, June.
- Batuhan Özkan & Coşkun Parim & Erhan Çene, 2023. "Predicting Countries’ Development Levels Using the Decision Tree and Random Forest Methods," EKOIST Journal of Econometrics and Statistics, Istanbul University, Faculty of Economics, vol. 0(38), pages 87-104, June.
- Mercè Crosas & Gary King & James Honaker & Latanya Sweeney, 2015. "Automating Open Science for Big Data," The ANNALS of the American Academy of Political and Social Science, , vol. 659(1), pages 260-273, May.
- Pier Luigi Conti & Fulvia Mecatti, 2022. "Resampling under Complex Sampling Designs: Roots, Development and the Way Forward," Stats, MDPI, vol. 5(1), pages 1-12, March.
- Amalan Mahendran & Helen Thompson & James M. McGree, 2023. "A model robust subsampling approach for Generalised Linear Models in big data settings," Statistical Papers, Springer, vol. 64(4), pages 1137-1157, August.
- Villoria, Nelson B. & Liu, Jing, 2018. "Using spatially explicit data to improve our understanding of land supply responses: An application to the cropland effects of global sustainable irrigation in the Americas," Land Use Policy, Elsevier, vol. 75(C), pages 411-419.
- Vaughan, Gregory, 2020. "Efficient big data model selection with applications to fraud detection," International Journal of Forecasting, Elsevier, vol. 36(3), pages 1116-1127.
- Wang, Xiaoqian & Kang, Yanfei & Hyndman, Rob J. & Li, Feng, 2023.
"Distributed ARIMA models for ultra-long time series,"
International Journal of Forecasting, Elsevier, vol. 39(3), pages 1163-1184.
- Xiaoqian Wang & Yanfei Kang & Rob J Hyndman & Feng Li, 2020. "Distributed ARIMA Models for Ultra-long Time Series," Monash Econometrics and Business Statistics Working Papers 29/20, Monash University, Department of Econometrics and Business Statistics.
- Zhang, Likun & Castillo, Enrique del & Berglund, Andrew J. & Tingley, Martin P. & Govind, Nirmal, 2020. "Computing confidence intervals from massive data via penalized quantile smoothing splines," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
- Milica Maricic & Jose A. Egea & Veljko Jeremic, 2019. "A Hybrid Enhanced Scatter Search—Composite I-Distance Indicator (eSS-CIDI) Optimization Approach for Determining Weights Within Composite Indicators," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 144(2), pages 497-537, July.
- Yang, Xinfeng & Yan, Xiaodong & Huang, Jian, 2019. "High-dimensional integrative analysis with homogeneity and sparsity recovery," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
More about this item
Keywords
Bagging; Big data; Bootstrap; Distributed inference; Subagging;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:biomet:v:111:y:2024:i:1:p:347-354.. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://academic.oup.com/biomet .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.