Scalable subsampling: computation, aggregation and inference
References listed on IDEAS
- Ariel Kleiner & Ameet Talwalkar & Purnamrita Sarkar & Michael I. Jordan, 2014. "A scalable bootstrap for massive data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(4), pages 795-816, September.
- Patrice Bertail & Emilie Chautru & Stephan Clémençon, 2017. "Empirical Processes in Survey Sampling with (Conditional) Poisson Designs," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(1), pages 97-111, March.
- Lin, N. & Xi, R., 2010. "Fast surrogates of U-statistics," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 16-24, January.
- Tao Zou & Xian Li & Xuan Liang & Hansheng Wang, 2021. "On the Subbagging Estimation for Massive Data," Papers 2103.00631, arXiv.org.
- Srijan Sengupta & Stanislav Volgushev & Xiaofeng Shao, 2016. "A Subsampled Double Bootstrap for Massive Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1222-1232, July.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.
- Guangbao Guo & Yue Sun & Xuejun Jiang, 2020. "A partitioned quasi-likelihood for distributed statistical inference," Computational Statistics, Springer, vol. 35(4), pages 1577-1596, December.
- Baolin Chen & Shanshan Song & Yong Zhou, 2024. "Estimation and testing of expectile regression with efficient subsampling for massive data," Statistical Papers, Springer, vol. 65(9), pages 5593-5613, December.
- Ma, Xuejun & Wang, Shaochen & Zhou, Wang, 2021. "Testing multivariate quantile by empirical likelihood," Journal of Multivariate Analysis, Elsevier, vol. 182(C).
- Bingyao Huang & Yanyan Liu & Liuhua Peng, 2023. "Distributed inference for two‐sample U‐statistics in massive data analysis," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(3), pages 1090-1115, September.
- Xuejun Ma & Shaochen Wang & Wang Zhou, 2022. "Statistical inference in massive datasets by empirical likelihood," Computational Statistics, Springer, vol. 37(3), pages 1143-1164, July.
- Xingcai Zhou & Zhaoyang Jing & Chao Huang, 2024. "Distributed Bootstrap Simultaneous Inference for High-Dimensional Quantile Regression," Mathematics, MDPI, vol. 12(5), pages 1-53, February.
- Pier Luigi Conti & Fulvia Mecatti, 2022. "Resampling under Complex Sampling Designs: Roots, Development and the Way Forward," Stats, MDPI, vol. 5(1), pages 1-12, March.
- Amalan Mahendran & Helen Thompson & James M. McGree, 2023. "A model robust subsampling approach for Generalised Linear Models in big data settings," Statistical Papers, Springer, vol. 64(4), pages 1137-1157, August.
- Villoria, Nelson B. & Liu, Jing, 2018. "Using spatially explicit data to improve our understanding of land supply responses: An application to the cropland effects of global sustainable irrigation in the Americas," Land Use Policy, Elsevier, vol. 75(C), pages 411-419.
- Vaughan, Gregory, 2020. "Efficient big data model selection with applications to fraud detection," International Journal of Forecasting, Elsevier, vol. 36(3), pages 1116-1127.
- Wang, Xiaoqian & Kang, Yanfei & Hyndman, Rob J. & Li, Feng, 2023. "Distributed ARIMA models for ultra-long time series," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1163-1184.
- Xiaoqian Wang & Yanfei Kang & Rob J Hyndman & Feng Li, 2020. "Distributed ARIMA Models for Ultra-long Time Series," Monash Econometrics and Business Statistics Working Papers 29/20, Monash University, Department of Econometrics and Business Statistics.
- Yang, Xinfeng & Yan, Xiaodong & Huang, Jian, 2019. "High-dimensional integrative analysis with homogeneity and sparsity recovery," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
- Fang, Jianglin, 2023. "A split-and-conquer variable selection approach for high-dimensional general semiparametric models with massive data," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
- Tang, Lu & Zhou, Ling & Song, Peter X.-K., 2020. "Distributed simultaneous inference in generalized linear models via confidence distribution," Journal of Multivariate Analysis, Elsevier, vol. 176(C).
- Yves G. Berger, 2023. "Unconditional empirical likelihood approach for analytic use of public survey data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 383-410, March.
- Beate Franke & Jean-François Plante & Ribana Roscher & En-shiun Annie Lee & Cathal Smyth & Armin Hatefi & Fuqi Chen & Einat Gil & Alexander Schwing & Alessandro Selvitella & Michael M. Hoffman & Roger, 2016. "Statistical Inference, Learning and Models in Big Data," International Statistical Review, International Statistical Institute, vol. 84(3), pages 371-389, December.
- Kaizhao Liu & Jose Blanchet & Lexing Ying & Yiping Lu, 2024. "Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty," Papers 2404.19145, arXiv.org, revised Apr 2024.
- Dean Eckles & Maurits Kaptein, 2019. "Bootstrap Thompson Sampling and Sequential Decision Problems in the Behavioral Sciences," SAGE Open, vol. 9(2), pages 21582440198, June.
- Badruddoza, Syed & Amin, Modhurima & McCluskey, Jill, 2019. "Assessing the Importance of an Attribute in a Demand System: Structural Model versus Machine Learning," Working Papers 2019-5, School of Economic Sciences, Washington State University.
- Olhede, Sofia C. & Wolfe, Patrick J., 2018. "The future of statistics and data science," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 46-50.
More about this item
Keywords
Bagging; Big data; Bootstrap; Distributed inference; Subagging
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:biomet:v:111:y:2024:i:1:p:347-354. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press. General contact details of provider: https://academic.oup.com/biomet.