IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v202y2024ics0047259x24000095.html
   My bibliography  Save this article

Nonlinear sufficient dimension reduction for distribution-on-distribution regression

Author

Listed:
  • Zhang, Qi
  • Li, Bing
  • Xue, Lingzhou

Abstract

We introduce a new approach to nonlinear sufficient dimension reduction in cases where both the predictor and the response are distributional data, modeled as members of a metric space. Our key step is to build universal kernels (cc-universal) on the metric spaces, which results in reproducing kernel Hilbert spaces for the predictor and response that are rich enough to characterize the conditional independence that determines sufficient dimension reduction. For univariate distributions, we construct the universal kernel using the Wasserstein distance, while for multivariate distributions, we resort to the sliced Wasserstein distance. The sliced Wasserstein distance ensures that the metric space possesses similar topological properties to the Wasserstein space, while also offering significant computation benefits. Numerical results based on synthetic data show that our method outperforms possible competing methods. The method is also applied to several data sets, including fertility and mortality data and Calgary temperature data.

Suggested Citation

  • Zhang, Qi & Li, Bing & Xue, Lingzhou, 2024. "Nonlinear sufficient dimension reduction for distribution-on-distribution regression," Journal of Multivariate Analysis, Elsevier, vol. 202(C).
  • Handle: RePEc:eee:jmvana:v:202:y:2024:i:c:s0047259x24000095
    DOI: 10.1016/j.jmva.2024.105302
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X24000095
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2024.105302?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ding, Shanshan & Cook, R. Dennis, 2015. "Tensor sliced inverse regression," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 216-231.
    2. Wei Luo & Lingzhou Xue & Jiawei Yao & Xiufan Yu, 2022. "Inverse moment methods for sufficient forecasting using high-dimensional predictors [Eigenvalue ratio test for the number of factors]," Biometrika, Biometrika Trust, vol. 109(2), pages 473-487.
    3. Chao Ying & Zhou Yu, 2022. "Fréchet sufficient dimension reduction for random objects [Some asymptotic theory for the bootstrap]," Biometrika, Biometrika Trust, vol. 109(4), pages 975-992.
    4. Dong, Yushen & Wu, Yichao, 2022. "Fréchet kernel sliced inverse regression," Journal of Multivariate Analysis, Elsevier, vol. 191(C).
    5. Fan, Jianqing & Xue, Lingzhou & Yao, Jiawei, 2017. "Sufficient forecasting using factor models," Journal of Econometrics, Elsevier, vol. 201(2), pages 292-306.
    6. Yaqing Chen & Zhenhua Lin & Hans-Georg Müller, 2023. "Wasserstein Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(542), pages 869-882, April.
    7. Wei Luo & Bing Li, 2016. "Combining eigenvalues and variation of eigenvectors for order determination," Biometrika, Biometrika Trust, vol. 103(4), pages 875-887.
    8. Elsa Cazelles & Vivien Seguy & Jérémie Bigot & Marco Cuturi & Nicolas Papadakis, 2017. "Log-PCA versus Geodesic PCA of histograms in the Wasserstein space," Working Papers 2017-85, Center for Research in Economics and Statistics.
    9. Xiufan Yu & Jiawei Yao & Lingzhou Xue, 2022. "Nonparametric Estimation and Conformal Inference of the Sufficient Forecasting With a Diverging Number of Factors," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 342-354, January.
    10. Laya Ghodrati & Victor M Panaretos, 2022. "Distribution-on-distribution regression via optimal transport maps [Upper and lower risk bounds for estimating the Wasserstein barycenter of random measures on the real line]," Biometrika, Biometrika Trust, vol. 109(4), pages 957-974.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yu, Xiufan & Yao, Jiawei & Xue, Lingzhou, 2024. "Power enhancement for testing multi-factor asset pricing models via Fisher’s method," Journal of Econometrics, Elsevier, vol. 239(2).
    2. Florian Gunsilius & Meng Hsuan Hsieh & Myung Jin Lee, 2022. "Tangential Wasserstein Projections," Papers 2207.14727, arXiv.org, revised Aug 2022.
    3. Tadao Hoshino, 2024. "Functional Spatial Autoregressive Models," Papers 2402.14763, arXiv.org, revised Oct 2024.
    4. Shih‐Hao Huang & Kerby Shedden & Hsin‐wen Chang, 2023. "Inference for the dimension of a regression relationship using pseudo‐covariates," Biometrics, The International Biometric Society, vol. 79(3), pages 2394-2403, September.
    5. Huang, Dashan & Jiang, Fuwei & Li, Kunpeng & Tong, Guoshi & Zhou, Guofu, 2023. "Are bond returns predictable with real-time macro data?," Journal of Econometrics, Elsevier, vol. 237(2).
    6. Tae-Hwy Lee & Ekaterina Seregina, 2020. "Learning from Forecast Errors: A New Approach to Forecast Combination," Working Papers 202024, University of California at Riverside, Department of Economics.
    7. Boudt, Kris & Cornilly, Dries & Verdonck, Tim, 2020. "Nearest comoment estimation with unobserved factors," Journal of Econometrics, Elsevier, vol. 217(2), pages 381-397.
    8. Wang, Fa, 2022. "Maximum likelihood estimation and inference for high dimensional generalized factor models with application to factor-augmented regressions," Journal of Econometrics, Elsevier, vol. 229(1), pages 180-200.
    9. Wang, Pei & Yin, Xiangrong & Yuan, Qingcong & Kryscio, Richard, 2021. "Feature filter for estimating central mean subspace and its sparse solution," Computational Statistics & Data Analysis, Elsevier, vol. 163(C).
    10. Barigozzi, Matteo & Hallin, Marc & Luciani, Matteo & Zaffaroni, Paolo, 2024. "Inferential theory for generalized dynamic factor models," Journal of Econometrics, Elsevier, vol. 239(2).
    11. Hayley Randall & Andreas Artemiou & Xingye Qiao, 2021. "Sufficient dimension reduction based on distance‐weighted discrimination," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(4), pages 1186-1211, December.
    12. Wang, Qin & Xue, Yuan, 2021. "An ensemble of inverse moment estimators for sufficient dimension reduction," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    13. Seungchul Baek & Yen‐Yi Ho & Yanyuan Ma, 2020. "Using sufficient direction factor model to analyze latent activities associated with breast cancer survival," Biometrics, The International Biometric Society, vol. 76(4), pages 1340-1350, December.
    14. Nicolas Chatelais & Menzie Chinn & Arthur Stalla-Bourdillon, 2022. "Macroeconomic Forecasting Using Filtered Signals from a Stock Market Cross Section," Working papers 903, Banque de France.
    15. Fang, Fang & Yu, Zhou, 2020. "Model averaging assisted sufficient dimension reduction," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    16. Bernard, Gaspard & Verdebout, Thomas, 2024. "On testing the equality of latent roots of scatter matrices under ellipticity," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    17. Pircalabelu, Eugen & Artemiou, Andreas, 2021. "Graph informed sliced inverse regression," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    18. Zhang, Xin & Wang, Chong & Wu, Yichao, 2018. "Functional envelope for model-free sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 163(C), pages 37-50.
    19. Rajveer Jat & Daanish Padha, 2024. "Kernel Three Pass Regression Filter," Papers 2405.07292, arXiv.org, revised Jun 2024.
    20. Tan, Xilong & Tao, Yubo, 2023. "Trend-based forecast of cryptocurrency returns," Economic Modelling, Elsevier, vol. 124(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:202:y:2024:i:c:s0047259x24000095. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.