IDEAS home Printed from https://ideas.repec.org/p/azt/cemmap/20-22.html
   My bibliography  Save this paper

Graphical model inference with external network data

Author

Listed:
  • Jack Jewson
  • Li Li
  • Laura Battaglia
  • Stephen Hansen
  • David Rossell
  • Piotr Zwiernik

Abstract

A frequent challenge when using graphical models in applications is that the sample size is limited relative to the number of parameters to be learned. Our motivation stems from applications where one has external data, in the form of networks between variables, that provides valuable information to help improve inference. Specifically, we depict the relation between COVID-19 cases and social and geographical network data, and between stock market returns and economic and policy networks extracted from text data. We propose a graphical LASSO framework where likelihood penalties are guided by the external network data. We also propose a spike-and-slab prior framework that depicts how partial correlations depend on the networks, which helps interpret the fitted graphical model and its relationship to the network. We develop computational schemes and software implementations in R and probabilistic programming languages. Our applications show how incorporating network data can significantly improve interpretation, statistical accuracy, and out-of-sample prediction, in some instances using significantly sparser graphical models than would have otherwise been estimated.

Suggested Citation

  • Jack Jewson & Li Li & Laura Battaglia & Stephen Hansen & David Rossell & Piotr Zwiernik, 2022. "Graphical model inference with external network data," CeMMAP working papers 20/22, Institute for Fiscal Studies.
  • Handle: RePEc:azt:cemmap:20/22
    DOI: 10.47004/wp.cem.2022.2022
    as

    Download full text from publisher

    File URL: https://www.cemmap.ac.uk/wp-content/uploads/2022/11/CWP2022-Graphical-model-inference-with-external-network-data.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.47004/wp.cem.2022.2022?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    2. Jiahua Chen & Zehua Chen, 2008. "Extended Bayesian information criteria for model selection with large model spaces," Biometrika, Biometrika Trust, vol. 95(3), pages 759-771.
    3. Carpenter, Bob & Gelman, Andrew & Hoffman, Matthew D. & Lee, Daniel & Goodrich, Ben & Betancourt, Michael & Brubaker, Marcus & Guo, Jiqiang & Li, Peter & Riddell, Allen, 2017. "Stan: A Probabilistic Programming Language," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 76(i01).
    4. Veronika Ročková & Edward I. George, 2014. "EMVS: The EM Approach to Bayesian Variable Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 828-846, June.
    5. Tarek A Hassan & Stephan Hollander & Laurence van Lent & Ahmed Tahoun, 2019. "Firm-Level Political Risk: Measurement and Effects," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(4), pages 2135-2202.
    6. Marc Senneret & Yannick Malevergne & Patrice Abry & Gerald Perrin & Laurent Jaffres, 2016. "Covariance Versus Precision Matrix Estimation for Efficient Asset Allocation," Post-Print halshs-03590388, HAL.
    7. Wang, Tao & Zhu, Lixing, 2011. "Consistent tuning parameter selection in high dimensional sparse linear regression," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1141-1151, August.
    8. Kathleen Weiss Hanley & Gerard Hoberg, 2019. "Dynamic Interpretation of Emerging Risks in the Financial Sector," The Review of Financial Studies, Society for Financial Studies, vol. 32(12), pages 4543-4603.
    9. Ming Yuan & Yi Lin, 2007. "Model selection and estimation in the Gaussian graphical model," Biometrika, Biometrika Trust, vol. 94(1), pages 19-35.
    10. Hansen, Stephen & Davis, Steven & Seminario-Amez, Cristhian, 2020. "Firm-level Risk Exposures and Stock Returns in the Wake of COVID-19," CEPR Discussion Papers 15314, C.E.P.R. Discussion Papers.
    11. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    12. Elton, Edwin J & Gruber, Martin J, 1973. "Estimating the Dependence Structure of Share Prices-Implications for Portfolio Selection," Journal of Finance, American Finance Association, vol. 28(5), pages 1203-1232, December.
    13. Yingying Fan & Cheng Yong Tang, 2013. "Tuning parameter selection in high dimensional penalized likelihood," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(3), pages 531-552, June.
    14. Peter Muller & Giovanni Parmigiani & Christian Robert & Judith Rousseau, 2004. "Optimal Sample Size for Multiple Testing: The Case of Gene Expression Microarrays," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 990-1001, December.
    15. Scott R. Baker & Nicholas Bloom & Steven J. Davis & Kyle J. Kost, 2019. "Policy News and Stock Market Volatility," NBER Working Papers 25720, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yunxiao Chen & Xiaoou Li & Jingchen Liu & Zhiliang Ying, 2017. "Regularized Latent Class Analysis with Application in Cognitive Diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 660-692, September.
    2. Tae-Hwy Lee & Ekaterina Seregina, 2020. "Learning from Forecast Errors: A New Approach to Forecast Combination," Working Papers 202024, University of California at Riverside, Department of Economics.
    3. Katayama, Shota & Imori, Shinpei, 2014. "Lasso penalized model selection criteria for high-dimensional multivariate linear regression analysis," Journal of Multivariate Analysis, Elsevier, vol. 132(C), pages 138-150.
    4. Faccini, Renato & Matin, Rastin & Skiadopoulos, George, 2023. "Dissecting climate risks: Are they reflected in stock prices?," Journal of Banking & Finance, Elsevier, vol. 155(C).
    5. Chen, Yunxiao & Li, Xiaoou & Liu, Jingchen & Ying, Zhiliang, 2017. "Regularized latent class analysis with application in cognitive diagnosis," LSE Research Online Documents on Economics 103182, London School of Economics and Political Science, LSE Library.
    6. Burman, Prabir & Paul, Debashis, 2017. "Smooth predictive model fitting in regression," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 165-179.
    7. Tengfei Zhang, 2020. "Manager Uncertainty and Cross-Sectional Stock Returns," 2020 Papers pzh934, Job Market Papers.
    8. Chris Florakis & Christodoulos Louca & Roni Michaely & Michael Weber, 2020. "Cybersecurity Risk," Working Papers 2020-178, Becker Friedman Institute for Research In Economics.
    9. Dim, Chukwuma & Koerner, Kevin & Wolski, Marcin & Zwart, Sanne, 2022. "Hot off the press: News-implied sovereign default risk," EIB Working Papers 2022/06, European Investment Bank (EIB).
    10. Emre Demirkaya & Yang Feng & Pallavi Basu & Jinchi Lv, 2022. "Large-scale model selection in misspecified generalized linear models [Information theory and an extension of the maximum likelihood principle]," Biometrika, Biometrika Trust, vol. 109(1), pages 123-136.
    11. Jiao, Anqi & Ma, Han & Ren, Honglin, 2022. "Political catastrophe and firm strategies: Evidence from the capitol riot," Finance Research Letters, Elsevier, vol. 48(C).
    12. Steven J. Davis, 2019. "Rising Policy Uncertainty," NBER Working Papers 26243, National Bureau of Economic Research, Inc.
    13. Francisco Peñaranda & Enrique Sentana, 2024. "Portfolio management with big data," Working Papers wp2024_2411, CEMFI.
    14. Ravi Jagannathan & Tongshu Ma, 2003. "Risk Reduction in Large Portfolios: Why Imposing the Wrong Constraints Helps," Journal of Finance, American Finance Association, vol. 58(4), pages 1651-1683, August.
    15. Bianconi, Marcelo & Esposito, Federico & Sammon, Marco, 2021. "Trade policy uncertainty and stock returns," Journal of International Money and Finance, Elsevier, vol. 119(C).
    16. Gonzalo García-Donato & María Eugenia Castellanos & Alicia Quirós, 2021. "Bayesian Variable Selection with Applications in Health Sciences," Mathematics, MDPI, vol. 9(3), pages 1-16, January.
    17. Zhang, Tonglin, 2024. "Variables selection using L0 penalty," Computational Statistics & Data Analysis, Elsevier, vol. 190(C).
    18. García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
    19. Jonathan Fletcher, 2009. "Risk Reduction and Mean‐Variance Analysis: An Empirical Investigation," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 36(7‐8), pages 951-971, September.
    20. Chen, J. & Li, D. & Li, Y. & Linton, O. B., 2022. "Estimating Time-Varying Networks for High-Dimensional Time Series," Janeway Institute Working Papers 2231, Faculty of Economics, University of Cambridge.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:azt:cemmap:20/22. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Dermot Watson (email available below). General contact details of provider: https://edirc.repec.org/data/ifsssuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.