IDEAS home Printed from https://ideas.repec.org/a/taf/japsta/v42y2015i5p1133-1147.html
   My bibliography  Save this article

Multi-relational learning via hierarchical nonparametric Bayesian collective matrix factorization

Author

Listed:
  • Hongxia Yang
  • Aurelie Lozano

Abstract

Relational learning addresses problems where the data come from multiple sources and are linked together through complex relational networks. Two important goals are pattern discovery (e.g. by (co)-clustering) and predicting unknown values of a relation, given a set of entities and observed relations among entities. In the presence of multiple relations, combining information from different but related relations can lead to better insights and improved prediction. For this purpose, we propose a nonparametric hierarchical Bayesian model that improves on existing collaborative factorization models and frames a large number of relational learning problems. The proposed model naturally incorporates (co)-clustering and prediction analysis in a single unified framework, and allows for the estimation of entire missing row or column vectors. We develop an efficient Gibbs algorithm and a hybrid Gibbs using Newton's method to enable fast computation in high dimensions. We demonstrate the value of our framework on simulated experiments and on two real-world problems: discovering kinship systems and predicting the authors of certain articles based on article-word co-occurrence features.

Suggested Citation

  • Hongxia Yang & Aurelie Lozano, 2015. "Multi-relational learning via hierarchical nonparametric Bayesian collective matrix factorization," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(5), pages 1133-1147, May.
  • Handle: RePEc:taf:japsta:v:42:y:2015:i:5:p:1133-1147
    DOI: 10.1080/02664763.2014.999028
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/02664763.2014.999028
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/02664763.2014.999028?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Teh, Yee Whye & Jordan, Michael I. & Beal, Matthew J. & Blei, David M., 2006. "Hierarchical Dirichlet Processes," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1566-1581, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michelle Dietzen & Haoran Zhai & Olivia Lucas & Oriol Pich & Christopher Barrington & Wei-Ting Lu & Sophia Ward & Yanping Guo & Robert E. Hynds & Simone Zaccaria & Charles Swanton & Nicholas McGranaha, 2024. "Replication timing alterations are associated with mutation acquisition during breast and lung cancer evolution," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
    2. Redivo, Edoardo & Nguyen, Hien D. & Gupta, Mayetri, 2020. "Bayesian clustering of skewed and multimodal data using geometric skewed normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    3. Parvin Ahmadi & Iman Gholampour & Mahmoud Tabandeh, 2018. "Cluster-based sparse topical coding for topic mining and document clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 537-558, September.
    4. Jeffrey L. Furman & Florenta Teodoridis, 2020. "Automation, Research Technology, and Researchers’ Trajectories: Evidence from Computer Science and Electrical Engineering," Organization Science, INFORMS, vol. 31(2), pages 330-354, March.
    5. Shu-Ping Shi & Yong Song, 2012. "Identifying Speculative Bubbles with an Infinite Hidden Markov Model," Working Paper series 26_12, Rimini Centre for Economic Analysis.
    6. Jin, Xin & Maheu, John M. & Yang, Qiao, 2022. "Infinite Markov pooling of predictive distributions," Journal of Econometrics, Elsevier, vol. 228(2), pages 302-321.
    7. Gustaf Bellstam & Sanjai Bhagat & J. Anthony Cookson, 2021. "A Text-Based Analysis of Corporate Innovation," Management Science, INFORMS, vol. 67(7), pages 4004-4031, July.
    8. Hassan Akell & Farkhondeh-Alsadat Sajadi & Iraj Kazemi, 2023. "Construction of Jointly Distributed Random Samples Drawn from the Beta Two-Parameter Process," Methodology and Computing in Applied Probability, Springer, vol. 25(3), pages 1-12, September.
    9. Bauwens, Luc & Dufays, Arnaud & Rombouts, Jeroen V.K., 2014. "Marginal likelihood for Markov-switching and change-point GARCH models," Journal of Econometrics, Elsevier, vol. 178(P3), pages 508-522.
    10. Robert M. Dorazio & Bhramar Mukherjee & Li Zhang & Malay Ghosh & Howard L. Jelks & Frank Jordan, 2008. "Modeling Unobserved Sources of Heterogeneity in Animal Abundance Using a Dirichlet Process Prior," Biometrics, The International Biometric Society, vol. 64(2), pages 635-644, June.
    11. Jeong Hwan Kook & Michele Guindani & Linlin Zhang & Marina Vannucci, 2019. "NPBayes-fMRI: Non-parametric Bayesian General Linear Models for Single- and Multi-Subject fMRI Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(1), pages 3-21, April.
    12. Yong Song, 2014. "Modelling Regime Switching And Structural Breaks With An Infinite Hidden Markov Model," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(5), pages 825-842, August.
    13. Lu Shaochuan, 2023. "Scalable Bayesian Multiple Changepoint Detection via Auxiliary Uniformisation," International Statistical Review, International Statistical Institute, vol. 91(1), pages 88-113, April.
    14. Ying Liao & Yisha Xiang & Min Wang, 2021. "Health assessment and prognostics based on higher‐order hidden semi‐Markov models," Naval Research Logistics (NRL), John Wiley & Sons, vol. 68(2), pages 259-276, March.
    15. Markus Jochmann, 2015. "Modeling U.S. Inflation Dynamics: A Bayesian Nonparametric Approach," Econometric Reviews, Taylor & Francis Journals, vol. 34(5), pages 537-558, May.
    16. Evelina Gabasova & John Reid & Lorenz Wernisch, 2017. "Clusternomics: Integrative context-dependent clustering for heterogeneous datasets," PLOS Computational Biology, Public Library of Science, vol. 13(10), pages 1-29, October.
    17. Yoshi Fujiwara & Rubaiyat Islam, 2021. "Bitcoin's Crypto Flow Network," Papers 2106.11446, arXiv.org, revised Jul 2021.
    18. Occhini, Giulia & Tranos, Emmanouil & Wolf, Levi John, 2023. "Occupational segregation in the digital economy? A Natural Language Processing approach using UK Web Data," SocArXiv z8xta, Center for Open Science.
    19. Maheu, John M. & Yang, Qiao, 2016. "An infinite hidden Markov model for short-term interest rates," Journal of Empirical Finance, Elsevier, vol. 38(PA), pages 202-220.
    20. Jääskinen Väinö & Parkkinen Ville & Cheng Lu & Corander Jukka, 2014. "Bayesian clustering of DNA sequences using Markov chains and a stochastic partition model," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(1), pages 105-121, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:japsta:v:42:y:2015:i:5:p:1133-1147. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/CJAS20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.