IDEAS home Printed from https://ideas.repec.org/a/bla/stanee/v72y2018i3p339-353.html
   My bibliography  Save this article

Language comprehension as a multi‐label classification problem

Author

Listed:
  • Konstantin Sering
  • Petar Milin
  • R. Harald Baayen

Abstract

The initial stage of language comprehension is a multilabel classification problem. Listeners or readers, presented with an utterance, need to discriminate between the intended words and the tens of thousands of other words they know. We propose to address this problem by pairing two networks. The first network is independently learned with the Rescorla Wagner model. The second network is based on the first network and learned with the rule of Widrow and Hoff. The first network has to recover from sublexical input features the meanings encoded in the language signal, resulting in a vector of activations over the lexicon. The second network takes this vector as input and further reduces uncertainty about the intended message. Classification performance for a lexicon with 52,000 entries is good. The model also correctly predicts several aspects of human language comprehension. By rejecting the traditional linguistic assumption that language is a (de)compositional system, and by instead espousing a discriminative approach, a more parsimonious yet highly effective functional characterization of the initial stage of language comprehension is obtained.

Suggested Citation

  • Konstantin Sering & Petar Milin & R. Harald Baayen, 2018. "Language comprehension as a multi‐label classification problem," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 72(3), pages 339-353, August.
  • Handle: RePEc:bla:stanee:v:72:y:2018:i:3:p:339-353
    DOI: 10.1111/stan.12134
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/stan.12134
    Download Restriction: no

    File URL: https://libkey.io/10.1111/stan.12134?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Simon N. Wood & Zheyuan Li & Gavin Shaddick & Nicole H. Augustin, 2017. "Generalized Additive Models for Gigadata: Modeling the U.K. Black Smoke Network Daily Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1199-1210, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. R. Harald Baayen & Yu-Ying Chuang & Elnaz Shafaei-Bajestan & James P. Blevins, 2019. "The Discriminative Lexicon: A Unified Computational Model for the Lexicon and Lexical Processing in Comprehension and Production Grounded Not in (De)Composition but in Linear Discriminative Learning," Complexity, Hindawi, vol. 2019, pages 1-39, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jonathan Berrisch & Florian Ziel, 2023. "Multivariate Probabilistic CRPS Learning with an Application to Day-Ahead Electricity Prices," Papers 2303.10019, arXiv.org, revised Feb 2024.
    2. Du, Qianqian & Mieno, Taro & Bullock, David & Edge, Brittani, 2021. "Economically Optimal Nitrogen Side-dressing Based on Vegetation Indices from Satellite Images Through On-farm Experiments," Land, Farm & Agribusiness Management Department 316596, Harper Adams University, Land, Farm & Agribusiness Management Department.
    3. Oskar Allerbo & Rebecka Jörnsten, 2022. "Flexible, non-parametric modeling using regularized neural networks," Computational Statistics, Springer, vol. 37(4), pages 2029-2047, September.
    4. David L. Miller & Richard Glennie & Andrew E. Seaton, 2020. "Understanding the Stochastic Partial Differential Equation Approach to Smoothing," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(1), pages 1-16, March.
    5. Frank van Berkum & Katrien Antonio & Michel Vellekoop, 2021. "Quantifying longevity gaps using micro‐level lifetime data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 548-570, April.
    6. Anne-Sophie Krah & Zoran Nikolić & Ralf Korn, 2020. "Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies," Risks, MDPI, vol. 8(1), pages 1-79, February.
    7. Simon N. Wood, 2020. "Inference and computation with generalized additive models and their extensions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 307-339, June.
    8. Anne-Sophie Krah & Zoran Nikoli'c & Ralf Korn, 2019. "Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies," Papers 1909.02182, arXiv.org.
    9. Anna Vážná & Jana Vignerová & Marek Brabec & Jan Novák & Bohuslav Procházka & Antonín Gabera & Petr Sedlak, 2022. "Influence of COVID-19-Related Restrictions on the Prevalence of Overweight and Obese Czech Children," IJERPH, MDPI, vol. 19(19), pages 1-14, September.
    10. Sonja Greven & Fabian Scheipl, 2020. "Comments on: Inference and computation with Generalized Additive Models and their extensions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 343-350, June.
    11. Du, Qianqian & Mieno, Taro & Bullock, David & Edge, Brittani, 2021. "Economically Optimal Nitrogen Side-dressing Based on Vegetation Indices from Satellite Images Through On-farm Experiments," Agri-Tech Economics Papers 316596, Harper Adams University, Land, Farm & Agribusiness Management Department.
    12. Calabrese, Raffaella & Dombrowski, Timothy & Mandel, Antoine & Pace, R. Kelley & Zanin, Luca, 2024. "Impacts of extreme weather events on mortgage risks and their evolution under climate change: A case study on Florida," European Journal of Operational Research, Elsevier, vol. 314(1), pages 377-392.
    13. Aoife K. Hurley & James Sweeney, 2024. "Irish Property Price Estimation Using A Flexible Geo-spatial Smoothing Approach: What is the Impact of an Address?," The Journal of Real Estate Finance and Economics, Springer, vol. 68(3), pages 355-393, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:stanee:v:72:y:2018:i:3:p:339-353. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0039-0402 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.