IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i19p3526-d927076.html
   My bibliography  Save this article

Surfing the Modeling of pos Taggers in Low-Resource Scenarios

Author

Listed:
  • Manuel Vilares Ferro

    (Department of Computer Science, University of Vigo, Edificio Politécnico, As Lagoas s/n, 32004 Ourense, Spain
    These authors contributed equally to this work.)

  • Víctor M. Darriba Bilbao

    (Department of Computer Science, University of Vigo, Edificio Politécnico, As Lagoas s/n, 32004 Ourense, Spain
    These authors contributed equally to this work.)

  • Francisco J. Ribadas Pena

    (Department of Computer Science, University of Vigo, Edificio Politécnico, As Lagoas s/n, 32004 Ourense, Spain
    These authors contributed equally to this work.)

  • Jorge Graña Gil

    (Department of Computer Science, Faculty of Informatics, University of A Coruña, 15071 A Coruña, Spain
    These authors contributed equally to this work.)

Abstract

The recent trend toward the application of deep structured techniques has revealed the limits of huge models in natural language processing. This has reawakened the interest in traditional machine learning algorithms, which have proved still to be competitive in certain contexts, particularly in low-resource settings. In parallel, model selection has become an essential task to boost performance at reasonable cost, even more so when we talk about processes involving domains where the training and/or computational resources are scarce. Against this backdrop, we evaluate the early estimation of learning curves as a practical mechanism for selecting the most appropriate model in scenarios characterized by the use of non-deep learners in resource-lean settings. On the basis of a formal approximation model previously evaluated under conditions of wide availability of training and validation resources, we study the reliability of such an approach in a different and much more demanding operational environment. Using as a case study the generation of pos taggers for Galician, a language belonging to the Western Ibero-Romance group, the experimental results are consistent with our expectations.

Suggested Citation

  • Manuel Vilares Ferro & Víctor M. Darriba Bilbao & Francisco J. Ribadas Pena & Jorge Graña Gil, 2022. "Surfing the Modeling of pos Taggers in Low-Resource Scenarios," Mathematics, MDPI, vol. 10(19), pages 1-17, September.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3526-:d:927076
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/19/3526/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/19/3526/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3526-:d:927076. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.