IDEAS home Printed from https://ideas.repec.org/a/vrs/bjeust/v12y2022i2p64-86n7.html
   My bibliography  Save this article

Comprehensibility and Automation: Plain Language in the Era of Digitalization

Author

Listed:
  • Üveges István

    (University of Szeged, Doctoral School in Linguistics, Egyetem utca 2 Szeged 6722, Hungary; MONTANA Knowledge Management Ltd. Szállító u. 6 Budapest 1211, Hungary, uvegesi@montana.hu)

Abstract

The current article briefly presents a pilot machine-learning experiment on the classification of official texts addressed to lay readers with the use of support vector machine as a baseline and fastText models. For this purpose, a hand-crafted corpus was used, created by the experts of the National Tax and Customs Administration of Hungary under the office’s Public Accessibility Programme. The corpus contained sentences that were paraphrased or completely rewritten by the experts to make them more readable for lay people, as well their original counter pairs. The aim was to automatically distinguish between these two classes by using supervised machine-learning algorithms. If successful, such a machine-learning-based model could be used to draw the attention of experts involved in making the texts of official bodies more comprehensible to the average reader to the potentially problematic points of a text. Therefore, the process of rephrasing such texts could be sped up drastically. Such a rephrasing (considering, above all, the needs of the average reader) can improve the overall comprehensibility of official (mostly legal) texts, and therefore supports access to justice, the transparency of governmental organizations and, most importantly, improves the rule of law in a given country.

Suggested Citation

  • Üveges István, 2022. "Comprehensibility and Automation: Plain Language in the Era of Digitalization," TalTech Journal of European Studies, Sciendo, vol. 12(2), pages 64-86, December.
  • Handle: RePEc:vrs:bjeust:v:12:y:2022:i:2:p:64-86:n:7
    DOI: 10.2478/bjes-2022-0012
    as

    Download full text from publisher

    File URL: https://doi.org/10.2478/bjes-2022-0012
    Download Restriction: no

    File URL: https://libkey.io/10.2478/bjes-2022-0012?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:bjeust:v:12:y:2022:i:2:p:64-86:n:7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.