IDEAS home Printed from https://ideas.repec.org/a/eur/ejlsjr/53.html
   My bibliography  Save this article

Equivalent Malay-Arabic Data Corpus Collection

Author

Listed:
  • Taj Rijal Muhamad Romli
  • Hasnah Mohamad

Abstract

This paper aims to introduce a search strategy and collecting comparable sentences of Arab-Malay corpus data. This method was introduced for the use of students, researchers and amateur translators to search and compare the structure of sentences in Arabic and Malay. The first stage is to collect data corpus with high impact titles from the press and must be able to enlarge the scope of study as stated by Maia (2003). The second stage is to search using the specified key words based on selected high-impact titles such as the Football World Cup year 2010 and 2014. Data search is by using Webcorp engine http://www.webcorp.org.uk/live/ corpus and also open database Google https://www.google.com. The third stage is to filter the data by using Aker et.al (2012) and Braschler's (1998) method based on similar story, related story and similar aspects. At the fourth stage every category is measured by Guidere's (2002) equivalence strength which is strong comparability (SC), medium (MC) and weak (WC). At the last stage comparable sentences between the two languages are compiled in parallel according to Mona Baker’s (1992) level of grouping which are sentence level, combination of words, grammatical, pragmatic and textual level. The result from data analysis based on Mona Baker and Vinay - Darbelnet’s (1995) comparable theory proved the existence of some sentences in large quantities are on the same level of comparability from the point of information delivery. This can be used as the basis of additional evidence concerning the validity of 'universal theory.' in the science of translation.

Suggested Citation

  • Taj Rijal Muhamad Romli & Hasnah Mohamad, 2021. "Equivalent Malay-Arabic Data Corpus Collection," European Journal of Language and Literature Studies Articles, Revistia Research and Publishing, vol. 2, January -.
  • Handle: RePEc:eur:ejlsjr:53
    DOI: 10.26417/ejls.v4i1.p65-73
    as

    Download full text from publisher

    File URL: https://revistia.com/index.php/ejls/article/view/793
    Download Restriction: no

    File URL: https://revistia.com/files/articles/ejls_v2_i1_16/Taj.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.26417/ejls.v4i1.p65-73?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    software; comparable; parallel;
    All these keywords.

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eur:ejlsjr:53. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Revistia Research and Publishing (email available below). General contact details of provider: https://revistia.com/index.php/ejls .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.