IDEAS home Printed from https://ideas.repec.org/a/igg/jthi00/v2y2006i1p39-50.html
   My bibliography  Save this article

Chinese POS Disambiguation and Unknown Word Guessing with Lexicalized HMMs

Author

Listed:
  • Guohong Fu

    (The University of Hong Kong, Hong Kong)

  • Kang-Kwong Luke

    (The University of Hong Kong, Hong Kong)

Abstract

This article presents a lexicalized HMM-based approach to Chinese part-of-speech (POS) disambiguation and unknown word guessing (UWG). In order to explore word-internal morphological features for Chinese POS tagging, four types of pattern tags are defined to indicate the way lexicon words are used in a segmented sentence. Such patterns are combined further with POS tags. Thus, Chinese POS disambiguation and UWG can be unified as a single task of assigning each known word to input a proper hybrid tag. Furthermore, a uniformly lexicalized HMM-based tagger also is developed to perform this task, which can incorporate both internal word-formation patterns and surrounding contextual information for Chinese POS tagging under the framework of HMMs. Experiments on the Peking University Corpus indicate that the tagging precision can be improved with efficiency by the proposed approach.

Suggested Citation

  • Guohong Fu & Kang-Kwong Luke, 2006. "Chinese POS Disambiguation and Unknown Word Guessing with Lexicalized HMMs," International Journal of Technology and Human Interaction (IJTHI), IGI Global, vol. 2(1), pages 39-50, January.
  • Handle: RePEc:igg:jthi00:v:2:y:2006:i:1:p:39-50
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/jthi.2006010103
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jthi00:v:2:y:2006:i:1:p:39-50. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.