Author
Abstract
This study proposes and explores a natural language processing‐ (NLP) based strategy to address out‐of‐dictionary and vocabulary mismatch problems in query translation based English–Chinese Cross‐Language Information Retrieval (EC‐CLIR). The strategy, named the LKB approach, is to construct a lexical knowledge base (LKB) and to use it for query translation. In this article, the author describes the LKB construction process, which customizes available translation resources based on the document collection of the EC‐CLIR system. The evaluation shows that the LKB approach is very promising. It consistently increased the percentage of correct translations and decreased the percentage of missing translations in addition to effectively detecting the vocabulary gap between the document collection and the translation resource of the system. The comparative analysis of the top EC‐CLIR results using the LKB and two other translation resources demonstrates that the LKB approach has produced significant improvement in EC‐CLIR performance compared to performance using the original translation resource without customization. It has also achieved the same level of performance as a sophisticated machine translation system. The study concludes that the LKB approach has the potential to be an empirical model for developing real‐world CLIR systems. Linguistic knowledge and NLP techniques, if appropriately used, can improve the effectiveness of English–Chinese cross‐language information retrieval.
Suggested Citation
Jiangping Chen, 2006.
"A lexical knowledge base approach for English–Chinese cross‐language information retrieval,"
Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 57(2), pages 233-243, January.
Handle:
RePEc:bla:jamist:v:57:y:2006:i:2:p:233-243
DOI: 10.1002/asi.20273
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jamist:v:57:y:2006:i:2:p:233-243. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.