
CHTopoNER model-based method for recognizing Chinese place names from social media information

Authors

  • Mengwei Zhang (Information Engineering University)
  • Xingui Liu (Information Engineering University)
  • Zheng Zhang (Information Engineering University)
  • Yue Qiu (Information Engineering University)
  • Zhipeng Jiang (Information Engineering University)
  • Pengyu Zhang (University of Electronic Science and Technology of China)

Abstract

Chinese toponym recognition is a key task in named entity recognition and has significant implications for improving geographic information systems. Because social media is real-time and rich in geographical data, identifying Chinese toponyms in social media content, including compound toponyms, informal toponyms, and other forms, is important for automatic geospatial information extraction. However, the strong word-building ability, diverse features, and ambiguity of Chinese toponyms, combined with the linguistic irregularities of social media, make it difficult to locate toponym boundaries accurately and to resolve ambiguities. Furthermore, existing Chinese toponym recognition methods often ignore the fusion of local and global features during feature extraction, which results in a loss of semantic information. To address this, we used the Chinese-roberta-wwm-ext pre-trained language model to encode the input text and obtain character-level information. An improved SoftLexicon-based statistical method was employed to acquire word-level semantic information, which was then integrated with the character-level semantic information. A two-channel neural network layer, comprising a bi-directional long short-term memory network and an inception-dilated convolutional neural network, was used to extract global and local features from the text. Additionally, a conditional random field was applied to establish label constraints. The proposed deep neural network model, called CHTopoNER, is designed to identify various forms of Chinese toponyms in irregular Chinese social media content. Its effectiveness was validated on four publicly available annotated toponym datasets and a custom social media dataset. CHTopoNER surpasses state-of-the-art Chinese toponym recognition models and achieves promising results for extracting various types of toponyms and spatial location terms.
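
The abstract does not say how the authors' improved SoftLexicon method works, so the following is only a minimal sketch of the basic idea behind SoftLexicon-style word-level features: for each character, collect the lexicon words in which it appears at the Beginning, in the Middle, at the End, or as a Single-character word. The function name `soft_lexicon_sets`, the `max_word_len` parameter, and the toy lexicon are assumptions made for illustration; they are not taken from the paper, and the statistical weighting and fusion with character embeddings are not shown.

```python
from collections import defaultdict


def soft_lexicon_sets(sentence, lexicon, max_word_len=4):
    """For each character, list the lexicon words that contain it,
    grouped by the character's position in the word (B, M, E, or S).

    Hypothetical helper: a plain-Python illustration of B/M/E/S matching,
    not the authors' improved method.
    """
    sets = [defaultdict(set) for _ in sentence]
    n = len(sentence)
    for start in range(n):
        # Try every substring that starts here, up to max_word_len characters.
        for end in range(start + 1, min(n, start + max_word_len) + 1):
            word = sentence[start:end]
            if word not in lexicon:
                continue
            if len(word) == 1:
                sets[start]["S"].add(word)      # single-character word
                continue
            sets[start]["B"].add(word)          # first character of the word
            sets[end - 1]["E"].add(word)        # last character of the word
            for mid in range(start + 1, end - 1):
                sets[mid]["M"].add(word)        # interior characters
    # Sort each set so the output is deterministic.
    return [{tag: sorted(s[tag]) for tag in "BMES"} for s in sets]


if __name__ == "__main__":
    # Toy lexicon (an assumption for this example only).
    toy_lexicon = {"南京", "南京市", "市长", "长江", "长江大桥", "大桥", "江"}
    text = "南京市长江大桥"
    for char, groups in zip(text, soft_lexicon_sets(text, toy_lexicon)):
        print(char, groups)
```

In this toy sentence the character 长 ends one candidate word (市长) and begins two others (长江, 长江大桥), which illustrates the kind of boundary ambiguity the abstract mentions. In a full model, each of the four word sets would typically be pooled into a vector and concatenated with the character representation before the feature extraction layers.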

Suggested Citation

  • Mengwei Zhang & Xingui Liu & Zheng Zhang & Yue Qiu & Zhipeng Jiang & Pengyu Zhang, 2024. "CHTopoNER model-based method for recognizing Chinese place names from social media information," Journal of Geographical Systems, Springer, vol. 26(1), pages 149-179, January.
  • Handle: RePEc:kap:jgeosy:v:26:y:2024:i:1:d:10.1007_s10109-023-00433-w
    DOI: 10.1007/s10109-023-00433-w

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10109-023-00433-w
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    References listed on IDEAS

    1. Kai Ma & YongJian Tan & Zhong Xie & Qinjun Qiu & Siqiong Chen, 2022. "Chinese toponym recognition with variant neural structures from social media messages based on BERT methods," Journal of Geographical Systems, Springer, vol. 24(2), pages 143-169, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yijiang Zhao & Daoan Zhang & Lei Jiang & Qi Liu & Yizhi Liu & Zhuhua Liao, 2024. "EIBC: a deep learning framework for Chinese toponym recognition with multiple layers," Journal of Geographical Systems, Springer, vol. 26(3), pages 407-425, July.
