IDEAS home Printed from https://ideas.repec.org/a/bla/jinfst/v72y2021i5p611-626.html
   My bibliography  Save this article

Enhancing keyphrase extraction from microblogs using human reading time

Author

Listed:
  • Yingyi Zhang
  • Chengzhi Zhang

Abstract

The premise of manual keyphrase annotation is to read the corresponding content of an annotated object. Intuitively, when we read, more important words will occupy a longer reading time. Hence, by leveraging human reading time, we can find the salient words in the corresponding content. However, previous studies on keyphrase extraction ignore human reading features. In this article, we aim to leverage human reading time to extract keyphrases from microblog posts. There are two main tasks in this study. One is to determine how to measure the time spent by a human on reading a word. We use eye fixation durations (FDs) extracted from an open source eye‐tracking corpus. Moreover, we propose strategies to make eye FD more effective on keyphrase extraction. The other task is to determine how to integrate human reading time into keyphrase extraction models. We propose two novel neural network models. The first is a model in which the human reading time is used as the ground truth of the attention mechanism. In the second model, we use human reading time as the external feature. Quantitative and qualitative experiments show that our proposed models yield better performance than the baseline models on two microblog datasets.

Suggested Citation

  • Yingyi Zhang & Chengzhi Zhang, 2021. "Enhancing keyphrase extraction from microblogs using human reading time," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(5), pages 611-626, May.
  • Handle: RePEc:bla:jinfst:v:72:y:2021:i:5:p:611-626
    DOI: 10.1002/asi.24430
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.24430
    Download Restriction: no

    File URL: https://libkey.io/10.1002/asi.24430?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ebadi, Ashkan & Auger, Alain & Gauthier, Yvan, 2022. "Detecting emerging technologies and their evolution using deep learning and weak signal analysis," Journal of Informetrics, Elsevier, vol. 16(4).
    2. Chengzhi Zhang & Lei Zhao & Mengyuan Zhao & Yingyi Zhang, 2022. "Enhancing keyphrase extraction from academic articles with their reference information," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(2), pages 703-731, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jinfst:v:72:y:2021:i:5:p:611-626. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.