IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i8p1762-d1117821.html
   My bibliography  Save this article

Multi-Document News Web Page Summarization Using Content Extraction and Lexical Chain Based Key Phrase Extraction

Author

Listed:
  • Chandrakala Arya

    (School of Computing, Graphic Era Hill University, Dehradun 248002, India)

  • Manoj Diwakar

    (CSE Department, Graphic Era Deemed to be University, Dehradun 248002, India)

  • Prabhishek Singh

    (School of Computer Science Engineering and Technology, Bennett University, Greater Noida 201009, India)

  • Vijendra Singh

    (School of Computer Science, University of Petroleum and Energy Studies, Dehradun 248007, India)

  • Seifedine Kadry

    (Department of Applied Data Science, Noroff University College, 4608 Kristiansand, Norway
    Artificial Intelligence Research Center (AIRC), Ajman University, Ajman 346, United Arab Emirates
    Department of Electrical and Computer Engineering, Lebanese American University, Byblos 13-5053, Lebanon)

  • Jungeun Kim

    (Department of Software and CMPSI, Kongju National University, Cheonan 31080, Republic of Korea)

Abstract

In the area of text summarization, there have been significant advances recently. In the meantime, the current trend in text summarization is focused more on news summarization. Therefore, developing a synthesis approach capable of extracting, comparing, and ranking sentences is vital to create a summary of various news articles in the context of erroneous online data. It is necessary, however, for the news summarization system to be able to deal with multi-document summaries due to content redundancy. This paper presents a method for summarizing multi-document news web pages based on similarity models and sentence ranking, where relevant sentences are extracted from the original article. English-language articles are collected from five news websites that cover the same topic and event. According to our experimental results, our approach provides better results than other recent methods for summarizing news.

Suggested Citation

  • Chandrakala Arya & Manoj Diwakar & Prabhishek Singh & Vijendra Singh & Seifedine Kadry & Jungeun Kim, 2023. "Multi-Document News Web Page Summarization Using Content Extraction and Lexical Chain Based Key Phrase Extraction," Mathematics, MDPI, vol. 11(8), pages 1-20, April.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:8:p:1762-:d:1117821
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/8/1762/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/8/1762/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hsin‐Hsi Chen & June‐Jei Kuo & Sheng‐Jie Huang & Chuan‐Jie Lin & Hung‐Chia Wung, 2003. "A summarization system for Chinese news from multiple sources," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 54(13), pages 1224-1236, November.
    2. Rautray, Rasmita & Balabantaray, Rakesh Chandra, 2017. "Cat swarm optimization based evolutionary framework for multi document summarization," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 477(C), pages 174-186.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guo, Lei & Meng, Zhuo & Sun, Yize & Wang, Libiao, 2018. "A modified cat swarm optimization based maximum power point tracking method for photovoltaic system under partially shaded condition," Energy, Elsevier, vol. 144(C), pages 501-514.
    2. Altay, Elif Varol & Alatas, Bilal, 2020. "Randomness as source for inspiring solution search methods: Music based approaches," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 537(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:8:p:1762-:d:1117821. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.