IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v390y2011i18p3157-3163.html
   My bibliography  Save this article

The role of entropy in word ranking

Author

Listed:
  • Mehri, Ali
  • Darooneh, Amir H.

Abstract

Entropy as a measure of complexity in the systems has been applied for ranking the words in the human written texts. We introduce a novel approach to evaluate accuracy for retrieved indices. We also have an illustrative comparison between proposed entropic metrics and some other methods in extracting the keywords. It seems that, some of the discussed metrics apply similar features for word ranking in the text. This work recommend the entropy as a systematic measure in text mining.

Suggested Citation

  • Mehri, Ali & Darooneh, Amir H., 2011. "The role of entropy in word ranking," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(18), pages 3157-3163.
  • Handle: RePEc:eee:phsmap:v:390:y:2011:i:18:p:3157-3163
    DOI: 10.1016/j.physa.2011.04.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437111003074
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2011.04.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David L. Olson & Dursun Delen, 2008. "Advanced Data Mining Techniques," Springer Books, Springer, number 978-3-540-76917-0, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mehri, Ali & Agahi, Hamzeh & Mehri-Dehnavi, Hossein, 2019. "A novel word ranking method based on distorted entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 484-492.
    2. Carretero-Campos, C. & Bernaola-Galván, P. & Coronado, A.V. & Carpena, P., 2013. "Improving statistical keyword detection in short texts: Entropic and clustering approaches," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(6), pages 1481-1492.
    3. Jamaati, Maryam & Mehri, Ali, 2018. "Text mining by Tsallis entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 490(C), pages 1368-1376.
    4. Mehri, Ali & Jamaati, Maryam, 2021. "Statistical metrics for languages classification: A case study of the Bible translations," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mark Gilchrist & Deana Lehmann Mooers & Glenn Skrubbeltrang & Francine Vachon, 2012. "Knowledge Discovery in Databases for Competitive Advantage," Journal of Management and Strategy, Journal of Management and Strategy, Sciedu Press, vol. 3(2), pages 2-15, April.
    2. Marina Johnson & Abdullah Albizri & Serhat Simsek, 2022. "Artificial intelligence in healthcare operations to enhance treatment outcomes: a framework to predict lung cancer prognosis," Annals of Operations Research, Springer, vol. 308(1), pages 275-305, January.
    3. Simsek, Serhat & Dag, Ali & Tiahrt, Thomas & Oztekin, Asil, 2021. "A Bayesian Belief Network-based probabilistic mechanism to determine patient no-show risk categories," Omega, Elsevier, vol. 100(C).
    4. Sebastian Büsch & Volker Nissen & Arndt Wünscher, 0. "Automatic classification of data-warehouse-data for information lifecycle management using machine learning techniques," Information Systems Frontiers, Springer, vol. 0, pages 1-15.
    5. Yucel, Ahmet & Dag, Ali & Oztekin, Asil & Carpenter, Mark, 2022. "A novel text analytic methodology for classification of product and service reviews," Journal of Business Research, Elsevier, vol. 151(C), pages 287-297.
    6. Kizilaslan, Recep & Freund, Steven & Iseri, Ali, 2016. "A data analytic approach to forecasting daily stock returns in an emerging marketAuthor-Name: Oztekin, Asil," European Journal of Operational Research, Elsevier, vol. 253(3), pages 697-710.
    7. Saljooghi, Saeed & Safisamghabadib, Azamdokht, 2016. "Analyzing Semiconductor component's market sales data to create an Expert Fuzzy inference system," MPRA Paper 79846, University Library of Munich, Germany.
    8. Ramin Vakili & Mojdeh Khorsand, 2022. "A Machine Learning-Based Method for Identifying Critical Distance Relays for Transient Stability Studies," Energies, MDPI, vol. 15(23), pages 1-28, November.
    9. Delen, Dursun & Cogdell, Douglas & Kasap, Nihat, 2012. "A comparative analysis of data mining methods in predicting NCAA bowl outcomes," International Journal of Forecasting, Elsevier, vol. 28(2), pages 543-552.
    10. Chen, Kunlong & Zheng, Fangdan & Jiang, Jiuchun & Zhang, Weige & Jiang, Yan & Chen, Kunjin, 2017. "Practical failure recognition model of lithium-ion batteries based on partial charging process," Energy, Elsevier, vol. 138(C), pages 1199-1208.
    11. Emrouznejad, Ali & De Witte, Kristof, 2010. "COOPER-framework: A unified process for non-parametric projects," European Journal of Operational Research, Elsevier, vol. 207(3), pages 1573-1586, December.
    12. Shaheen, Muhammad & Khan, Muhammad Zeb, 2016. "A method of data mining for selection of site for wind turbines," Renewable and Sustainable Energy Reviews, Elsevier, vol. 55(C), pages 1225-1233.
    13. Asil Oztekin, 2018. "Information fusion-based meta-classification predictive modeling for ETF performance," Information Systems Frontiers, Springer, vol. 20(2), pages 223-238, April.
    14. Abdorrahman Haeri, 2020. "Analyzing safety level and recognizing flaws of commercial centers through data mining approach," Journal of Risk and Reliability, , vol. 234(3), pages 512-526, June.
    15. Sebastian Büsch & Volker Nissen & Arndt Wünscher, 2017. "Automatic classification of data-warehouse-data for information lifecycle management using machine learning techniques," Information Systems Frontiers, Springer, vol. 19(5), pages 1085-1099, October.
    16. Renhe Hu & Zihan Hui & Yifan Li & Jueqi Guan, 2023. "Research on Learning Concentration Recognition with Multi-Modal Features in Virtual Reality Environments," Sustainability, MDPI, vol. 15(15), pages 1-16, July.
    17. Kazim Topuz & Hasmet Uner & Asil Oztekin & Mehmet Bayram Yildirim, 2018. "Predicting pediatric clinic no-shows: a decision analytic framework using elastic net and Bayesian belief network," Annals of Operations Research, Springer, vol. 263(1), pages 479-499, April.
    18. Cankaya, Burak & Topuz, Kazim & Delen, Dursun & Glassman, Aaron, 2023. "Evidence-based managerial decision-making with machine learning: The case of Bayesian inference in aviation incidents," Omega, Elsevier, vol. 120(C).
    19. Vangelis Marinakis & Themistoklis Koutsellis & Alexandros Nikas & Haris Doukas, 2021. "AI and Data Democratisation for Intelligent Energy Management," Energies, MDPI, vol. 14(14), pages 1-14, July.
    20. Mehri, Ali & Darooneh, Amir H. & Shariati, Ashrafalsadat, 2012. "The complex networks approach for authorship attribution of books," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(7), pages 2429-2437.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:390:y:2011:i:18:p:3157-3163. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.