IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v390y2011i18p3157-3163.html
   My bibliography  Save this article

The role of entropy in word ranking

Author

Listed:
  • Mehri, Ali
  • Darooneh, Amir H.

Abstract

Entropy as a measure of complexity in the systems has been applied for ranking the words in the human written texts. We introduce a novel approach to evaluate accuracy for retrieved indices. We also have an illustrative comparison between proposed entropic metrics and some other methods in extracting the keywords. It seems that, some of the discussed metrics apply similar features for word ranking in the text. This work recommend the entropy as a systematic measure in text mining.

Suggested Citation

  • Mehri, Ali & Darooneh, Amir H., 2011. "The role of entropy in word ranking," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(18), pages 3157-3163.
  • Handle: RePEc:eee:phsmap:v:390:y:2011:i:18:p:3157-3163
    DOI: 10.1016/j.physa.2011.04.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437111003074
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2011.04.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David L. Olson & Dursun Delen, 2008. "Advanced Data Mining Techniques," Springer Books, Springer, number 978-3-540-76917-0, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mehri, Ali & Agahi, Hamzeh & Mehri-Dehnavi, Hossein, 2019. "A novel word ranking method based on distorted entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 484-492.
    2. Carretero-Campos, C. & Bernaola-Galván, P. & Coronado, A.V. & Carpena, P., 2013. "Improving statistical keyword detection in short texts: Entropic and clustering approaches," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(6), pages 1481-1492.
    3. Jamaati, Maryam & Mehri, Ali, 2018. "Text mining by Tsallis entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 490(C), pages 1368-1376.
    4. Mehri, Ali & Jamaati, Maryam, 2021. "Statistical metrics for languages classification: A case study of the Bible translations," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vangelis Marinakis & Themistoklis Koutsellis & Alexandros Nikas & Haris Doukas, 2021. "AI and Data Democratisation for Intelligent Energy Management," Energies, MDPI, vol. 14(14), pages 1-14, July.
    2. Mark Gilchrist & Deana Lehmann Mooers & Glenn Skrubbeltrang & Francine Vachon, 2012. "Knowledge Discovery in Databases for Competitive Advantage," Journal of Management and Strategy, Journal of Management and Strategy, Sciedu Press, vol. 3(2), pages 2-15, April.
    3. Marina Johnson & Abdullah Albizri & Serhat Simsek, 2022. "Artificial intelligence in healthcare operations to enhance treatment outcomes: a framework to predict lung cancer prognosis," Annals of Operations Research, Springer, vol. 308(1), pages 275-305, January.
    4. Mehri, Ali & Darooneh, Amir H. & Shariati, Ashrafalsadat, 2012. "The complex networks approach for authorship attribution of books," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(7), pages 2429-2437.
    5. Michał Jasiński & Tomasz Sikorski & Zbigniew Leonowicz & Klaudiusz Borkowski & Elżbieta Jasińska, 2020. "The Application of Hierarchical Clustering to Power Quality Measurements in an Electrical Power Network with Distributed Generation," Energies, MDPI, vol. 13(9), pages 1-19, May.
    6. Beni Rohrbach & Sharolyn Anderson & Patrick Laube, 2016. "The effects of sample size on data quality in participatory mapping of past land use," Environment and Planning B, , vol. 43(4), pages 681-697, July.
    7. Simsek, Serhat & Dag, Ali & Tiahrt, Thomas & Oztekin, Asil, 2021. "A Bayesian Belief Network-based probabilistic mechanism to determine patient no-show risk categories," Omega, Elsevier, vol. 100(C).
    8. Sebastian Büsch & Volker Nissen & Arndt Wünscher, 0. "Automatic classification of data-warehouse-data for information lifecycle management using machine learning techniques," Information Systems Frontiers, Springer, vol. 0, pages 1-15.
    9. Yucel, Ahmet & Dag, Ali & Oztekin, Asil & Carpenter, Mark, 2022. "A novel text analytic methodology for classification of product and service reviews," Journal of Business Research, Elsevier, vol. 151(C), pages 287-297.
    10. Kizilaslan, Recep & Freund, Steven & Iseri, Ali, 2016. "A data analytic approach to forecasting daily stock returns in an emerging marketAuthor-Name: Oztekin, Asil," European Journal of Operational Research, Elsevier, vol. 253(3), pages 697-710.
    11. Saljooghi, Saeed & Safisamghabadib, Azamdokht, 2016. "Analyzing Semiconductor component's market sales data to create an Expert Fuzzy inference system," MPRA Paper 79846, University Library of Munich, Germany.
    12. Asil Oztekin, 0. "Information fusion-based meta-classification predictive modeling for ETF performance," Information Systems Frontiers, Springer, vol. 0, pages 1-16.
    13. Ramin Vakili & Mojdeh Khorsand, 2022. "A Machine Learning-Based Method for Identifying Critical Distance Relays for Transient Stability Studies," Energies, MDPI, vol. 15(23), pages 1-28, November.
    14. Delen, Dursun & Cogdell, Douglas & Kasap, Nihat, 2012. "A comparative analysis of data mining methods in predicting NCAA bowl outcomes," International Journal of Forecasting, Elsevier, vol. 28(2), pages 543-552.
    15. ShakorShahabi, Reza & Qarahasanlou, Ali Nouri & Azimi, Seyed Reza & Mottahedi, Adel, 2021. "Application of data mining in Iran's Artisanal and Small-Scale mines challenges analysis," Resources Policy, Elsevier, vol. 74(C).
    16. Chen, Kunlong & Zheng, Fangdan & Jiang, Jiuchun & Zhang, Weige & Jiang, Yan & Chen, Kunjin, 2017. "Practical failure recognition model of lithium-ion batteries based on partial charging process," Energy, Elsevier, vol. 138(C), pages 1199-1208.
    17. Gitae Kim & Bongsug Chae & David Olson, 2013. "A support vector machine (SVM) approach to imbalanced datasets of customer responses: comparison with other customer response models," Service Business, Springer;Pan-Pacific Business Association, vol. 7(1), pages 167-182, March.
    18. Dinesh, Chinthaka & Welikala, Shirantha & Liyanage, Yasitha & Ekanayake, Mervyn Parakrama B. & Godaliyadda, Roshan Indika & Ekanayake, Janaka, 2017. "Non-intrusive load monitoring under residential solar power influx," Applied Energy, Elsevier, vol. 205(C), pages 1068-1080.
    19. Emrouznejad, Ali & De Witte, Kristof, 2010. "COOPER-framework: A unified process for non-parametric projects," European Journal of Operational Research, Elsevier, vol. 207(3), pages 1573-1586, December.
    20. Javier Gomez & Cesar Alfaro & Felipe Ortega & Javier M. Moguerza & Maria Jesus Algar & Raul Moreno, 2024. "Adapting support vector optimisation algorithms to textual gender classification," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(3), pages 463-488, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:390:y:2011:i:18:p:3157-3163. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.