IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v109y2016i1d10.1007_s11192-016-1945-y.html
   My bibliography  Save this article

MapReduce: Review and open challenges

Author

Listed:
  • Ibrahim Abaker Targio Hashem

    (University of Malaya)

  • Nor Badrul Anuar

    (University of Malaya)

  • Abdullah Gani

    (University of Malaya)

  • Ibrar Yaqoob

    (University of Malaya)

  • Feng Xia

    (Dalian University of Technology)

  • Samee Ullah Khan

    (North Dakota State University)

Abstract

The continuous increase in computational capacity over the past years has produced an overwhelming flow of data or big data, which exceeds the capabilities of conventional processing tools. Big data signify a new era in data exploration and utilization. The MapReduce computational paradigm is a major enabler for underlying numerous big data platforms. MapReduce is a popular tool for the distributed and scalable processing of big data. It is increasingly being used in different applications primarily because of its important features, including scalability, fault tolerance, ease of programming, and flexibility. Thus, bibliometric analysis and review was conducted to evaluate the trend of MapReduce research assessment publications indexed in Scopus from 2006 to 2015. This trend includes the use of the MapReduce framework for big data processing and its development. The study analyzed the distribution of published articles, countries, authors, keywords, and authorship pattern. For data visualization, VOSviewer program was used to produce distance- and graph-based maps. The top 10 most cited articles were also identified based on the citation count of publications. The study utilized productivity measures, domain visualization techniques and co-word to explore papers related to MapReduce in the field of big data. Moreover, the study discussed the most influential articles contributed to the improvements in MapReduce and reviewed the corresponding solutions. Finally, it presented several open challenges on big data processing with MapReduce as future research directions.

Suggested Citation

  • Ibrahim Abaker Targio Hashem & Nor Badrul Anuar & Abdullah Gani & Ibrar Yaqoob & Feng Xia & Samee Ullah Khan, 2016. "MapReduce: Review and open challenges," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(1), pages 389-422, October.
  • Handle: RePEc:spr:scient:v:109:y:2016:i:1:d:10.1007_s11192-016-1945-y
    DOI: 10.1007/s11192-016-1945-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-016-1945-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-016-1945-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lokman I. Meho & Kiduk Yang, 2007. "Impact of data sources on citation counts and rankings of LIS faculty: Web of science versus scopus and google scholar," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 58(13), pages 2105-2125, November.
    2. van Eck, N.J.P. & Waltman, L., 2009. "VOSviewer: A Computer Program for Bibliometric Mapping," ERIM Report Series Research in Management ERS-2009-005-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    3. Mao, Guozhu & Zou, Hongyang & Chen, Guanyi & Du, Huibin & Zuo, Jian, 2015. "Past, current and future of biomass energy research: A bibliometric analysis," Renewable and Sustainable Energy Reviews, Elsevier, vol. 52(C), pages 1823-1833.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Altarturi, Hamza H.M. & Saadoon, Muntadher & Anuar, Nor Badrul, 2020. "Cyber parental control: A bibliometric study," Children and Youth Services Review, Elsevier, vol. 116(C).
    2. Centobelli, Piera & Cerchione, Roberto & Esposito, Emilio & Oropallo, Eugenio, 2021. "Surfing blockchain wave, or drowning? Shaping the future of distributed ledgers and decentralized technologies," Technological Forecasting and Social Change, Elsevier, vol. 165(C).
    3. Mora, Luca & Deakin, Mark & Reid, Alasdair, 2019. "Combining co-citation clustering and text-based analysis to reveal the main development paths of smart cities," Technological Forecasting and Social Change, Elsevier, vol. 142(C), pages 56-69.
    4. P. V. Thayyib & Rajesh Mamilla & Mohsin Khan & Humaira Fatima & Mohd Asim & Imran Anwar & M. K. Shamsudheen & Mohd Asif Khan, 2023. "State-of-the-Art of Artificial Intelligence and Big Data Analytics Reviews in Five Different Domains: A Bibliometric Summary," Sustainability, MDPI, vol. 15(5), pages 1-38, February.
    5. Fernando Garrigós-Simón & Silvia Sanz-Blas & Yeamduan Narangajavana & Daniela Buzova, 2021. "The Nexus between Big Data and Sustainability: An Analysis of Current Trends and Developments," Sustainability, MDPI, vol. 13(12), pages 1-24, June.
    6. Hashem, Ibrahim Abaker Targio & Chang, Victor & Anuar, Nor Badrul & Adewole, Kayode & Yaqoob, Ibrar & Gani, Abdullah & Ahmed, Ejaz & Chiroma, Haruna, 2016. "The role of big data in smart city," International Journal of Information Management, Elsevier, vol. 36(5), pages 748-758.
    7. Salvatore Carta & Alessandro Sebastian Podda & Diego Reforgiato Recupero & Roberto Saia, 2020. "A Local Feature Engineering Strategy to Improve Network Anomaly Detection," Future Internet, MDPI, vol. 12(10), pages 1-30, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dejian Yu & Sun Meng, 2018. "An overview of biomass energy research with bibliometric indicators," Energy & Environment, , vol. 29(4), pages 576-590, June.
    2. Mingchun Cao & Ilan Alon, 2020. "Intellectual Structure of the Belt and Road Initiative Research: A Scientometric Analysis and Suggestions for a Future Research Agenda," Sustainability, MDPI, vol. 12(17), pages 1-40, August.
    3. Lakshmi Balachandran Nair & Michael Gibbert, 2016. "What makes a ‘good’ title and (how) does it matter for citations? A review and general model of article title attributes in management science," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1331-1359, June.
    4. Deming Lin & Tianhui Gong & Wenbin Liu & Martin Meyer, 2020. "An entropy-based measure for the evolution of h index research," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2283-2298, December.
    5. Johnson Ankrah & Ana Monteiro & Helena Madureira, 2022. "Bibliometric Analysis of Data Sources and Tools for Shoreline Change Analysis and Detection," Sustainability, MDPI, vol. 14(9), pages 1-23, April.
    6. Takanori Ida & Naomi Fukuzawa, 2013. "Effects of large-scale research funding programs: a Japanese case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 1253-1273, March.
    7. Masaru Kuno & Mary Prorok & Shubin Zhang & Huy Huynh & Thurston Miller, 2022. "Deciphering the US News and World Report Ranking of US Chemistry Graduate Programs," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2131-2150, May.
    8. Cathelijn J. F. Waaijer & Cornelis A. Bochove & Nees Jan Eck, 2011. "On the map: Nature and Science editorials," Scientometrics, Springer;Akadémiai Kiadó, vol. 86(1), pages 99-112, January.
    9. Zoltán Lakner & Brigitta Plasek & Gyula Kasza & Anna Kiss & Sándor Soós & Ágoston Temesi, 2021. "Towards Understanding the Food Consumer Behavior–Food Safety–Sustainability Triangle: A Bibliometric Approach," Sustainability, MDPI, vol. 13(21), pages 1-23, November.
    10. Bayissa Badada Badassa & Baiqing Sun & Lixin Qiao, 2020. "Sustainable Transport Infrastructure and Economic Returns: A Bibliometric and Visualization Analysis," Sustainability, MDPI, vol. 12(5), pages 1-24, March.
    11. Teja Koler-Povh & Primož Južnič & Goran Turk, 2014. "Impact of open access on citation of scholarly publications in the field of civil engineering," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 1033-1045, February.
    12. Esther Salmerón-Manzano & Francisco Manzano-Agugliaro, 2018. "The Higher Education Sustainability through Virtual Laboratories: The Spanish University as Case of Study," Sustainability, MDPI, vol. 10(11), pages 1-22, November.
    13. Yongjun Zhu & Erjia Yan, 2015. "Dynamic subfield analysis of disciplines: an examination of the trading impact and knowledge diffusion patterns of computer science," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(1), pages 335-359, July.
    14. Ahmed Tlili & Daniel Burgos & Ronghuai Huang & Sanjaya Mishra & Ramesh Chander Sharma & Aras Bozkurt, 2021. "An Analysis of Peer-Reviewed Publications on Open Educational Practices (OEP) from 2007 to 2020: A Bibliometric Mapping Analysis," Sustainability, MDPI, vol. 13(19), pages 1-15, September.
    15. Woocheol Kim & Gohar Feroz Khan & Jacob Wood & Muhammad Tariq Mahmood, 2016. "Employee Engagement for Sustainable Organizations: Keyword Analysis Using Social Network Analysis and Burst Detection Approach," Sustainability, MDPI, vol. 8(7), pages 1-11, July.
    16. Christoph Bartneck & Servaas Kokkelmans, 2011. "Detecting h-index manipulation through self-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(1), pages 85-98, April.
    17. Borrett, Stuart R. & Sheble, Laura & Moody, James & Anway, Evan C., 2018. "Bibliometric review of ecological network analysis: 2010–2016," Ecological Modelling, Elsevier, vol. 382(C), pages 63-82.
    18. Nianhang Xu & Winnie P. H. Poon & Kam C. Chan, 2014. "Contributing Institutions and Authors in International Business Research: A Quality-Based Assessment," Management International Review, Springer, vol. 54(5), pages 735-755, October.
    19. Magdalena Olczyk, 2016. "International Competitiveness in the Economics Literature: A Bibliometric Study," Athens Journal of Business & Economics, Athens Institute for Education and Research (ATINER), vol. 2(4), pages 375-388, October.
    20. De Andrés Fazio, Salvador & Urquía Grande, Elena & Pérez Estébanez, Raquel, 2022. "The “secret life” of the Statement of Cash Flow: A bibliometric analysis," Cuadernos de Gestión, Universidad del País Vasco - Instituto de Economía Aplicada a la Empresa (IEAE).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:109:y:2016:i:1:d:10.1007_s11192-016-1945-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.