IDEAS home Printed from https://ideas.repec.org/a/ist/ekoist/v0y2023i39p10-25.html
   My bibliography  Save this article

Analysis of Skills and Qualifications Required in Data Scientist Job Postings Based on the Pareto Analysis Perspective Using Text Mining

Author

Listed:
  • Erkan Işığıçok

    (Bursa Uludağ University, Faculty of Economics and Administrative Sciences, Department of Econometrics, Bursa, Turkiye)

  • Sadullah Çelik

    (Aydın Adnan Menderes University, Nazilli Faculty of Economics and Administrative Sciences, Department of International Trade and Finance, Aydın, Turkiye)

  • Dilek Özdemir Yılmaz

    (Bursa Uludağ University, Faculty of Economics and Administrative Sciences, Social Sciences Institute, Bursa, Turkiye)

Abstract

Today, there are more job posts than ever before, making it incredibly challenging for job searchers to find the position that best suits them. To overcome this difficulty, text mining methods can be used to extract information such as job titles, required skills, and required experience, and to analyze job postings. This information can also be used to match job seekers with the most relevant job postings. The main purpose of this research is to determine which skills, techniques, subjects, fields, and so on should be prioritized by job seekers. For this purpose, 200 data scientist job postings from Turkey and 200 data scientist job postings from the USA are analyzed. According to the results, employers who have announced their interest in hiring a Data Scientist prefer people who are experts in Machine Learning, Data Science, Python, SQL, R, Statistics, and Mathematics, people with BSc, MSc, and PhD education levels, people with 3+ years of work experience, and people who know Visualization, Data Mining, Prediction, NLP, and Clustering techniques. For this reason, it is recommended that people who want to become data scientists in TR or the USA improve themselves in these techniques, skills, and experiences to be accepted to data scientist position jobs more easily.

Suggested Citation

  • Erkan Işığıçok & Sadullah Çelik & Dilek Özdemir Yılmaz, 2023. "Analysis of Skills and Qualifications Required in Data Scientist Job Postings Based on the Pareto Analysis Perspective Using Text Mining," EKOIST Journal of Econometrics and Statistics, Istanbul University, Faculty of Economics, vol. 0(39), pages 10-25, December.
  • Handle: RePEc:ist:ekoist:v:0:y:2023:i:39:p:10-25
    DOI: 10.26650/ekoist.2023.39.1256697
    as

    Download full text from publisher

    File URL: https://cdn.istanbul.edu.tr/file/JTA6CLJ8T5/C204E2B65323497FB3DDC28E5F7FB6E0
    Download Restriction: no

    File URL: https://iupress.istanbul.edu.tr/tr/journal/ekoist/article/analysis-of-skills-and-qualifications-required-in-data-scientist-job-postings-based-on-the-pareto-analysis-perspective-using-text-mining
    Download Restriction: no

    File URL: https://libkey.io/10.26650/ekoist.2023.39.1256697?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Costa, Carlos & Santos, Maribel Yasmina, 2017. "The data scientist profile and its representativeness in the European e-Competence framework and the skills framework for the information age," International Journal of Information Management, Elsevier, vol. 37(6), pages 726-734.
    2. Grimmer, Justin & Stewart, Brandon M., 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts," Political Analysis, Cambridge University Press, vol. 21(3), pages 267-297, July.
    3. Jonathan Benchimol & Sophia Kazinnik & Yossi Saadon, 2022. "Text mining methodologies with R: An application to central bank texts," Post-Print emse-03953759, HAL.
    4. Alzate, Miriam & Arce-Urriza, Marta & Cebollada, Javier, 2022. "Mining the text of online consumer reviews to analyze brand image and brand positioning," Journal of Retailing and Consumer Services, Elsevier, vol. 67(C).
    5. Gary King & Patrick Lam & Margaret E. Roberts, 2017. "Computer‐Assisted Keyword and Document Set Discovery from Unstructured Text," American Journal of Political Science, John Wiley & Sons, vol. 61(4), pages 971-988, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Völker, Richard & Hirschauer, Norbert & Lind, Fabienne & Gruener, Sven, 2024. "Search term validation in agricultural economics: conceptual background and application," OSF Preprints v68r7, Center for Open Science.
    2. Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2023. "Measuring partisan media bias in US newscasts from 2001 to 2012," European Journal of Political Economy, Elsevier, vol. 78(C).
    3. McCannon, Bryan & Zhou, Yang & Hall, Joshua, 2021. "Measuring a Contract’s Breadth: A Text Analysis," Working Papers 11013, George Mason University, Mercatus Center.
    4. Kim, Yeonshin & Hur, Won-Moo & Lee, Luri, 2023. "Understanding customer participation in CSR activities: The impact of perceptions of CSR, affective commitment, brand equity, and corporate reputation," Journal of Retailing and Consumer Services, Elsevier, vol. 75(C).
    5. Fraccaroli, Nicolò & Giovannini, Alessandro & Jamet, Jean-François & Persson, Eric, 2022. "Ideology and monetary policy. The role of political parties’ stances in the European Central Bank’s parliamentary hearings," European Journal of Political Economy, Elsevier, vol. 74(C).
    6. Rauh, Christian, 2015. "Communicating supranational governance? The salience of EU affairs in the German Bundestag, 1991–2013," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 16(1), pages 116-138.
    7. Julia Seiermann, 2018. "Only Words? How Power in Trade Agreement Texts Affects International Trade Flows," UNCTAD Blue Series Papers 80, United Nations Conference on Trade and Development.
    8. Arthur Dyevre & Nicolas Lampach, 2021. "Issue attention on international courts: Evidence from the European Court of Justice," The Review of International Organizations, Springer, vol. 16(4), pages 793-815, October.
    9. Dewenter, Ralf & Dulleck, Uwe & Thomas, Tobias, 2018. "The political coverage index and its application to government capture," Research Papers 6, EcoAustria – Institute for Economic Research.
    10. Pastwa, Anna M. & Shrestha, Prabal & Thewissen, James & Torsin, Wouter, 2021. "Unpacking the black box of ICO white papers: a topic modeling approach," LIDAM Discussion Papers LFIN 2021018, Université catholique de Louvain, Louvain Finance (LFIN).
    11. Maksym Polyakov & Morteza Chalak & Md. Sayed Iftekhar & Ram Pandit & Sorada Tapsuwan & Fan Zhang & Chunbo Ma, 2018. "Authorship, Collaboration, Topics, and Research Gaps in Environmental and Resource Economics 1991–2015," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 71(1), pages 217-239, September.
    12. Milena Djourelova & Ruben Durante, 2019. "Media attention and strategic timing in politics: Evidence from U.S. presidential executive orders," Economics Working Papers 1675, Department of Economics and Business, Universitat Pompeu Fabra.
    13. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    14. Ishani Patharia Chopra & Charles Jebarajakirthy & Tanu Jain & Haroon Iqbal Maseeh, 2024. "Electronic shopping cart abandonment: What do we know and where should we be heading?," Electronic Markets, Springer;IIM University of St. Gallen, vol. 34(1), pages 1-30, December.
    15. Yuting Chen & Don Bredin & Valerio Potì & Roman Matkovskyy, 2022. "COVID risk narratives: a computational linguistic approach to the econometric identification of narrative risk during a pandemic," Digital Finance, Springer, vol. 4(1), pages 17-61, March.
    16. Purwoko Haryadi Santoso & Edi Istiyono & Haryanto & Wahyu Hidayatulloh, 2022. "Thematic Analysis of Indonesian Physics Education Research Literature Using Machine Learning," Data, MDPI, vol. 7(11), pages 1-41, October.
    17. Yu-Ru Lin & Wen-Ting Chung, 2020. "The dynamics of Twitter users’ gun narratives across major mass shooting events," Palgrave Communications, Palgrave Macmillan, vol. 7(1), pages 1-16, December.
    18. Markus Eberhardt & Giovanni Facchini & Valeria Rueda, 2023. "Gender Differences in Reference Letters: Evidence from the Economics Job Market," The Economic Journal, Royal Economic Society, vol. 133(655), pages 2676-2708.
    19. Rauh, Christian, 2018. "Validating a sentiment dictionary for German political language—a workbench note," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 15(4), pages 319-343.
    20. Ferrara, Federico M. & Masciandaro, Donato & Moschella, Manuela & Romelli, Davide, 2022. "Political voice on monetary policy: Evidence from the parliamentary hearings of the European Central Bank," European Journal of Political Economy, Elsevier, vol. 74(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ist:ekoist:v:0:y:2023:i:39:p:10-25. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Istanbul University Press Operational Team (Ertuğrul YAŞAR) (email available below). General contact details of provider: https://edirc.repec.org/data/ifisttr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.