IDEAS home Printed from https://ideas.repec.org/a/taf/amstat/v72y2018i1p89-96.html
   My bibliography  Save this article

Teaching Stats for Data Science

Author

Listed:
  • Daniel Kaplan

Abstract

“Data science” is a useful catchword for methods and concepts original to the field of statistics, but typically being applied to large, multivariate, observational records. Such datasets call for techniques not often part of an introduction to statistics: modeling, consideration of covariates, sophisticated visualization, and causal reasoning. This article re-imagines introductory statistics as an introduction to data science and proposes a sequence of 10 blocks that together compose a suitable course for extracting information from contemporary data. Recent extensions to the mosaic packages for R together with tools from the “tidyverse” provide a concise and readable notation for wrangling, visualization, model-building, and model interpretation: the fundamental computational tasks of data science.

Suggested Citation

  • Daniel Kaplan, 2018. "Teaching Stats for Data Science," The American Statistician, Taylor & Francis Journals, vol. 72(1), pages 89-96, January.
  • Handle: RePEc:taf:amstat:v:72:y:2018:i:1:p:89-96
    DOI: 10.1080/00031305.2017.1398107
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/00031305.2017.1398107
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/00031305.2017.1398107?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. George Cobb, 2015. "Mere Renovation is Too Little Too Late: We Need to Rethink our Undergraduate Curriculum from the Ground Up," The American Statistician, Taylor & Francis Journals, vol. 69(4), pages 266-282, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kevin Cummiskey & Karsten Lübke, 2022. "Causality in statistics and data science education," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 16(3), pages 277-286, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chris J. Wild, 2016. "Discussion: Locating Statistics in the World of Finding Out," International Statistical Review, International Statistical Institute, vol. 84(2), pages 194-202, August.
    2. Manuela Svoboda, 2022. "Evaluation of Motivation, Expectation, and Present Situation in 3rd Year Undergraduate Students of German Language and Literature at the University of Rijeka, Croatia," European Journal of Education Articles, Revistia Research and Publishing, vol. 5, ejed_v5_i.
    3. Nicholas Jon Horton, 2016. "Discussion: Making Progress in a Crowded Market," International Statistical Review, International Statistical Institute, vol. 84(2), pages 179-181, August.
    4. Ryan Sterling McCulloch, 2017. "Learning Outcomes in a Laboratory Environment vs. Classroom for Statistics Instruction: An Alternative Approach Using Statistical Software," International Journal of Higher Education, Sciedu Press, vol. 6(5), pages 131-131, October.
    5. Jeff Witmer, 2017. "Bayes and MCMC for Undergraduates," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 259-264, July.
    6. Robert A. Stine, 2017. "Explaining Normal Quantile-Quantile Plots Through Animation: The Water-Filling Analogy," The American Statistician, Taylor & Francis Journals, vol. 71(2), pages 145-147, April.
    7. Amy L. Phelps & Kathryn A. Szabat, 2017. "The Current Landscape of Teaching Analytics to Business Students at Institutions of Higher Education: Who is Teaching What?," The American Statistician, Taylor & Francis Journals, vol. 71(2), pages 155-161, April.
    8. Roger W. Hoerl & Ronald D. Snee, 2017. "Statistical Engineering: An Idea Whose Time Has Come?," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 209-219, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:amstat:v:72:y:2018:i:1:p:89-96. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UTAS20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.