Data Science: A First Introduction worksheets

Jupyter notebook worksheets to accompany Data Science: A First Introduction by Tiffany Timbers, Trevor Campbell, and Melissa Lee.

To use these worksheets, you can either:

  1. Click on a “launch binder” button to open an interactive, but non-persistent, version of the notebook.

  2. Download this repository by clicking here and follow our computer setup instructions here.

Regardless of the method you choose to access them, we also recommend reading our Combining code and text with Jupyter chapter before starting out.

R and the tidyverse
Reading in data locally and from the web
Cleaning and wrangling data
Effective data visualization
Classification I: training & predicting
Classification II: evaluation & tuning
Regression I: K-nearest neighbors
Regression II: linear regression
Clustering
Statistical inference (sampling)
Statistical inference (bootstrapping)
Collaboration with version control


Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)


We would like to thank the BinderHub Federation for their kind and generous support. The interactive versions of these notebooks would not be possible without their efforts.


Jupyter et al., “Binder 2.0 - Reproducible, Interactive, Sharable Environments for Science at Scale.” Proceedings of the 17th Python in Science Conference. 2018. doi://10.25080/Majora-4af1f417-011