Skip to main content

Title: The Data Scientist’s Workflow: EDA and Statistical Modeling with Python in Jupyter Notebooks

Speakers: Chris Holdgraf, UC Berkeley; choldgraf@berkeley.edu
                  David Liu, University of Toronto; david@cs.toronto.edu
                  Nathan Taback, University of Toronto; nathan.taback@utoronto.ca
                  Nathaniel Stevens, University of Waterloo; nstevens@uwaterloo.ca
Date: Saturday, June 12, 2021
Time: 13:00 - 16:00 (EDT)

Outline:

  • Introduction to Jupyter (Chris Holdgraf, UC Berkeley)
  • Introduction to Python (David Liu, University of Toronto)
  • EDA with Python (Nathan Taback, University of Toronto)
  • Statistical Modelling with Python (Nathaniel Stevens, University of Waterloo)

Workshop format:

An interactive workshop where participants will gain hands on experience with statistical analysis using Python in a Jupyter notebook.  We will assume that participants have never used Python, but do have some experience with (statistical) programming in another language (e.g., R, SAS).

Total workshop time: ~180 min.