Introduction to Python for Statistics Short Course

by Steven Stehnach, Data Scientist 

Date: January 26, 2019 (Saturday), 9:30 am – 4 pm (with lunch break)

Location: Middlebush Hall 142 (Middlebush/Johnson Auditorium)

Due to the overwhelming response, attendance is capped at 150. Only a few seats are left, so please sign up early if you are interested.

All course materials, software requirements, and installation guidelines will be provided on 01/22/2019 through e-mail to the registered participants.

No refund will be issued after 01/22/2019, 4 pm.

Course Description:

The course will provide a gentle introduction to Python for statistical modeling. Python is one of the world's most popular programming languages and is commonly used by major companies for web development. Google has used it for much of its web spider and search engine code, Yahoo built its "Groups" and "Maps" features with it, and the Central Intelligence Agency built its entire website with it. In addition to web development, Python can be found in commercial games, business software, educational applications, animation systems, and many other types of software.

Course topics will include:

  • Using a useful Python GUI (e.g., Spyder)
  • Basic introduction to reading, manipulating, and writing data files
  • Simple data analysis (regression and classification methods)
  • Basic visualization (scatterplots, histograms, kernel smooths, regressions, maps)
  • Basic interface with R

The course will be “hands-on,” and participants are strongly encouraged to bring their laptops. Instructions for recommended software and other course materials will be made available before the course. All recommended software will be open source and can be downloaded free of charge.

Course Fee Structure:

University of Missouri Department of Statistics Students (undergrad and grad): Free

University of Missouri Department of Statistics Faculty and Staff: $20

University of Missouri Students (non-Stat): $25

University of Missouri Faculty and Staff (non-Stat): $35

Non-University of Missouri Academic: $50

Non-Academic: $100

Seats are limited, so register as soon as possible. To register, click here.

About the Course Presenter:

Steven Stehnach

Steven Stehnach is a data scientist who is enthusiastic about geography, marketing, sports, finance, and data applications in a variety of other fields. In May 2017, he completed a master's degree in statistics at the University of Missouri, where he focused on machine learning and predictive modeling. Steven is highly proficient in both R and Python and has worked extensively on developing and implementing complex statistical models with both languages in a range of practical areas.

Coffee and snacks will be provided, but lunch will be on your own. There are many restaurants within a short walking distance of Middlebush Hall.

Detailed Schedule for the Day

Registration and Software Installation Help: 8:30 am – 9:20 am

 

  • Laptops will not be provided. Bring your own laptop (Mac or PC).
  • You are strongly encouraged to follow the emailed instructions, preinstall all required software, and download the course materials to your laptop before you arrive.

 

Welcome Speech by Prof. Chris Wikle, Curators' Distinguished Professor and Department Chair, Department of Statistics, University of Missouri: 9:20 am – 9:30 am

 

Session 1: Introduction to Python

9:30 am – 10:45 am

o Overview 

o Basic Commands, Import Modules 

o Comparisons to R, Important Syntax Differences (Array Indices, etc.) 

o Data Structures, Functions, etc.
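
As a minimal sketch of the array-index difference noted above (this is not taken from the course materials, and the list `values` is made-up example data): Python counts positions from 0 and its slices exclude the end position, while R counts from 1 and its ranges include both endpoints.

```python
# Hypothetical example data, not from the course materials.
values = [10, 20, 30, 40, 50]

# Python indexes from 0, so the first element is values[0].
# (In R, the first element of a vector is values[1].)
first = values[0]

# Slices include the start position and exclude the end position.
# values[1:3] picks positions 1 and 2; the rough R equivalent is values[2:3].
middle = values[1:3]

# Negative indices count from the end in Python.
# (In R, a negative index drops that element instead.)
last = values[-1]

print(first, middle, last)  # prints: 10 [20, 30] 50
```

Off-by-one mistakes in index ranges are a common stumbling block when moving between the two languages, so it can help to try a few slices on a small list like this one.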

15 Minute Break

 

Session 2: Data Exploration/Manipulation

11:00 am – 12:15 pm

o Reading Data 

o Data Summaries, Descriptive Statistics 

o Data Processing w/ Pandas; Imputation, Feature Creation

Lunch (12:15 pm – 1:15 pm)

 

Session 3: Data Visualization

1:15 pm – 1:45 pm

o Matplotlib 

o Seaborn 

Session 4: Regression with Statistical Models

1:45 pm – 2:15 pm

Standard regression models and analysis using Python

15 Minute Break

 

Session 5: Regression & Machine Learning with Scikit-Learn

2:30 pm – 3:30 pm

Popular machine learning models using Python

15 Minute Break

 

Session 6: Interfacing with R

3:45 pm – 4:30 pm

How to access Python and R functions from one another