Running the code will display a descriptive statistics table for the data set. Take a screenshot of this output and upload the image below.
Steps for taking a partial screenshot:
In this lesson:
Students will engage in a pre-assessment using Python and Jupyter notebooks.
Given output table for given set of data, interpret the standard deviation in context.
Ex table output:
Plot table in python (screenshot/code)
Interpret std for given data set (text box)
Interpret mean for given data set (text box)
Change labels (25%, 50%, 75%) to (Q1, median, Q3) within Python code. (screenshot/code)
Students will familiarize themselves with Jupyter notebook and various Python commands and functions. They will be asked to complete a set of questions based on a table they create and modify.
Python is a popular programming language, created by Guido van Rossum, we will use to make plots and perform calculations on various data sets.
We will use Jupyter notebooks, which is a web-based platform that acts as a diary for python code.
For the exercises, once you open the Jupyter notebook, to explore your data, refer back to the GIF and video below to see results for each line of code.
Click on the link below to start the pre-assessment:
https://mybinder.org/v2/gh/CT-STEM/Descriptive-Statistics/1.0
To begin, click on "STD_Part1"
Please note that you will need to have two tabs open on your Chromebook - one for CT-STEM and one for Jupyter notebook.
Running the code will display a descriptive statistics table for the data set. Take a screenshot of this output and upload the image below.
Steps for taking a partial screenshot:
File | Delete |
---|---|
Change the table so that only "count", "mean" and "std" are displayed. Take a screenshot of the new table and upload image below.
File | Delete |
---|---|
Interpret the mean for this data set (Average US Gas Price).
Interpret the standard deviation for this data set (Average US Gas Price).
Change the labels for 25%, 50%, and 75% to the names of those quartiles. Take a screenshot of the updated table and upload image below.
File | Delete |
---|---|
What does Q1 represent, in the context of this situation?
What does the median represent, in the context of this situation?
What does Q3 represent, in the context of this situation?
Plot a boxplot of this data set using Python commands and functions. Take a screenshot of the boxplot and upload image below.
File | Delete |
---|---|
What does the length of the "box" represent in the boxplot?
What descriptive statistics are NOT shown on the boxplot?
Plot a histogram of this data set using Python commands and functions. Take a screenshot of the histogram and upload image below.
File | Delete |
---|---|
Describe the distribution (shape, center, spread, outliers).
What descriptive statistics are NOT shown on the histogram?