Skip to content

Instantly share code, notes, and snippets.

@SamPenrose
Last active May 23, 2017 23:49
Show Gist options
  • Star 3 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save SamPenrose/04d3244c781aea00ae4b65249cc6c510 to your computer and use it in GitHub Desktop.
Save SamPenrose/04d3244c781aea00ae4b65249cc6c510 to your computer and use it in GitHub Desktop.

Data Analysis: "What, Why, and How" to "You, Here, and Now"

I. Welcome! (5 minutes)

  1. We're going to learn: - what "data" is - what "data analysis" is - the key ingredient whose absence ruins otherwise good work
  2. Open a new browser window and load https://git.io/datatalk (this page).
  3. In a second tab, create an empty Google spreadsheet - You'll need a Google account. - Alternately, you may work in a different spreadsheet application.

II. What is data analysis? (5 minutes)

  1. A warm-up exercise.
  2. Data analysis is organized thinking.
  3. Data analysis is a spiral: + categorize -> count -> compare -> communicate -> consider -> (and perhaps loop back to an earlier step)
  4. Data analysis is a skill you already have.

III. Count and compare: analysis of a real dataset (15 minutes)

  1. In a new tab, open Lena Groeger's wonderful Spreadsheets Lab . We'll do it together. * In Step 5, add a second filter: GRADE DATE is after 09/01/2015 . * In Step 16, summarize CAMIS by COUNT (not COUNTUNIQUE) and choose a pie chart (not bar chart).

IV. Communicate: what did we learn? (5 minutes)

  1. Your responses.
  2. Do we need to loop back? If so, to which step?

V. Consider and (back to) categorize (15 minutes)

  1. Maybe a second dataset will shed more light on our concern. * Create a new spreadsheet of food poisoning data * Form pairs to answer: "Can we find evidence of a problem with our choice?" (10 minutes)
  2. Is our choice in this dataset?
  3. What makes a restaurant dangerous? * Incidents -> reported? * Grades -> not "A"? * Issues -> "critical"?

VI. Discussing what we learned (5 minutes)

  1. What is the key ingredient needed to drive good data analysis? * Hint: see IV.2 and V.3
  2. Can we ever skip the key ingredient?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment