Data ('23-'24)
Exploring One Column
Resources
Unit 5 Lesson 2
Name(s)_______________________________________________ Period ______ Date ___________________
Activity Guide - Exploring One Column
Bar Charts
Do This:
● Make a bar chart for each column in the "Dogs" dataset and categorize it as either "useful" or "not useful" by
placing that column name in the table below.
Useful | Not Useful
Chart
Paste one of the charts that you believe is "useful" in the space below.
Questions
1. What is one piece of information someone could learn from the chart you pasted?
2. Do you notice any patterns in which charts are or are not useful?
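Optional reference: a bar chart of one column counts how many rows share each value in that column. The sketch below shows that counting step in Python. The rows and the column names "Name" and "Group" are made-up examples and are not taken from the real "Dogs" dataset.

```python
from collections import Counter

# Hypothetical sample rows; the real "Dogs" dataset has different columns and values.
rows = [
    {"Name": "Beagle", "Group": "Hound"},
    {"Name": "Bloodhound", "Group": "Hound"},
    {"Name": "Boxer", "Group": "Working"},
    {"Name": "Pug", "Group": "Toy"},
]


def bar_counts(rows, column):
    """Count how many rows share each value in the given column."""
    return Counter(row[column] for row in rows)


group_counts = bar_counts(rows, "Group")
name_counts = bar_counts(rows, "Name")

# Print a rough text version of the bar chart, one bar per value.
for value, count in group_counts.items():
    print(f"{value:<8} {'#' * count}")
```

Comparing `group_counts` with `name_counts` is one way to think about what each chart would look like.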
Computer Science Principles 1
Histograms
Do This:
● Make a histogram for the "Max Weight" column of the "Dogs" dataset and choose the "Bucket Size" that you
think is most helpful for interpreting the data.
Chart
Questions
3. What bucket size did you choose? Why do you think this is the most helpful bucket size for this chart?
4. What is the most common range of maximum weights for dog breeds?
5. What is the least common range of maximum weights for dog breeds?
Do This:
● Make a histogram for one other column in the Dogs dataset and choose the "Bucket Size" that you think is
most helpful for interpreting the data.
Chart
Questions
6. What bucket size did you choose and why?
7. What is one question the chart you created could help answer?
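Optional reference: a histogram groups numeric values into equal-width buckets and counts how many values land in each one. The sketch below shows how the bucket size changes the counts. The weights are made-up numbers, and the rule that a bucket includes its starting value but not its ending value is an assumption; the Data Visualizer may draw bucket edges differently.

```python
from collections import Counter

# Hypothetical maximum weights in pounds; not taken from the real dataset.
max_weights = [12, 18, 25, 30, 44, 55, 58, 70, 75, 110]


def histogram(values, bucket_size):
    """Map each bucket's starting value to the number of values in that bucket.

    Assumption: a bucket covers start <= value < start + bucket_size.
    """
    counts = Counter((v // bucket_size) * bucket_size for v in values)
    return dict(sorted(counts.items()))


narrow = histogram(max_weights, 10)  # many small buckets
wide = histogram(max_weights, 50)    # a few large buckets

for start, count in wide.items():
    print(f"{start} to under {start + 50}: {count}")
```

The same ten values produce seven buckets at size 10 and only three at size 50, so the bucket size controls how much detail the chart shows.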
Data ('23-'24)
Filtering and Cleaning Data
Resources
Unit 5 Lesson 3
Name(s)_______________________________________________ Period ______ Date ___________________
Activity Guide - Filtering Data
Data Set: Female State Legislators
Do This:
● Filter the Female State Legislators dataset for Year by the year you were born.
● Create a histogram for Percentage of Females in Legislature. Choose a bucket size of 5.
● Copy/paste the chart below.
Chart 1
● Filter the Female State Legislators dataset for Year by the year 2019.
● Copy/paste the chart below.
● Answer the questions.
Chart 2
Questions
1. In the year you were born, how many states had between 25 and 30 percent of their legislatures made up of
female legislators?
2. In 2019, how many states had between 25 and 30 percent of their legislatures made up of female legislators?
3. What does this data show?
4. Why might that be the case?
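Optional reference: filtering keeps only the rows that match a condition, and the chart is then built from the rows that remain. The sketch below filters by year and counts the rows that fall in one histogram bucket. The rows, the column names, and the bucket rule (25 included, 30 excluded) are assumptions for illustration only.

```python
# Hypothetical rows; the real dataset's column names and values may differ.
rows = [
    {"State": "A", "Year": 2019, "Percent": 27.5},
    {"State": "B", "Year": 2019, "Percent": 31.0},
    {"State": "C", "Year": 2019, "Percent": 25.0},
    {"State": "A", "Year": 2005, "Percent": 18.2},
    {"State": "B", "Year": 2005, "Percent": 26.1},
]


def filter_rows(rows, column, value):
    """Keep only the rows whose column equals the given value."""
    return [row for row in rows if row[column] == value]


def count_in_bucket(rows, column, low, high):
    """Count rows where low <= value < high (an assumed bucket rule)."""
    return sum(1 for row in rows if low <= row[column] < high)


rows_2019 = filter_rows(rows, "Year", 2019)
states_25_to_30 = count_in_bucket(rows_2019, "Percent", 25, 30)
print(states_25_to_30)
```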
Data Set: US Women Running for Elected Office in 2020
Do This:
● Filter the US Women Running for Elected Office in 2020 dataset to answer the following questions. The chart
type is up to you. Paste the chart below each answer.
Question: Which states had at least one woman run for Governor?
Answer:
Chart:
Question: Which states had more than 10 women win their primary race?
Answer:
Chart:
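Optional reference: both questions above follow the same pattern of filtering first and then looking at which states are left. The sketch below assumes one row per candidate with hypothetical "State", "Office", and "Won Primary" columns; the real dataset may be organized differently.

```python
from collections import Counter

# Hypothetical rows, one per candidate. The real dataset's columns may differ.
rows = (
    [{"State": "X", "Office": "U.S. House", "Won Primary": True}] * 11
    + [{"State": "Y", "Office": "Governor", "Won Primary": True}] * 2
    + [{"State": "Y", "Office": "U.S. House", "Won Primary": False}] * 12
)

# Filter to primary winners, then count the remaining rows for each state.
winners = [row for row in rows if row["Won Primary"]]
wins_by_state = Counter(row["State"] for row in winners)
states_over_10 = sorted(s for s, n in wins_by_state.items() if n > 10)

# Filter to one office, then list the distinct states that remain.
governor_states = sorted({row["State"] for row in rows if row["Office"] == "Governor"})

print(states_over_10, governor_states)
```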
Data ('23-'24)
Exploring Two Columns
Resources
Unit 5 Lesson 4
Name(s)_______________________________________________ Period ______ Date ___________________
Activity Guide - Exploring Two Columns
Cross Tab
1. Crosstab - Use the “Words” dataset
Use: Length & Part of Speech
a. Chart - Copy and paste the chart you created in the space below
b. Which part of speech shows up most often in this dataset?
c. Which part of speech seems to be longest, on average?
d. Which part of speech seems to be shortest, on average?
e. About how long is the "typical" noun? How can you tell?
2. Crosstab - Use the “Favorite Classes” dataset
Use: Grade & Favorite Class
a. Chart - Copy and paste the chart you created in the space below
b. Which class do Seniors like the most?
c. Which grade likes History the most?
d. What is one other interesting pattern you can see in this diagram?
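Optional reference: a crosstab counts how many rows share each pair of values from two columns. The sketch below builds those pair counts and reads one row of the resulting table. The survey rows are made up and do not come from the real "Favorite Classes" dataset.

```python
from collections import Counter

# Hypothetical survey rows; not taken from the real dataset.
rows = [
    {"Grade": "Senior", "Favorite Class": "Art"},
    {"Grade": "Senior", "Favorite Class": "Art"},
    {"Grade": "Senior", "Favorite Class": "History"},
    {"Grade": "Junior", "Favorite Class": "History"},
    {"Grade": "Junior", "Favorite Class": "History"},
]


def crosstab(rows, row_column, col_column):
    """Count rows for every (row value, column value) pair."""
    return Counter((row[row_column], row[col_column]) for row in rows)


table = crosstab(rows, "Grade", "Favorite Class")

# Read across the "Senior" row of the table to find its largest cell.
senior_counts = {cls: n for (grade, cls), n in table.items() if grade == "Senior"}
top_senior_class = max(senior_counts, key=senior_counts.get)
print(top_senior_class)
```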
Scatter
1. Scatter - Use the “US States” dataset
Use: Median Household Income & Percent Adult College Graduates
a. Chart - Copy and paste the chart you created in the space below
b. What is the range of incomes on this chart?
c. What is the range of percentages of adult college graduates?
d. Do you see a relationship between income and percent of adult college graduates? What does the
relationship look like?
2. Scatter - Use the “US States” dataset
Use: You pick the columns to compare
In the US States table, find a pair of columns that you think shows an interesting pattern in a scatter plot.
a. Chart - Copy and paste the chart you created in the space below
b. What is the pattern that you observe in this data?
c. Describe what this pattern might mean or indicate.
d. Usually we can't be sure whether the pattern we observe in a scatter chart (a correlation) means that
one of the columns is causing the other. Give one way you might do more research to try to investigate
the cause of the pattern you found.
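Optional reference: the range of a column is its largest value minus its smallest, and one common way to put a number on the relationship in a scatter chart is the Pearson correlation coefficient. The sketch below computes both. The income and percentage values are made up, and using Pearson's r here is an illustrative choice, not something the Data Visualizer is known to report.

```python
from math import sqrt

# Hypothetical values for five states; not taken from the real dataset.
income = [45000, 52000, 60000, 68000, 75000]
percent_grads = [22.0, 25.5, 29.0, 31.5, 36.0]


def pearson(xs, ys):
    """Pearson correlation: near 1 or -1 means a strong linear pattern, near 0 means none."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


income_range = max(income) - min(income)
r = pearson(income, percent_grads)
print(income_range, round(r, 2))
```

A value of r close to 1 only says the two columns rise together. It does not say that one column causes the other.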
Data ('23-'24)
Big, Open, and Crowdsourced Data
Resources
Unit 5 Lesson 5
Name(s)_______________________________________________ Period ______ Date ___________________
Activity Guide - Big, Open, and Crowdsourced Data
Background
Today you will research one of three topics at the intersection of data analysis, computing, and society. These topics all
use the data analysis process in new and interesting ways to address problems.
Topic 1: Big Data
● AI and Parallel Systems: https://www.youtube.com/watch?v=1XGo8K1boH4
● Big Data and Medicine (Code.org): https://www.youtube.com/watch?v=bMrDHtGHFR4
● Key vocabulary to look for: Scalability, Parallel systems
Topic 2: Crowdsourced Data
● How Pokemon Inspired A Citizen Science Project...: https://www.npr.org/2018/04/20/597972310
● What is Citizen Science?: https://youtu.be/81hhecI0p5k
● Key vocabulary to look for: Citizen science, Crowdsource
Topic 3: Open Data
● What is Open Data?: https://www.youtube.com/watch?v=qSD9ob8rGcs
● The Case for Open Data: https://www.youtube.com/watch?v=iOrPK7p2AwI
● Key vocabulary to look for: Open data, Open access
What topic did you choose? ________________________________________________
As you watch the videos, keep notes on examples of how they complete each step of the data analysis process.
● Collect or Choose Data:
● Clean and/or Filter Data:
● Visualize and Find Patterns:
● New Information:
What is the core idea of your topic? What is it about?
Give two examples of the problems / questions your topic is being used to solve / answer.
Data ('23-'24)
Project - Tell a Data Story
Resources
Unit 5 Lesson 8
Name(s)_______________________________________________ Period ______ Date ___________________
Project Guide - Tell a Data Story
Background
Computing tools allow people from many different fields to learn from data. By transforming and visualizing data, we can
find patterns that help us learn about our world.
For this project you will:
● Select a dataset from the Data Library
● Use the Data Visualizer to make a visualization that helps us learn something interesting about the data
● Write a response that explains:
○ what your visualizations show
○ new insights or decisions that can be made based on this visualization
○ possible problems with your analysis including potential bias in the data or your interpretation
Tell a Data Story
Data Analysis Process | Written Response
Select a dataset from the Data Library. Read the metadata to understand what information is
available in the table.
Dataset Name:
Short Description:
Did you filter or clean the data? Why or why not?
Create a visualization. Copy/paste the chart below.
What information does your visualization show?
What new insights or decisions can be made based on this visualization?
What are the possible problems with your analysis, including potential bias in the dataset
or your interpretation?
Rubric
● Collect or Choose Data
○ Extensive Evidence: Dataset is correctly identified and description is complete.
○ Convincing Evidence: Dataset is correctly identified and description is mostly complete.
○ Limited Evidence: Dataset is correctly identified and description is somewhat complete.
○ No Evidence: Dataset is not identified or description is missing.
● Clean/Filter Data
○ Extensive Evidence: Explanation for cleaning and/or filtering is complete and reasonable.
○ Convincing Evidence: Explanation for cleaning and/or filtering is complete and mostly reasonable.
○ Limited Evidence: Explanation for cleaning and/or filtering is somewhat complete or somewhat reasonable.
○ No Evidence: Explanation for cleaning and/or filtering is incomplete.
● Visualize and Find Patterns
○ Extensive Evidence: Visualization is readable and includes a title and citation.
○ Convincing Evidence: Visualization is mostly readable and includes a title and citation.
○ Limited Evidence: Visualization is somewhat readable and/or is missing a title or citation.
○ No Evidence: Visualization is unreadable or missing.
● New Information: Interpreting the visualization
○ Extensive Evidence: The visualization is thoroughly explained.
○ Convincing Evidence: The visualization is mostly explained.
○ Limited Evidence: The visualization is somewhat explained.
○ No Evidence: The visualization is not explained.
● New Information: Insights or Decisions
○ Extensive Evidence: Insights or decisions are reasonable and effectively linked to information displayed in the visualization.
○ Convincing Evidence: Insights or decisions are mostly reasonable and effectively linked to information displayed in the visualization.
○ Limited Evidence: Insights or decisions are mostly reasonable and somewhat effectively linked to information displayed in the visualization.
○ No Evidence: Insights or decisions are missing.
● New Information: Bias
○ Extensive Evidence: Possible problems with analysis or potential bias are reasonable and thoughtfully addressed.
○ Convincing Evidence: Possible problems with analysis or potential bias are mostly reasonable and thoughtfully addressed.
○ Limited Evidence: Possible problems with analysis or potential bias are somewhat reasonable and addressed.
○ No Evidence: Possible problems with analysis or potential bias are not addressed.