Data Science Beginner Course
• Prerequisites: None.
2 Course Overview
3 Learning Goals
There are several goals for this course. By the end of the quarter, you
should be able to:
4 Assignments
– X : complete
– X− : submitted, but incomplete
– 0 : not submitted or less than half complete
5 Course Policies
• Final Project: You will work with a partner on your final project.
You are responsible for ensuring that no written work is copied or
summarized from other sources without appropriate attribution.
Introduction to Data Science Using R
6 Resources
6.2 Immigration
The Bruin Resource Center’s (BRC) Undocumented Student Program
(USP) offers caring and personalized support to undergraduate and
graduate undocumented students.
Even if you are not undocumented, you may be able to get legal
help for a family member. The USP office provides immigration legal
services to students and their family members through a partnership
with the UC Undocumented Legal Services Center.
6.3 Software
You will need to acquire some software before the first day of class:
• You will need to download and install R from this website:
https://cran.r-project.org/.
6.4 Using R
The resource we will use most is R for Data Science by Wickham
and Grolemund. An online version of the text is available here
(http://r4ds.had.co.nz/) or you can purchase the book once it’s
published (http://goo.gl/EUUNeh).
There are many online and print resources for learning R. I’ve
included a few below, but a Google search will turn up many others.
Online:
Print:
• R Cookbook (http://goo.gl/Hh1REr)
We are going to rely heavily on two R packages: dplyr for data
manipulation (https://goo.gl/Nc5gIV), and ggplot2 for visualization
(http://docs.ggplot2.org/current/).
7 Course Schedule
1 Introductions (4/3)
• Chapters 1 and 2: Introduction. Garrett Grolemund and Hadley Wickham. R for Data Science. O’Reilly
• Mike Loukides. What is data science?, June 2010. URL https://www.oreilly.com/ideas/what-is-data-science
• Sharp Sight Labs. Why you should learn R first for data science, January 2015. URL http://www.r-bloggers.com/why-you-should-learn-r-first-for-data-science/
• Paul Curzon. A recipe for programming, 2014. URL https://teachinglondoncomputing.files.wordpress.com/2014/07/cs4fnissue16_pr_6_1.pdf
2 Description
3 Wrangling
4 Coding
7 Final Presentations