Iqra University, Islamabad
Dept of Computing and Technology
Course Outline
Course Course Title Introduction to Data Science
Information Course ID Course Type Elective
Credit hours 3 Hours per week (C-L) 3-0
Programs BS (CS), BS(SE), Preferred Semester 6
BS(AI)
Date 23-07-2024 Version 1.1
Instructor Qurat Ul Ain Aini
Course Data Science is an emerging field and involves the study of the generalizable extraction of knowledge from data. This
Description is a merger of various disciplines including mathematics, statistics, machine learning, databases and other branches
of computer science along with a good understanding of the craft of problem formulation to engineer effective
solutions. It has overlapping boundaries with already established areas like Data Mining and Machine Learning. The
major concepts that will be covered during course are probability, statistical inference, visualization, exploratory
data analysis (EDA), linear and logistic regression, model evaluation and various machine learning algorithms such as
Decision trees, single and complete linakge, k-means clustering, and Principal Component Analysis. On the
programming aspect, Python will be used as an implementation tool.
Course GA 2 GA 5
Learning Understand basic concepts of data science,
Outcomes CLO 1 statistics and probability and their application C4 ---
(CLO) and in understanding behavior of data.
Mapping
CSC 477
Theory
Apply basic tools for performing exploratory
with CLO 2 C5 ---
Graduate data analysis and visualization.
Attributes
(GAs) CLO 3 Understand basic predictive modeling and C6 ---
data analysis methods
Lecture type Classroom Lectures, Presentations
Textbook Title Edition Authors Publisher Year ISBN
Data Science For Dummies 2nd Lillian For Dummies 2nd 9781119327639
Edition Pierson
(Author),
Jake Porway
(Foreword)
References
Assessment Assessment Weight Used to attain CLO Assessment Weight Used to attain
Criteria CLO
(100%) Assignment 10% CLO1 – CLO3 Quiz 15% CLO1-CLO3
Lab 0% Project / Mega Quiz 10% CLO1-CLO3
Attendance 0% Participation 0%
Mid Term 25% CLO1, CLO2 Final 40% CLO1-CLO3
Methods of Assignments, Quizzes, Midterm Exam and Final Term Exam.
Evaluation
Notes -
Assessment Assessment Goal
Goal Assignment Making personas, scenarios and prototypes
Achievement Quiz Usability evaluation and heuristic evaluation, golden rules of interface design
Project/Presentation Presentation of Innovative projects/ ideas
Exam - Basic knowledge of the core theories, HCI Process models and methodologies used in real
world applications.
- Technical skills for designing, verifying and validating interactive (Web based and
application based) systems for end users.
Page 1 of 4
Week Topic Lecture Lecture Contents Relation with CLO
No. No.
Page 2 of 4
W1. L1. - What is data science, Big data and data science hype, properties CLO1
Overview of the of data, data science functionalities, what is a data scientist in
Data Science L2. academia and industry
W2. Statistical L3. - Statistical distributions, populations and samples, populations and CLO1
Distributions samples of big data
L4.
W3. Exploratory Data L5. Exploratory Data Analysis (EDA) CLO1
Analysis (EDA), Data What is EDA?
Science process Different EDA techniques
L6. Using EDA to understand your data
Data storytelling (new topic)
W4. EAD, Data L7. Developing a visualization aesthetic CLO2
Visualization Data Visualization
What is data visualization?
L8.
Different types of data visualizations
Using data visualization to communicate your findings
Interactive data visualization (new topic)
W5. Data Visualization L9. Chart types, CLO2
Great visualizations,
L10. Increasing visualization
W6. Machine Learning L11. Machine learning and Data Science CLO1
Introduction to ML algorithms
L12.
W7. Machine Learning L13. Three basic algorithms CLO2
Algorithms Decision Trees
L14. K-nearest neighbors (k-NN),
K-means, exercises
Single and Complete Linkage
W8. Mid Term Exam L15. CLO1-CLO2
Mid Term Examination
Week L16.
W9. Machine Learning L17. Linear Regression CLO2
Algorithms Logistic Regression
Support Vector Machines(SVM)
L18.
W10. Classifiers L19. Classifiers CLO3
Interpretability
L20. Scalability,
Better regression models,
Regression as parameter fitting
W11. Classification L21. Classification and logistic regression, CLO3
Issues in regression
L22.
W12. Features L23. Feature Selection CLO2
Feature Modulation
L24. Data Preprocessing
W13. Probability and L25. Population and sample CLO2
Statistics Probability and Non probability
Descriptive Statistics
L26.
Inferential Statistics
Page 3 of 4
W14. Recommendation L27. Recommendation Engines, CLO3
Algorithms Measuring distance,
Graphs,
Networks and distances,
L28. PageRank
W15. PCA & New Topics L29. Dimensionality problem, CLO3
in Data Science Principal component analysis (PCA)
L30. Big data, on being a data scientist,
Societal and ethical implications
W16. Final exam Final exam CLO2-CLO3
Page 4 of 4