
FDS Model Exam for Data Science CS3352

This document is a model examination paper for the course CS3352 - Foundations of Data Science, intended for the second year, third semester students. It includes various questions divided into three parts, covering topics such as data science definitions, data cleansing, regression analysis, and data visualization using Python. The exam is structured to assess students' understanding and application of data science concepts and techniques.


Reg. No.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING


R21 CS3352-FOUNDATIONS OF DATA SCIENCE
MODEL EXAMINATION
Time: 3 Hrs. QUESTION CODE : CS32301 Maximum Marks : 100
Session : FN Year / Sem : II/3 Date : 05-12-2024
K1-Remembering K2-Understanding K3-Applying
K4-Analyzing K5-Evaluating K6-Creating
Answer ALL questions
PART – A (10X2=20 Marks)
Q.No Question CO's BT Level
1. Define Data Science and Big Data. CO1 K2
2. List the common errors in retrieving data and the cleansing solutions to be employed. CO1 K2
3. Classify the types of data. CO2 K3
4. Compare and contrast qualitative and quantitative data with an example. CO2 K2
5. What do you mean by the least squares method? CO3 K2
6. Define multiple regression. CO3 K2
7. Outline the two types of NumPy ufuncs. CO4 K2
8. Create a data frame from the key-data pairs A-10, B-20, A-40, C-5, B-10, C-10. Find the sum for each key and display the result grouped by key. CO4 K4
9. Showcase 3D drawing in matplotlib with the corresponding Python code. CO5 K2
10. Write a Python code snippet that generates a time series graph representing COVID-19 incidence cases for a particular week. CO5 K4
Day 1 Day 2 Day 3 Day 4 Day 5 Day 6 Day 7
7 18 9 44 2 5 89
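One possible approach to Question 10 is sketched below. It assumes matplotlib is installed; the function name `plot_weekly_cases`, the axis labels, and the title are illustrative choices and are not specified by the paper. Only the seven daily counts come from the table above.

```python
import matplotlib.pyplot as plt

# Daily incidence counts copied from the table in Question 10.
days = [f"Day {i}" for i in range(1, 8)]
cases = [7, 18, 9, 44, 2, 5, 89]


def plot_weekly_cases(days, cases):
    """Draw a simple line chart of cases per day (hypothetical helper)."""
    fig, ax = plt.subplots(figsize=(8, 4))
    ax.plot(days, cases, marker="o")
    ax.set_title("COVID-19 incidence cases for one week")  # assumed title
    ax.set_xlabel("Day")
    ax.set_ylabel("Reported cases")
    ax.grid(True)
    return fig, ax


fig, ax = plot_weekly_cases(days, cases)
```

Calling `plt.show()` afterwards displays the figure in an interactive session.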

PART – B (5X13=65 Marks)


11 a. Explain the different facets of data with an example. CO1 K2
(OR)
b. Explain in detail cleansing, integrating, and transforming data, and building a model. CO1 K3

12 a. The number of friends reported by Facebook users is summarized in the following frequency distribution. CO2 K3
FRIENDS f
400-Above 2
350-399 5
300-349 12
250-299 17
200-249 23
150-199 49
100-149 27
50-99 29
0-49 36
Total 200
(i) What is the shape of this distribution?
(ii) Find the relative frequencies
(iii) Find the approximate percentile rank of the interval 300-349
(iv) Convert to a histogram
(v) Why would it not be possible to convert it to a stem-and-leaf display?
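Parts (ii) and (iii) can be checked with a few lines of plain Python. The sketch below assumes one common convention, namely that the approximate percentile rank of an interval is the cumulative percentage of observations up to and including that interval. The variable and function names are illustrative.

```python
# Frequency distribution from Question 12(a), listed from the lowest
# interval to the highest.
intervals = [
    ("0-49", 36),
    ("50-99", 29),
    ("100-149", 27),
    ("150-199", 49),
    ("200-249", 23),
    ("250-299", 17),
    ("300-349", 12),
    ("350-399", 5),
    ("400-Above", 2),
]

total = sum(f for _, f in intervals)

# Relative frequency = class frequency / total frequency.
relative = {label: f / total for label, f in intervals}


def cumulative_percent(target):
    """Percentage of observations at or below the target interval."""
    running = 0
    for label, f in intervals:
        running += f
        if label == target:
            return 100 * running / total
    raise KeyError(target)


for label, f in intervals:
    print(f"{label:>9}  f={f:>3}  relative={relative[label]:.3f}")
print("Approximate percentile rank of 300-349:", cumulative_percent("300-349"))
```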
(OR)
b. Demonstrate the different types of variables used in data analytics with an example. CO2 K2

13 a. The values of x and their corresponding values of y are presented below. CO3 K2
x 0.5 1.5 2.5 3.5 4.5 5.5 6.5
y 2.5 3.5 5.5 4.5 6.5 8.5 10.5
(i) Find the least squares regression line y = ax + b. (9)
(ii) Estimate the value of y when x=10. (4)
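A minimal sketch for checking a hand calculation of Question 13(a), using only the standard library. It applies the usual closed-form estimates a = Sxy / Sxx and b = mean(y) - a * mean(x); the function name `least_squares_line` is illustrative.

```python
# Data from Question 13(a).
xs = [0.5, 1.5, 2.5, 3.5, 4.5, 5.5, 6.5]
ys = [2.5, 3.5, 5.5, 4.5, 6.5, 8.5, 10.5]


def least_squares_line(xs, ys):
    """Return slope a and intercept b of the line y = a*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations in x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


a, b = least_squares_line(xs, ys)
y_at_10 = a * 10 + b
print(f"y = {a:.4f}x + {b:.4f}")
print(f"Estimated y at x = 10: {y_at_10:.4f}")
```

For this data the slope works out to 1.25 and the intercept to about 1.554, so the estimate at x = 10 is about 14.05. Since x = 10 lies outside the observed range of x, that estimate is an extrapolation.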
(OR)
b. Calculate the correlation coefficient for the heights (in inches) of fathers (x) and their sons (y) with the data presented below. CO3 K3
x 66 68 68 70 71 72 72
y 68 70 69 72 72 72 74
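The following sketch can be used to verify a hand calculation for Question 13(b). It assumes the Pearson product-moment correlation coefficient is intended and uses the computational formula r = (nΣxy - ΣxΣy) / sqrt((nΣx² - (Σx)²)(nΣy² - (Σy)²)). The function name `pearson_r` is illustrative.

```python
import math

# Heights in inches from Question 13(b).
fathers = [66, 68, 68, 70, 71, 72, 72]
sons = [68, 70, 69, 72, 72, 72, 74]


def pearson_r(xs, ys):
    """Pearson correlation coefficient via the computational formula."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator


r = pearson_r(fathers, sons)
print(f"r = {r:.4f}")
```

With these values the numerator is 189 and the two bracketed terms are 222 and 182, giving r of roughly 0.94, a strong positive correlation.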

14 a. Imagine you have a series of data that represents the amount of precipitation each day for a year in a given city. Using pandas, load the daily rainfall statistics for the city of Chennai in 2021, which are given in the CSV file chennairainfall2021.csv, generate a histogram for rainy days, and find the days that have high rainfall. CO4 K4
(OR)
b. i. How do you create hierarchical data from an existing data frame? (6) CO4 K4

ii. How do you use group by with two columns in a data set? Give a Python code snippet. (7) CO4 K4
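Both parts of Question 14(b) can be illustrated with a small pandas sketch. The data frame below is made up purely for illustration (the column names `city`, `month`, and `rainfall_mm` and all values are assumptions, not data from the paper). Part (i) uses `set_index` with two columns to build a hierarchical (MultiIndex) frame, and part (ii) passes a list of two column names to `groupby`.

```python
import pandas as pd

# Hypothetical sample data; not taken from the question paper.
df = pd.DataFrame(
    {
        "city": ["Chennai", "Chennai", "Chennai", "Madurai", "Madurai", "Madurai"],
        "month": ["Jan", "Jan", "Feb", "Jan", "Feb", "Feb"],
        "rainfall_mm": [10, 5, 0, 2, 7, 3],
    }
)

# (i) Hierarchical data: promote two columns to a two-level row index.
hier = df.set_index(["city", "month"]).sort_index()

# (ii) Group by two columns and total the rainfall within each pair.
totals = df.groupby(["city", "month"])["rainfall_mm"].sum()

print(hier)
print(totals)
```

The grouped result is itself indexed by (city, month), so a single group can be read with `totals.loc[("Chennai", "Jan")]`.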

15 a. Outline any two types of three-dimensional plotting in matplotlib with an example. CO5 K2
(OR)
b. How are text and image annotations done using Python? Give an example of your own with appropriate Python code. CO5 K3
PART – C (1X15=15 Marks)
16 a. Briefly explain data manipulation with pandas in Python with suitable examples. CO4 K3
(OR)
b. Describe in detail geographic data with Basemap. CO5 K2

Prepared by Checked by Approved by


Mrs.E.Jones Merlin-AP/CSE R.Dinesh Raj-HoD/CSE Principal
