0 ratings0% found this document useful (0 votes) 22 views2 pagesDev 22
It contains dev important questions
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here.
Available Formats
Download as PDF or read online on Scribd
Reg. No.
L I CI
Question Paper Code : 70004
B.E/[Link]. DEGREE EXAMINATIONS, NOVEMBER/DECEMBER 2022.
‘Third Semester
Artificial Intelligence and Data Science
AD 3301 - DATA EXPLORATION AND VISUALIZATION
(Regulations 2021)
‘Time : Three hours Maximum : 100 marks
10.
LL.
Answer ALL questions.
PART A— (10 x 2= 20 marks)
What is meant by EDA?
How do you get cross tabulation?
What is the difference between MATLAB and matplotlib?
Isa histogram always a bar chart? Justify with your answer.
What is the main purpose of univariate analysis?
What is the mathematical mean of the following numbers? 10, 6,4, 4, 6, 4.
What are the three common methods for performing bivariate analysis?
Outline the difference between univariate and bivariate data.
Show the characteristics of multivariate analysis.
What is TSA in Statsmodel?
PART B— (5 x 13 = 65 marks)
(@) What is the primary purpose of EDA? What are the differences between
EDA with classical and Bayesian analysis? Discuss it in detail.
Or
(b) Explain various transformation techniques in EDA.12,
13,
uM.
16.
16.
fa)
)
@
)
@
)
(a)
©)
(a)
)
‘How to over plot a line on a scatter plot in Python? Ilustrate with code.
Or
Discuss with how Seaborn helps to visualize the statistical relationships,
Ilustrate with code and example,
Explain the 10 Essential Numerical Summaries in Statistics with
example
Or
How, When, and Why Should You Normalize / Standardize / Rescale
Your Data?
What is a table of frequency values for a bivariate distribution? Explain
What graph is used in the analysis of bivariate data?
Or
How do you analyze a contingency table? Discuss,
What is meant by time series data? Describe its four components.
Or
What is the beat way to visualize time series data? What patterns might
appear when you plot the time series data?
PART C—(1 x 15 = 15 marks)
What are the tools used for EDA? Give a case study on applying EDA in a
real business scenario.
Or
Discuss in detail about Data Cleaning (missing data, outliers detection
and treatment),
2 70004