ST.
XAVIER’S COLLEGE (AUTONOMOUS), KOLKATA
Department of Information Technology
B. Com Semester III
IT Skills for Business Practical 2024 under NEP 2020
Unit 4: Advanced Data Analysis Techniques
Topics Covered: Statistical Analysis - Descriptive Statistics , Correlation, Regression
Assignment 9
The aim of this practical assignment is to equip students with the skills and knowledge necessary to
perform comprehensive statistical analysis using Spreadsheet Application (Microsoft Excel).
• An in-depth exploration of Descriptive Statistics enables students to summarize and describe the
essential features of their data effectively.
• Students can gain a comprehensive understanding of key statistical functions in MS Excel.
• From Correlation Analysis, students can understand how to quantify the degree of association
between two variables.
• From Regression Analysis, students can learn to model the relationship between a dependent
variable and one or more independent variables to make any predictions.
Open a blank workbook and save it as Spreadsheet9_rollno.xlsx in your folder.
1. Add the following data in Descriptive_Summary_rollno sheet to analyse monthly
units of coffee sales.
• Insert the Descriptive Statistics Summary Table and interpret the output.
2. Add the following data in Stat1_rollno sheet.
Page 1 of 3
ST. XAVIER’S COLLEGE (AUTONOMOUS), KOLKATA
Department of Information Technology
B. Com Semester III
IT Skills for Business Practical 2024 under NEP 2020
a. Use imputation technique mean and mode to find out the discount(%) in whole
number for English and Hindi Textbooks respectively.
b. Find out the following: median quantity sold; most frequent/repetitive cost;
standard deviation of revenue; variation of revenue; first quartile, second
quartile, third quartile of revenue; maximum and minimum quantity textbooks
sold; costs of second most expensive and least expensive book
3. a. Add the following data in Correlation1_rollno sheet:
Month Temp. (C°) Geysers Sold
Jan -5 98
Feb -7 100
Mar 5 75
Apr 10 67
May 18 24
Jun 22 26
Jul 28 25
Aug 25 27
Sep 16 40
Oct 10 55
Nov 2 88
Dec -3 95
i. Find out correlation coefficient between temperature and geysers sold using the
following functions: correl() and pearson()
ii. Display correlation matrix and interpret the correlation coefficient to determine
the strength and direction of the relationship between temperature and geysers
sold.
iii. Create a Scatter Plot to visualize the relationship. Add a trendline to the scatter
plot to observe the pattern between the two variables.
b. In Correlation2_rollno sheet, copy the same data from Correlation1_rollno sheet.
Insert a column Adver. Cost between Temp and Geysers Sold and add the following values:
$100, $100, $95,$90, $50, $45, $45, $50, $75, $80, $95, $100.
i. Find out correlation between more than two variables, temperature, advertising
cost and geysers sold. Display correlation matrix and interpret the output.
ii. Plot the following two correlation graphs: Temperature vs. Geysers Sold and Adver.
Cost vs. Geysers Sold.
4. Add the following data in Regression1_rollno sheet. Here a list of average monthly
rainfall for the last 24 months and the number of raincoats sold are given.
Month Rainfall (mm) Raincoats sold
Jan 82 15
Feb 92.5 25
Page 2 of 3
ST. XAVIER’S COLLEGE (AUTONOMOUS), KOLKATA
Department of Information Technology
B. Com Semester III
IT Skills for Business Practical 2024 under NEP 2020
Mar 83.2 17
Apr 97.7 28
May 131.9 41
Jun 141.3 47
Jul 165.4 50
Aug 140 46
Sep 126.7 37
Oct 97.8 22
Nov 86.2 20
Dec 99.6 30
Jan 87 14
Feb 97.5 27
Mar 88.2 14
Apr 102.7 30
May 123 43
Jun 146.3 49
Jul 160 49
Aug 145 44
Sep 131.7 39
Oct 118 36
Nov 91.2 20
Dec 104.6 32
a. Apply simple linear regression to model the relationship between Rainfall
(independent variable) and Raincoats Sold (dependent variable).
b. Find out the linear regression equation and interpret regression analysis
output.
i. Find out regression equation.
ii. Find out how much raincoats sold changes with each unit change in
rainfall from the value of slope b.
iii. Determine predicted Raincoats Sold when the Rainfall is zero from the
value of intercept a.
iv. Determine from the R-Squared Value how well the regression model fits
the data.
c. Analyse linear regression using the following functions: linest, slope, intercept,
correl.
d. Plot linear regression graph to visualize the relationship between average
monthly rainfall and raincoats sold.
e. Predict how many raincoats would be sold for the following average monthly
rainfall : 90 mm in March, 80 mm in April, 150 mm in June, 120 mm in October
and 70 mm in November (using regression equation).
Reference Links:
https://exceltable.com/en/analyses-reports/regression-correlation-analysis
****************************************************************************************
Page 3 of 3