Data Science Minor Syllabus-Sem-04
Data Science Minor Syllabus-Sem-04
CO1 :: apply the various techniques and functions over spreadsheet for getting various insights of
data
CO2 :: practice the data representation methods like pivot table and power pivoting
CO3 :: determining the need of the graphical representation in the spreadsheet by using various
graphs and charts
CO4 :: apply various techniques to validate data in the spreadsheet and perform what-if analysis
CO6 :: extend the concepts of creating interactive dashboards using slicers, macros and
advanced charts
Unit I
Spreadsheet functions to organize data : cell reference styles, creating and working with
formulas, text functions, date and time functions, lookup and reference functions, mathematical and
statistical functions, information and volatile functions, logical and financial functions, formula
auditing, error handling, string functions
Introduction to Microsoft Excel : UI Basics : introduction to UI basics, about excel, workbooks
and worksheets, customizing excel, reference styles, number formatting, custom number formatting,
conditional formatting, format as table
Unit II
Data representation and manipulation : filter, advanced filter for complex criterion, sorting and
custom sorting, pivot table and pivot chart, power pivot, import data from different sources into
power pivot, reducing file size in power pivoting, connect to multiple different external datasets, DAX
functions
Unit III
Advanced graphing and charting : charts, combo charts, working with objects charts, dynamic
charts and dynamic data source for charts print areas, views for a worksheet, various printing
techniques
Data protection techniques : worksheet protection, protect specific range, workbook protection
and encryption
Unit IV
What-if analysis : Goal Seeker, Scenario Manager, Data Table
Data Validation : Understanding the need for Data Validation, creating a Validation List, adding
Custom Validation Error, Dynamic Formulas by using Validation Techniques
Unit V
Macros : Understanding Excel Macros, Activating the Developer Tab in Excel, creating a Macro with
the Macro Recorder, editing a Macro with VBA, creating Buttons to Run Macros
Unit VI
Creating an interactive dashboard : Principles of Dashboarding, mastering charting techniques,
Macros for interactive dashboard, Visualizations with Sparklines and Shapes, specialized charts
Waterfall chart, funnel chart, adding maps on dashboard, adding slicers and timelines, connecting
slicers with multiple pivot tables, adding hyperlinks to navigate between different sheets
Introduction to spreadsheets
• basic terminology of excel
• spreadsheet environment
• object model of excel
• customizing excel
• reference styles
Rkr Kr ([email protected])
lOMoAR cPSD| 45718333
• number formatting
• custom number formatting
• conditional formatting
• format as table representation and
manipulation of data
• filter
• sorting techniques
Data Cleaning
• Perform data cleaning using String functions, date and Time Functions
• Use Text to Columns
• Removing unwanted values and handling NULL values in Excel
• Restructuring the dataset
• Removing Extra spaces or white spaces
References:
1. FUNDAMENTALS OF BUSINESS ANALYTICS by R.N. PRASAD, SEEMA ACHARYA, WILEY
2. EXCEL HACKS,2/ED TIPS & TOOLS FOR STREAMLINING YOUR SPREADSHEETS by DAVID,
SHROFF/O’REILLY
Rkr Kr ([email protected])
lOMoAR cPSD| 45718333
CO1 :: Analyze and configure R software for statistical programming environment and describe
generic programming language concepts implemented in a high-level statistical language
CO2 :: Demonstrate the programs in the R environment to create custom analytical models to meet
the dynamic business needs
CO3 :: Evaluate and verify the analysis findings by using various packages in R programming
CO4 :: Visualize and customize the various graphical packages for creating various types of graphs,
plots and charts.
CO5 :: Review advanced data science concepts using predictive analytics fundamentals
CO6 :: Appraise and verify the analysis findings by conducting various statistical tests
Unit I
Installation and development environment overview : downloading and installing R from CRAN,
installing R on your windows computer, installation Rstudio, libraries in R and R studio, installing
packages,, using R reference card
Introduction to basics : discover the basic data types and operators in R
Unit II
Vectors and matrices : learn how to work with vectors and matrices in R
Factors : R stores categorical data in factors, learn how to create subset and compare categorical
data
Data frames : creating, merging, naming, filtering, indexing and selection in data frames
Lists : naming, extracting, adding, deleting components from lists, subsetting a list
Unit III
R syntax : conditional statements, loops, functions and packages in R
Data input and output in R : CSV files, excel files and SQL with R
Unit IV
Advanced R programming : mathematical functions, apply family of functions, regular expressions,
dates and timestamps
Data manipulation with R using : data filters, handling missing data, dplyr, tidyr, pipe
Unit V
Text mining in R : Text mining functions, string functions used in R,, analyzing text data for mining
Social media data mining : Facebook data analysis, twitter data analysis
Unit VI
DATA VISUALIZATION WITH R : Explanation and Implementation of Basic types of graphs
(SCATTER PLOT, LINE CHART, BAR CHART, PIE CHART), Explanation and Implementation of Advanced
types of graphs (Word Cloud, Heat Map, Bollinger Band, Donot Chart etc.), Dynamic Visualization
using GGPLOTS, Advanced Visualization using PLOTLY, Implementation of DASHBOARDS using
RMARKDOWN
List of Practicals
• Programs to define basic data types.
• Program to demonstrate different operators.
• Program to implement vector and matrices.
• Program to implement factors, data frame and list
Rkr Kr ([email protected])
lOMoAR cPSD| 45718333
Text Books:
1. DATA ANALYTICS USING R by SEEMA ACHARYA, Tata McGraw Hill, India
References:
1. DATA ANALYSIS : USING STATISTICS AND PROBABILITY WITH R LANGUAGE by BISHNU
PARTHA SARATHI, BHATTACHERJEE VANDANA, PHI Learning
2. DATA SCIENCE AND MACHINE LEARNING IN R by REEMA THAREJA, Tata McGraw Hill, India
Rkr Kr ([email protected])