0% found this document useful (0 votes)
7 views

Data Science Minor Syllabus-Sem-04

The document outlines two courses: INT217 Introduction to Data Management and INT232 Data Science Toolbox: R Programming, detailing their course outcomes and unit structures. INT217 focuses on spreadsheet techniques, data representation, and automation using Excel, while INT232 covers R programming fundamentals, data manipulation, and visualization. Each course includes practical experiments and references for further study.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

Data Science Minor Syllabus-Sem-04

The document outlines two courses: INT217 Introduction to Data Management and INT232 Data Science Toolbox: R Programming, detailing their course outcomes and unit structures. INT217 focuses on spreadsheet techniques, data representation, and automation using Excel, while INT232 covers R programming fundamentals, data manipulation, and visualization. Each course includes practical experiments and references for further study.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

lOMoAR cPSD| 45718333

INT217:INTRODUCTION TO DATA MANAGEMENT


L:2 T:0 P:2 Credits:3

Course Outcomes: Through this course students should be able to

CO1 :: apply the various techniques and functions over spreadsheet for getting various insights of
data
CO2 :: practice the data representation methods like pivot table and power pivoting

CO3 :: determining the need of the graphical representation in the spreadsheet by using various
graphs and charts
CO4 :: apply various techniques to validate data in the spreadsheet and perform what-if analysis

CO5 :: employ the power of macros to automate and optimize spreadsheets

CO6 :: extend the concepts of creating interactive dashboards using slicers, macros and
advanced charts

Unit I
Spreadsheet functions to organize data : cell reference styles, creating and working with
formulas, text functions, date and time functions, lookup and reference functions, mathematical and
statistical functions, information and volatile functions, logical and financial functions, formula
auditing, error handling, string functions
Introduction to Microsoft Excel : UI Basics : introduction to UI basics, about excel, workbooks
and worksheets, customizing excel, reference styles, number formatting, custom number formatting,
conditional formatting, format as table
Unit II
Data representation and manipulation : filter, advanced filter for complex criterion, sorting and
custom sorting, pivot table and pivot chart, power pivot, import data from different sources into
power pivot, reducing file size in power pivoting, connect to multiple different external datasets, DAX
functions
Unit III
Advanced graphing and charting : charts, combo charts, working with objects charts, dynamic
charts and dynamic data source for charts print areas, views for a worksheet, various printing
techniques
Data protection techniques : worksheet protection, protect specific range, workbook protection
and encryption
Unit IV
What-if analysis : Goal Seeker, Scenario Manager, Data Table

Data Validation : Understanding the need for Data Validation, creating a Validation List, adding
Custom Validation Error, Dynamic Formulas by using Validation Techniques
Unit V
Macros : Understanding Excel Macros, Activating the Developer Tab in Excel, creating a Macro with
the Macro Recorder, editing a Macro with VBA, creating Buttons to Run Macros
Unit VI
Creating an interactive dashboard : Principles of Dashboarding, mastering charting techniques,
Macros for interactive dashboard, Visualizations with Sparklines and Shapes, specialized charts
Waterfall chart, funnel chart, adding maps on dashboard, adding slicers and timelines, connecting
slicers with multiple pivot tables, adding hyperlinks to navigate between different sheets

List of Practicals / Experiments:

Introduction to spreadsheets
• basic terminology of excel
• spreadsheet environment
• object model of excel
• customizing excel
• reference styles

Rkr Kr ([email protected])
lOMoAR cPSD| 45718333

Session 2023-24 Page:1/2

• number formatting
• custom number formatting
• conditional formatting
• format as table representation and

manipulation of data

• filter
• sorting techniques

• practical on pivot table and pivot chart charts and graphs

• advance graph and charts

Data Cleaning
• Perform data cleaning using String functions, date and Time Functions
• Use Text to Columns
• Removing unwanted values and handling NULL values in Excel
• Restructuring the dataset
• Removing Extra spaces or white spaces

References:
1. FUNDAMENTALS OF BUSINESS ANALYTICS by R.N. PRASAD, SEEMA ACHARYA, WILEY

2. EXCEL HACKS,2/ED TIPS & TOOLS FOR STREAMLINING YOUR SPREADSHEETS by DAVID,
SHROFF/O’REILLY

Rkr Kr ([email protected])
lOMoAR cPSD| 45718333

INT232:DATA SCIENCE TOOLBOX : R PROGRAMMING


L:2 T:0 P:2 Credits:3

Course Outcomes: Through this course students should be able to

CO1 :: Analyze and configure R software for statistical programming environment and describe
generic programming language concepts implemented in a high-level statistical language
CO2 :: Demonstrate the programs in the R environment to create custom analytical models to meet
the dynamic business needs
CO3 :: Evaluate and verify the analysis findings by using various packages in R programming
CO4 :: Visualize and customize the various graphical packages for creating various types of graphs,
plots and charts.
CO5 :: Review advanced data science concepts using predictive analytics fundamentals

CO6 :: Appraise and verify the analysis findings by conducting various statistical tests

Unit I
Installation and development environment overview : downloading and installing R from CRAN,
installing R on your windows computer, installation Rstudio, libraries in R and R studio, installing
packages,, using R reference card
Introduction to basics : discover the basic data types and operators in R
Unit II
Vectors and matrices : learn how to work with vectors and matrices in R

Factors : R stores categorical data in factors, learn how to create subset and compare categorical
data
Data frames : creating, merging, naming, filtering, indexing and selection in data frames

Lists : naming, extracting, adding, deleting components from lists, subsetting a list
Unit III
R syntax : conditional statements, loops, functions and packages in R

Data input and output in R : CSV files, excel files and SQL with R
Unit IV
Advanced R programming : mathematical functions, apply family of functions, regular expressions,
dates and timestamps
Data manipulation with R using : data filters, handling missing data, dplyr, tidyr, pipe
Unit V
Text mining in R : Text mining functions, string functions used in R,, analyzing text data for mining

Social media data mining : Facebook data analysis, twitter data analysis
Unit VI
DATA VISUALIZATION WITH R : Explanation and Implementation of Basic types of graphs
(SCATTER PLOT, LINE CHART, BAR CHART, PIE CHART), Explanation and Implementation of Advanced
types of graphs (Word Cloud, Heat Map, Bollinger Band, Donot Chart etc.), Dynamic Visualization
using GGPLOTS, Advanced Visualization using PLOTLY, Implementation of DASHBOARDS using
RMARKDOWN

List of Practicals / Experiments:

List of Practicals
• Programs to define basic data types.
• Program to demonstrate different operators.
• Program to implement vector and matrices.
• Program to implement factors, data frame and list

Session 2023-24 Page:1/2

Rkr Kr ([email protected])
lOMoAR cPSD| 45718333

• Program to demonstrate the type of loops.


• Program to implement the different types of functions.
• Program to perform different SQL queries.
• Program to demonstrate the different built in statistical, date and timestamp functions.
• Program to demonstrate the concept of data wrangling.
• Program to perform data manipulation using built in packages.
• Program to demonstrate the concept of text mining.
• Program to extract and analyze social media data.
• Program to demonstrate basic visualization methods.
• Program to implement advanced visualization methods.
• Program to demonstrate dashboard in R.

Text Books:
1. DATA ANALYTICS USING R by SEEMA ACHARYA, Tata McGraw Hill, India

References:
1. DATA ANALYSIS : USING STATISTICS AND PROBABILITY WITH R LANGUAGE by BISHNU
PARTHA SARATHI, BHATTACHERJEE VANDANA, PHI Learning
2. DATA SCIENCE AND MACHINE LEARNING IN R by REEMA THAREJA, Tata McGraw Hill, India

3. DATA ANALYTICS by ANIL MAHESHWARI, Tata McGraw Hill, India

Rkr Kr ([email protected])

You might also like