Data Analytics
Vijay Kumar .A
Assoc. Professor
Dept. of CSE
NMREC
1
Outlines
• Data Analytics
• Analytics Vs Analysis,
• How it works
• Analytics Stages
• Methodology
• Tools
• Applications 2
Data is Powerful and Everywhere
• 2.7 Zetta bytes of electronic data exist in the world today –
2,700,000,000,000,000,000,000 bytes
• This is equal to the storage required for more than 200 billion HD movies
• New data is produced at an exponential rate.
•.
Data Hierarchy
Data Lake cloud/Data
warehouse cloud
Data Lake
Data
warehouse
Database
File
Record
Field
Byte
Bit
What is Analytics?
• The science of using data to build models that lead to better decisions that add value
to individuals, to companies, to institutions.
How analytics works
• Analytics
• It is the science of wisely acquiring meaningful results
from given data using various methods and technologies.
• Aims at discovering pattern of variation from the given
data
• It helps to understand the future from past data and the
uncertainty related to business.
Gather Organize Analyze
Data Data Data
Big Data Analytics: Basic Data Models 12/25/2019 6
Knowledge Requirements for Data Analytics
Business Domain
Data Modeling
• Choosing the right data to include in models is important.
• Important to have some thoughts as to what variables might be related
.
• Domain knowledge is necessary to understand how they can be
• used. Role of Business Analyst is crucial
MATCH THE FOLLOWING
1. DATA ANALYTICS a) SCIENCE
2. DATA ANALYSIS b) FIELD
3. DATA SCIENCE c) PROCESS
DATA ANALYTICS - SCIENCE DATA ANALYSIS - PROCESS
Analytics vs Analysis
Analytics: Analysis:
Analytics is the science of analysis Analysis is the process of breaking
whereby statistics, data mining, down a complex object into its
computer technology, etc. is used simpler forms.
in doing analysis.
12/25/2019 10
Data Science: A Multidisciplinary Field
Why Data Analytics
• Gather hidden insights
• Generate reports
• Perform market analysis
• Improve Business Requirement
Who is Data Analyst
• Who collects data
• Analyse data
• Create reports
Data Analyst Skills
• Statistics
• Data cleaning and Data Manipulation(EDA)
• Data Visualization
• Data Mining is a popular type of data analysis technique
Types of Data Analysis
Analytics types
Why has the Which
How many Which
drop-out rate students
students students are
increased in should I target
dropped out most likely to
the last one to keep from
last year? drop out?
year? dropping out?
Descriptive Diagnostic Predictive Prescriptive
Information Insights Decision
12/25/2019 18
Data Analytics Life cycle / Methodology
DISCOVERY
DATA
PUT INTO USE PREPARING
DELIVER MODEL
RESULTS PLANNING
MODEL
BUILDING
12/25/2019 20
Data Analytics Life cycle / Methodology
Data Analytics tools
Most popular Data Analytics tools are
Microsoft Excel
R
Python
IBM Watson 22
Tools
Data Analytics applications
24
Application of Data Analytics:
25
Financial services: Credit Retail: Promotions, demand
scoring, fraud detection, forecasting, merchandizing
pricing, claims analysis optimization
Manufacturing: Inventory Health care: Drug interaction,
replenishment, product preliminary diagnosis, disease
customization, supply chain
Applications management
optimization
Energy: Trading, supply, Communications: Customer
demand forecasting, retention, capacity planning,
compliance network optimization
12/25/2019 26
• Sentiment Analysis – Unstructured data
• Recommendation systems – Netflix, Amazon
• Advertising on the Web
• Social-Network Graphs
27
Summarizing Data
Summarizing is the process of Summaries differ based on the
converting huge amounts of type of data; and can be
raw data into a format that descriptive or graphical.
can be easily analyzed.
Marital Status Frequency
Population by Marital Status
3000 Single 203
2,580
2500 Married 2,580
2000
Widowed 334
1500
1000 Divorced 367
500 334 367
203 Separated 46
0
Single Married Widowed Divorced Total 3,530
29
30