BIG DATA
ANDHRA LOYOLA
COLLEGE
BIG DATA
› Big Data is a field dedicated to the analysis,
processing, and storage of large collections of
data sets that frequently originate from disparate
sources.
BIG DATA
› It is required when traditional technologies and
techniques are insufficient.
› DataficationCapturing/Collecting Big Data
BIG DATA - Sources
› Social Data FB, Twitter, Instagram, LinkedIn
› Machine Data RFID, Sensors, GPS
› Transactional Data Amazon, Flipkart, e-Bay
Structuring Big Data
› Arrangement of the available data in a manner that is easy to study,
analyze and derive conclusions from it.
› Todays Information Processing Systems can analyze and structure a large
amount of data specially for you on the basis of your interests and
search criteria.
› It helps in understanding user behaviours, requirements and preferences
to make personalized recommendations for every individual.
Structuring Big Data
Ex: Recommended list of products based on earlier
purchases
Big Data can be useful for structuring the data and
presenting a specially customized recommendation set for
every user.
Types of Big Data
› 1. Structured Data
› 2. UnStructured Data
› 3. Semi-Structured Data
› *Meta Data
Types of Big Data
› 1. Structured Data:
› Conforms to a data model or schema.
› It is often stored in a tabular form.
› Makes it easier for any program to sort, read and process the data.
› It is most often stored in a relational database
› ERP and CRM systems
Types of Big Data
› 1. Structured Data:
› Represented using the following figure:
›
› Ex; Banking transactions, invoices, and customer records.
Types of Big Data
› 2. UnStructured Data:
› Does NOT Conform to any data model
› It has a faster growth rate
› Ex: Some common types of Un-Structured data:
Types of Big Data
› 2. UnStructured Data:
› This form of data is either textual or binary
› Texual may contain the contents of various tweets or
blog postings.
› Binary may be the media files that contain image,
audio or video data
Types of Big Data
› 2. UnStructured Data:
› This form of data is either textual or binary
› Stored in BLOB
› NoSQL
Types of Big Data
› 3. Semi-Structured Data:
› It has a defined level of structure and consistency
› A semi-structured data is hierarchical or graph-based.
› For example stored in XML and JSON files
› EDI Files, Spreadsheets
Types of Big Data
› 4. Meta Data
› Provides information about a dataset’s characteristics and
structure
› Important for Semi, Unstructured data processing.
Elements of Big Data
› The Five Vs of Big Data
For a dataset to be considered Big Data:
1. Volume
2. Velocity
3. Variety
4. Veracity
5. Value
Elements of Big Data
› The Five Vs of Big Data
1. Volume :
Volume refers to the scale (amount) of data
generated each second
from social media, smart phones, cars, credit cards,
M2M sensors, photographs, video, etc.
Elements of Big Data
› The Five Vs of Big Data
2. Velocity :
In Big Data environments, Velocity refers to the
speed at which vast amounts of data are being
generated, collected and analyzed.
Elements of Big Data
› The Five Vs of Big Data
2. Velocity :
In Big Data environments, Velocity refers to the
speed at which vast amounts of data are being
generated, collected and analyzed.
Elements of Big Data
› The Five Vs of Big Data
2. Velocity :
Figure: Examples of High Velocity Big data sets
THANK YOU