Hill Side School
Big Data Technologies
Subject: ICT
Class: 12A
Prepared By: Group - 3
Group members
1. Kaleab Zeru ………..
2. Meklit Kifle ………..
3. Mersen Birhanu …………
4. Mikiyas Aschalew ………
5. Nahom Degefu ..……….
6. Nathan Dawit …………
7. Noub Tefera …………
8. Nathnael Kifle …………
9. Rahel Abebe …………
10. Yonathan Mengistu………
Submitted to: Sir Yohannes Tsegaye
Oct, 2024
Introduction
This sub unit introduces Big – data, a collection of data that are so massive and complex that
they become challenging to process using typical data processing software
or readily available database management tools. This sub unit will emphasize the
characteristics, Benefits, Challenges and Applications of Big-Data .
What is Big-Data?
The amount of data generated is increasing in different dimensions including data
sizes. This vast amount of generated data is leading us to the creation of Big-data.
Big-data starts with the exponential explosion in the amount of data we have
generated since the dawn of the digital age. This is largely due to the rise of
computers, the Internet, and technology capable of capturing information from the
real and physical world we live in,
According to the latest estimates 402.74 million terabytes of data are created each
day. Data generated has grown significantly since 2010, In space of 13 years data
generated has grown by 74x from just 2 million terabytes.
In fact it is estimated that 90% of the world’s data was created in the last 2 years and
It is expected to increase by over 150%
What is Big data science
Big- data science is a field that deals with the collection analysis and
interpretation of large complex data sets
What is Big data technology
Big data technologies refers to the software and hard ware systems that enable
the collection storage , processing
Characteristics of Big-data
Big-data is characterized 5v, this refers to the first letters of big data characteristics ,
namely: Velocity, Veracity, Volume, Value, Veracity.
1. Variety
Refers to the nature of data
The data generated could be structured, unstructured or semi-structured
Structured data types – spread sheet, Relational Data base ,
Unstructured data types- Audio, Image , Text, Video
Semi-structured data types- Programming languages ( Json, XML, HTML)
2. Velocity
Refers to the speed at which data is being created in real time
3. Volume
Refers to the huge amount of data that is being created from various sources
4. Value
Refers to the amount of valuable , reliable and trust worthy data that needs to be stored,
processed, and analyzed to find insights
5. Veracity
Refers to degree of reliability of that data has to offer
Benefits of Big Data
Big data provides numerous benefits to organizations and industries, cure disease and
prevent cancer, maximize crop yields, explore distant planets, predict and respond to
natural and man-made disasters, prevent crimes, and more.
Big data has many advantages
Customer acquisition and retention:
Consumer data can help the marketing efforts of companies, which can act on trends to
increase customer satisfaction.
Targeted advertisements (ads):
personalized ads are made after analyzing past purchases, interaction patterns, and
product page viewing histories
Product development
Allows to update existing products/services while innovating new ones..
Price optimization
This minimizes the manual work and reduces the possibility of any man-made errors.
Risk management:
Helps organizations predict and mitigate potential risks by analyzing patterns.
Improved decision making:
By analyzing large datasets, businesses can make faster and more informed decisions.
APPLICATION OF BIG DATA
The following are sectors in which Big-data can contribute by generating value:
A. Health care: inform patients, prevent diseases, intervene and manage hospital data
using variety of big data technologies. These efforts can improve the patient experience, care
efficiency, and quality, and reduce healthcare costs
B. Education: is used for personal learning based on the students performance and
engagement level , it is also used to evaluate applications of applicants to determine who will
be a good fit for the institutions.
C. Banking : Big-data solutions can detect fraudulent behaviors in real-time, such
as credit/debit card usage, inspection track archiving, and more. It also helps banks in their
compliance verification, auditing, and reporting processes.
This simplifies the processes while lowering overhead costs
D. Agriculture: smart agriculture and precision agriculture practices using big-data will
improve decision making of farmers in crop management . This ultimately increases the
agricultural out puts
E. Manufacturing: In the manufacturing sector, Big-data helps create a transparent
infrastructure, predicting uncertainties and incompetence that can affect the business
adversely
F. Retail: personalized customer experiences agreeing used widely to boost sales, increase
revenue and deliver improved customer service
G. Transportation: transportation corporations employ Big-data technologies to optimize
route planning, control traffic, manage road congestion, and improve services in countries all
over the world.
H. Media and Entertainment: this sector has been revolutionized under Big-data, by
providing valuable insight into consumer behaviour, content preferences and market trends
CHALLENGES OF BIG DATA
- Aside from the benefits, Big-data also has challenges related to data quality, storage, a
shortage of data science experts, validating data, and gathering data from various sources.
o Managing the big data growth: Big data grows quickly over time making management of
them more difficult
o Lack of data professionals: Companies demand skilled data specialists to manage
Big-data solutions. These experts are
Data scientists : Use statistical and machine learning techniques to extract insights
from data
Data analysts: clean prepare and analyze data to answer specific questions
Data engineers: build and maintain the infrastructure for storing, processing and
analyzing data
o Securing data : protecting such a massive and complex data from cyber attacks is
challenging
o Integrating data from variety sources: big data is collected from various sources, but not
all data obtained is necessary, ensuring the reliability and accuracy is difficult
SUMMARY
This sub unit introduces Big – data, a collection of data that are so massive and complex that
they become challenging to process using typical data processing software
Big-data is characterized 5v, this refers to the first letters of big data characteristics ,
namely: Velocity, Veracity, Volume, Value, Veracity.
Benefits of Big Data include Customer acquisition and retention ,Targeted
advertisements (ads),Product development Price optimization, Risk management and
Improved decision making
Big data involves in different sectors like, Health care, Education, Banking Agriculture
, Manufacturing, Retail, Transportation, Media and Entertainment
Managing big data growth, Lack of data professionals ,Securing data , Integrating data
from variety sources are challenges of Big Data .