GPECOM2023 BigData

The document provides an overview of big data concepts, applications, challenges and related technologies. It discusses the rapid growth of data generation and defines key aspects of big data. Examples of big data applications across different fields are examined. Technologies for big data analysis, storage and visualization are also explored.

Uploaded by

Ali Mohammad

See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/372368521

An Overview of Big Data Concepts, Methods, and Analytics: Challenges, Issues, and Opportunities

Conference Paper · June 2023


DOI: 10.1109/GPECOM58364.2023.10175760

All content following this page was uploaded by Hossein Shahinzadeh on 15 July 2023.



2023 5th Global Power, Energy and Communication Conference (IEEE GPECOM2023), June 14-16, 2023, Cappadocia, Turkey

An Overview of Big Data Concepts, Methods, and Analytics: Challenges, Issues, and Opportunities

Mahshad Mahmoudian
Department of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad, Iran.
[email protected]

S. Mohammadali Zanjani*
Smart Microgrid Research Center, Najafabad Branch, Islamic Azad University, Najafabad, Iran.
[email protected]

Hossein Shahinzadeh
Department of Electrical Engineering, Amirkabir University of Technology, Tehran, Iran
[email protected]

Yasin Kabalci
Department of Electrical Engineering, Nigde Ömer Halisdemir University, Nigde, Turkey
[email protected]

Ersan Kabalci
Department of Electrical Engineering, Nevsehir Haci Bektas Veli University, Nevsehir, Turkey
[email protected]

Farshad Ebrahimi
Department of Electrical and Computer Engineering, University of Houston, Houston, TX 77004, USA
[email protected]

* Assistant Professor, Department of Electrical Engineering, Najafabad Branch, Islamic Azad University, Najafabad, Iran.

979-8-3503-0198-4/23/$31.00 ©2023 IEEE

Abstract— In recent years, data generation has been increasing on a large scale and at a fast pace, and Internet applications, mobile applications, and network-connected sensors have spread widely. These applications and extensive internet connections continuously produce a large volume of data with wide diversity and different structures, which is called big data. At the same time, technologies related to big data are also developing. The rapid growth of cloud computing and the Internet of Things (IoT) is accelerating the dramatic growth of data generation. Sensors around the world are collecting and transmitting data that will be stored and processed in the cloud, and the era of big data is coming. In this article, an overview of big data and the definitions of its features are first explained, then the applications of big data in different fields are examined and the challenges facing it are discussed. Finally, technologies related to big data in the areas of data analysis, data storage, and visualization tools are presented, and cloud computing, IoT, and data centers are examined as new technologies that are closely related to big data. The main goal of this article is to provide a comprehensive overview of big data and to examine and explain various aspects of its applications and implementation.

Keywords— Big Data, Data mining, Social networking, Big Data analytics, Decision making, Information technology, Internet of Things, Cloud computing.

I. INTRODUCTION
"Big data" is a relatively new topic in the field of information technology. Many researchers are currently working in this area, and many corporations have become interested in it for a variety of reasons [1]. Because of the considerable applications offered by big data analysis, a variety of businesses and fields of study, most notably power distribution, healthcare, social sciences, insurance, and finance, as well as governmental institutions, have also begun to use it [2]. The analysis of large amounts of data has become more important in modern research as well as in modern business. These data are generated by online transactions, emails, movies, music, photos, click streams, logs, postings, search queries, medical records, interactions on social networks, scientific data, sensors, mobile phones, and the programs that run on them [3]. Big data is stored in databases that grow incrementally and contain a large volume of information, which makes storage, management, sharing, analysis, and data visualization complex tasks that require software tools and complex databases. Over the previous two decades, data has expanded significantly in a variety of domains [4]. According to a report published by the International Data Corporation (IDC) in 2011, the total volume of data produced and copied in the world was 1.8 zettabytes (1.8*1024 exabytes) [5]. This figure has since increased to 40 zettabytes and is projected to reach 175 zettabytes by the year 2025. The progression of big data's expansion from 2010 to its anticipated level in 2025 is shown in Figure 1.

[Figure 1: chart of global data volume in zettabytes (vertical axis, 0 to 180) against time in years (horizontal axis, 2010 to 2025); the 2025 value is labeled 175 ZB.]
Fig. 1. Annual Growth Rate of Global Data [6]

According to this report, the volume of data has grown more than 20 times in less than a decade, and this figure will double every two years in the near future. With the growth of global data, the phrase "big data" has evolved to characterize such huge collections. Big data is a very high volume of semi-structured and unstructured data that needs faster analysis than typical datasets and their associated procedures [7]. Additionally, big data creates opportunities to discover new values and helps to gain a deeper understanding of hidden data values, and of course comes with new challenges, such as how to efficiently manage and organize such data [8-9]. The generation of data has become simpler as a result of advancements in information technology; nowadays, big data originates from the everyday activities of individuals, particularly in connection with the services provided by internet companies. For instance, Google analyzes hundreds of petabytes of data, Facebook generates over ten petabytes of new material each month, and on YouTube an average of 350 hours of video are posted every minute. In addition, the rapid expansion of cloud computing and the Internet of Things has contributed to an increase in the volume of data. Cloud computing offers a standardized method for storing and accessing the digital assets of an organization [10]. As part of the Internet of Things, sensors located all over the world are gathering and transferring data that is then saved and processed in the cloud. This volume of data creates many issues and challenges in storing and retrieving heterogeneous and massive datasets, which require hardware and software infrastructure and new technologies to manage and leverage them. In this article, we will review big data, its challenges, and related technologies.
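The growth figures quoted in the Introduction (1.8 ZB in the 2011 IDC report, 40 ZB since then, 175 ZB projected for 2025, and a doubling every two years) can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original paper; the function names are hypothetical, and the factor of 1024 exabytes per zettabyte simply follows the convention used in the text.

```python
import math

# Conversion factor as used in the text: 1.8 ZB = 1.8 * 1024 EB.
EXABYTES_PER_ZETTABYTE = 1024


def zettabytes_to_exabytes(volume_zb: float) -> float:
    """Convert a volume in zettabytes to exabytes."""
    return volume_zb * EXABYTES_PER_ZETTABYTE


def growth_factor(start_zb: float, end_zb: float) -> float:
    """How many times larger the end volume is than the start volume."""
    return end_zb / start_zb


def years_to_reach(start_zb: float, end_zb: float,
                   doubling_period_years: float = 2.0) -> float:
    """Years needed to grow from start to end if the volume doubles
    once every `doubling_period_years` (an assumed constant rate)."""
    doublings = math.log2(end_zb / start_zb)
    return doublings * doubling_period_years


if __name__ == "__main__":
    print("1.8 ZB in exabytes:", zettabytes_to_exabytes(1.8))
    print("Growth factor, 1.8 ZB to 40 ZB:", round(growth_factor(1.8, 40), 1))
    print("Years from 40 ZB to 175 ZB at a two-year doubling:",
          round(years_to_reach(40, 175), 1))
```

The ratio 40 / 1.8 is a little over 22, which is consistent with the statement that the volume has grown more than 20 times. Under a constant two-year doubling, moving from 40 ZB to 175 ZB takes slightly more than two doublings, that is, a bit over four years.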


First, the definition of big data, its features, and its applications in various fields are explained. Then, the challenges of big data in different areas such as data storage, data visualization, data analysis, data privacy, performance, and scalability will be discussed. Finally, the technologies related to big data in the field of data analysis, data storage, and data visualization, as well as the connection of big data with cloud computing, the Internet of Things, and data centers, will be discussed [11-12].

II. OVERVIEW OF BIG DATA
The term big data refers to a rapidly growing collection of massive and heterogeneous data in structured, unstructured, and semi-structured formats. Due to their complex nature, big data require powerful technologies and advanced algorithms for management and analysis, and traditional business tools are not effective for dealing with them [13]. There is no single agreed definition of big data. In general, big data is a collection of data that cannot be comprehended, gathered, managed, and processed simultaneously using conventional hardware/software tools and information technologies. Because of the importance of the topic, technology companies, researchers, and data analysts have proposed different definitions of big data, which are discussed below [14].

A. Definition of Big Data Characteristics
Big data refers to data assets that are both enormous and complicated, and which need analysis in order to comprehend them and extract information from them [15]. In 2010, Apache Hadoop defined big data as "A dataset that has a high volume, velocity, or variety, and traditional methods are limited in their ability to efficiently analyze it." Based on this definition, in May 2011, McKinsey & Company (a global consulting organization) introduced big data as the "next frontier of innovation, competition, and productivity." The National Institute of Standards and Technology (NIST) defines big data as "data sets that have such high volume, velocity, or variety that traditional methods for efficient analysis are limited." This definition focuses on the technological aspect of big data. Most data scientists and big data experts define big data with three main characteristics (known as the "3Vs").
* Volume: A dataset that conforms to the big data standard is constantly changing and increasing over time. In big data, there is a large amount of data with sizes ranging from terabytes to zettabytes.
* Velocity: Big data is characterized by the rapid generation of data, which, in turn, necessitates the rapid processing of that data in order to derive useful insights. The term "velocity" alludes to the real-time aspect of big data: to make the most of its potential benefits for businesses, the data must be gathered, analyzed, and used in a prompt and efficient manner.
* Variety: Data comes in various types, including structured data such as database data, semi-structured data such as XML data, and unstructured data such as sound, images, videos, web pages, text, etc. [16].
However, others, including IDC, which is one of the most influential leaders in big data and its research fields, have different opinions. In 2011, IDC defined big data as follows: "Big data technologies introduce a new generation of technologies and architectures designed to extract value economically from very large volumes of data with a wide range of diversity, received, discovered, or analyzed at high speeds." With this definition, the characteristics of big data can be defined in the form of 4V, meaning volume (large volume), variety (different methods), velocity (fast production), and value (high value but low density), which is widely recognized. The 4V characteristics of big data are shown in Figure 2 [17].

[Figure 2: "Big Data" at the center with four characteristics. Volume: data at scale, processing at scale. Velocity: speed of data generation, speed of data processing, speed of data requests. Variety: data source diversity, data structure heterogeneity. Veracity: data trustworthiness, data quality.]
Fig. 2. 4V characteristics of big data

B. Applications of Big Data
There are numerous applications for big data, some of which are illustrated in Figure 3 [18].

[Figure 3: "Big Data" at the center, surrounded by application areas: Building and Constructions, Political Decisions, Unemployment, Health, Welfare, Smart Grid, Tax Evaders, Agriculture, Natural Disaster, Insurance.]
Fig. 3. Applications of Big Data

a) Fraud Detection and Control
In business operations, various types of fraudulent claims or fake data exist, and identifying and controlling these data and fraud in transactions is one of the most important applications of big data. In most cases, fraud is discovered long after it occurs, when data has already been lost; at that point, only its effects can be reduced or policies can be implemented to prevent its recurrence. Big data-based platforms can examine and analyze transactions and business operations in real time and detect inappropriate behavior from a user by examining large-scale patterns across all transactions and deals, thereby changing the way fraud and fake data are detected [19].
b) Call Center Data Analysis
Analyzing call center data is one of the useful applications of big data. In current processes, there are no solutions for processing customer data in the call center, and the information and knowledge that a call center can provide is ignored or presented with delay. Big data-based solutions in call centers can identify recurring problems and behavioral patterns of customers and employees by receiving and processing call content, and help improve organizational performance and increase customer satisfaction [20].
c) Social network analysis
One of the most important applications of big data directly related to users is the analysis of user activity on social networks. Users are widely active on social networks and
record a lot of information about their activities on a daily basis, from expressing interest in a company's products on Facebook to expressing opinions or complaints about other products in the form of a message on Twitter. Social network data can provide useful real-time information about market responses to products and campaigns, enabling companies to prepare and offer their products in line with market and customer opinions [21].
d) Financial data analysis
Big data analysis can also be used for financial analysis and forecasting. For example, big data is used in tools for predicting stock market trends to support decision-making in this area [22].
e) Agriculture
Biotechnology centers use sensor data in agriculture to increase crop productivity. They study and simulate plant reactions in different environmental conditions so that plants can adjust to the environment based on this information. In addition, big data can be used to select the type of crop to be cultivated [23].

III. BIG DATA CHALLENGES
The analysis of big data provides attractive and valuable opportunities. However, researchers and experts in this field face multiple challenges when exploring big data and extracting knowledge and value from it. These problems exist at various levels, including storage, data display, analysis, lifecycle management, and redundancy reduction and compression. In addition, issues related to privacy and confidentiality are particular obstacles that must be overcome in distributed applications of big data. Some key obstacles and challenges that must be overcome in developing big data applications are described below [24]. Some of the existing challenges for big data are shown in Figure 4.

[Figure 4: ten numbered challenges (1 to 10) arranged around "Big Data Challenges"; only the labels "Storage" and "Data Lifecycle Management" are recoverable from the extracted text.]
Fig. 4. Some of the challenges for Big Data

A. Storage
Today's hard drives have a capacity of terabytes, while the data generated in big data is far beyond that and is increasing exponentially, reaching exabytes. Traditional data management and analysis systems are based on Relational Database Management Systems (RDBMS); they are only suitable for managing structured data and are unable to store and process such large amounts of semi-structured and unstructured data [25]. The solution to this problem is to use distributed file systems and NoSQL databases, which are designed to manage unstructured data on a large scale.

B. Data Display
The types of datasets, their structures, the meanings of the datasets, their organizations, the granularity of the datasets, and their levels of accessibility can vary widely. The purpose of displaying data is to give it meaning so that it may be interpreted by both users and computers. An unsuitable display, on the other hand, diminishes the value of the primary data and may even impede effective study of the data [26]. Displaying data effectively requires taking into account not only the structure, class, and data type, but also the requirements and preferences of the end user.

C. Redundancy reduction and data compression
In most cases, there is a significant amount of duplicate information present in datasets. If this duplication is reduced and the data is compressed without diminishing the data's potential value, the system's overall indirect costs are reduced. For instance, the majority of the information that is produced by sensor networks has a significant amount of redundancy. This redundancy may be eliminated, and the quantity of the resulting data can be reduced [27].

D. Data Lifecycle Management
Sensors and ubiquitous computing systems are creating data at an unprecedented rate and scale, and present storage systems are not capable of sustaining such enormous volumes of data. Storage systems, by contrast, have advanced only modestly. In data lifecycle management, the worth of the data is taken into consideration to determine which data should be kept and which should be discarded.

E. Analysis
Analyzing big data, which consists of a large volume of unstructured or semi-structured heterogeneous data, requires a lot of resources and time. To address this issue, distributed processing architectures are used, where data is divided into smaller sections and made available for processing by a number of computers in the network, and finally, the processed data are combined [28].

F. Confidentiality of Information
One of the important challenges of big data is the confidentiality and preservation of information. Most big data providers and owners cannot efficiently maintain and analyze their large datasets due to their limited capacity. They rely on external data analysis experts and tools, which increases potential security risks. Therefore, maintaining the confidentiality of information is a major issue and challenge in big data [29].

G. Energy Management
With the increasing volume of data and demand for analysis, processing, storage, and transfer of big data, more electrical energy will inevitably be consumed for these purposes. Therefore, mechanisms for controlling and managing energy consumption levels for big data must be established.

H. Scalability
The big data analytics system must support current and future datasets. Therefore, the analytics algorithm should be capable of processing increasingly complex datasets that are expanding over time [30].

I. Collaboration
Big data analytics is interdisciplinary research that calls for the cooperation of specialists from diverse domains in order to fully utilize the potential of big data. To enable scientists and engineers from diverse professions to access various types of data and fully utilize their knowledge to
interact with one another in order to achieve the analytics objectives, a comprehensive big data network architecture must be developed [31].

IV. BIG DATA MANAGEMENT TOOLS AND TECHNOLOGIES
Big data management involves organizing and utilizing a large amount of data. Its aim is to assure data quality and accessibility for use in Business Intelligence (BI) projects and big data analytics. A variety of big data management solutions are employed for analytics, storage, and visualization, some of which are briefly covered in this section [32]. In addition, technologies related to big data are also discussed in this section.

A. Data Analysis
* Hadoop: An open-source software framework that provides scalable solutions for solving problems with big data on a cluster of computers. Hadoop is made up of two key components: the MapReduce (MR) framework and the Hadoop Distributed File System (HDFS). The data storage source for MR is HDFS, a distributed file system modeled on the Google File System (GFS) and designed to run on commodity hardware.
* Hive: An open-source data warehouse for querying and analyzing large sets of data stored in Hadoop files. It features a SQL-like user interface for querying data held in multiple Hadoop-integrated databases and storage systems. It was initially introduced and developed by Facebook and is now offered as an open-source tool.
* Pig: An advanced environment for developing MapReduce applications using Hadoop. The language used in this platform is Pig Latin, a high-level descriptive language that can express large data gathering and analysis tasks in MR programming.
* Platform: A tool for analyzing and discovering big data. It automatically takes user queries to the target and allows users to interact visually with vast amounts of data at a petabyte scale in the shortest possible time. In effect, it creates an abstraction layer that anyone can use to simplify and organize their datasets.
* Rapidminer: Software that offers an integrated platform for business analysis, predictive analytics, text mining, machine learning, and data mining. Rapidminer covers all data mining operations, including data preparation, validation, visualization, and result optimization. It is used for the development of commercial applications as well as for research and education [33].

B. Storage Technologies
Efficient and effective data storage methods are necessary for the administration of huge volumes of data, because the size and volume of the data continue to rise at an alarming rate. Both the virtualization of storage and the compression of data have been major contributors to the development of this sector.
* HBase: A columnar, non-relational database built on top of the Hadoop Distributed File System (HDFS). HBase gives users real-time read and write access to vast volumes of data that come from a broad range of sources and organizational forms. It is a free and open-source database system that anyone may download and install on their own computers.
* SkyTree: A high-performance platform for machine learning and data analysis that specifically focuses on managing and analyzing big data.
* Non-Relational Databases: A non-relational database, also known as NoSQL, is an approach to managing and constructing databases that is appropriate for vast amounts of data in distributed environments. One of the most widely used of these databases is Apache Cassandra, which was initially developed at Facebook and released under an open-source license in 2008. Additional examples of these databases are SimpleDB, Google BigTable, MongoDB, and Voldemort. Large organizations like Netflix, LinkedIn, and Twitter employ one or more of these databases [34].

C. Data visualization tools
There are numerous open-source data visualization tools, some of which are mentioned below [35].
* R: A free and open-source programming language and environment for statistical computing and graphics. R is often used in statistical software development and data analysis.
* Tableau: A tool used for visualizing results in the form of charts, maps, graphs, and other graphics. Hadoop and Tableau can also be connected so that the two products interact.
* Infogram: This tool allows for the easy selection of a wide range of ready-made visual templates. It also offers further templates such as map charts and videos, and the ability to share created models.
* ChartBlocks: A free online tool that provides the ability to visualize databases and extensive pages without the need for any complex code.
* Tangle: This visual tool provides capabilities beyond data visualization and allows designers and developers to design programs interactively for a better understanding of data relationships.

D. Big data-related technologies
Some significant technologies that are closely connected to big data are covered in this section.
a) Cloud computing
Cloud computing has a close relationship with big data. Figure 5 depicts the main components of cloud computing. The term "cloud computing" refers to a type of technology that is capable of storing significant amounts of data. The main goal of cloud computing is to use centralized management of computational resources and capacities to provide various applications by sharing resources in a unified manner and making these applications accessible to users in a transparent and efficient manner [36].

[Figure 5: layered diagram. Top: Cloud Computing Applications and Services, split into Traditional Applications and Services, and Bigdata Applications and Services. Middle: Virtual Resources Pool, Flexible Resource Scheduling Management, and Virtualization on one side; Inquiry, Analysis and Excavate Parallel Algorithm, Parallel Computing, and Distributed Storage on the other. Bottom: Cloud Computing Resources and Platform.]
Fig. 5. The main components of cloud computing [37]

The proliferation of cloud-based computing services has opened up new avenues for managing massive datasets. In turn, the advent of big data hastens the maturation of cloud computing. Cloud computing and its offshoot, cloud storage, have made it possible to effectively handle massive data sets. Big data acquisition and analysis in the cloud may be sped up with the use of parallel computing power [38].
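The parallel processing mentioned above follows the same pattern described in Section III.E: data is divided into smaller sections, each section is processed independently, and the partial results are combined. The following is a minimal single-machine sketch of that pattern in Python, counting words in a list of text records. It is an illustration only, not Hadoop's MapReduce API; all function names are hypothetical, and a thread pool stands in for the networked computers a real cluster would use.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Iterable, List


def split_into_chunks(records: List[str], n_chunks: int) -> List[List[str]]:
    """Divide step: split the records into at most n_chunks sections."""
    n_chunks = max(1, min(n_chunks, len(records) or 1))
    return [records[i::n_chunks] for i in range(n_chunks)]


def count_words(chunk: Iterable[str]) -> Counter:
    """Process step: count words in one section, independently of the others."""
    counts: Counter = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def combine(partials: Iterable[Counter]) -> Counter:
    """Combine step: merge the per-section counts into one result."""
    total: Counter = Counter()
    for partial in partials:
        total.update(partial)
    return total


def parallel_word_count(records: List[str], n_workers: int = 4) -> Counter:
    """Run divide, process, and combine using a pool of worker threads."""
    chunks = split_into_chunks(records, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(count_words, chunks))
    return combine(partials)


if __name__ == "__main__":
    sample = ["big data in the cloud", "cloud storage for big data"]
    print(parallel_word_count(sample, n_workers=2).most_common(3))
```

Because each section is processed without reference to the others, the process step can be moved to separate machines without changing the divide or combine logic, which is the property that distributed frameworks rely on.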
[Figure 6: layered diagram. Top layer, Smart Grid and Smart Cities: Sustainability, "Smartization", and Smart Grid IoT Integration (power and IT management, integrated city mobility, security management, environment information system, weather intelligence, e-mobility). Middle layer, application domains: Smart Energy, Smart Mobility, Smart Water, Smart Public Service, and Smart Homes and Buildings. Bottom layer, Big Data Analytics (social, economical, technical, policy), with the pipeline Source, Data Access, Data Processing, Modeling, Deployment, Monitoring.]

Fig. 6. A visual representation of equipment and methods for collecting data in the smart grid and smart cities based on IoT

b) Internet of Things
A large number of network sensors are installed in various devices around the world, collecting various types of data such as network communication data, environmental data, geographic data, astronomical data, and more. Since the information resources collected in the Internet of Things come from various environments, the big data generated by the Internet of Things has different characteristics compared to general big data [39]. Heterogeneity, diversity, lack of structure, redundancy, and rapid growth are some of the characteristics of big data generated by the Internet of Things. Figure 6 shows the equipment and methods for collecting information in the smart grid and smart cities based on the Internet of Things platform.
An Intel report has mentioned three characteristics of big data in the Internet of Things:
* Abundant terminals that produce large amounts of data.
* The Internet of Things often produces semi-structured or unstructured data.
* Data from the Internet of Things is valuable only after analysis.
c) Data Center
Data centers are not just centralized storage facilities for the data of one organization; they also have additional responsibilities. Data collection, data processing, data organization, data value optimization, and operations are all performed in a data center. A data center organizes and maintains a large volume of data in accordance with its primary goal and development route. Big data's rise has presented data centers with both possibilities and obstacles for expansion [40-42].

V. CONCLUSION
In this review article, big data and related concepts, including definitions, features, challenges, and leading issues and technologies in data analysis, storage, and data visualization, have been discussed. Additionally, the Internet of Things, cloud computing, and data centers have been explained as technologies closely related to big data that contribute to its progress and development. Despite significant advancements in the field of big data, there are still significant shortcomings in this area compared to other technologies, and many issues remain to be resolved. Standardization, technologies related to big data storage, real-time performance, big data management, search, exploration and analysis of big data, the development of big data applications in various fields, and mechanisms for big data security and privacy are issues for which researchers must provide appropriate solutions.
REFERENCES [23] Cravero, A., Pardo, S., Sepúlveda, S., & Muñoz, L. (2022). Challenges
to Use Machine Learning in Agricultural Big Data: A Systematic
[1] Escobar, C. A., McGovern, M. E., & Morales-Menendez, R. (2021). Literature Review. Agronomy, 12(3), 748.
Quality 4.0: a review of big data challenges in manufacturing. Journal
of Intelligent Manufacturing, 32, 2319-2334. [24] Demirol, D., Das, R., & Hanbay, D. (2022). A key review on security
and privacy of big data: issues, challenges, and future research
[2] Kong, L., Liu, Z., & Wu, J. (2020). A systematic review of big data- directions. Signal, Image and Video Processing, 1-9.
based urban sustainability research: State-of-the-science and future
directions. Journal of Cleaner Production, 273, 123142. [25] Mazumdar, S., Seybold, D., Kritikos, K., & Verginadis, Y. (2019). A
survey on data storage and placement methodologies for cloud-big data
[3] Nti, I. K., Quarcoo, J. A., Aning, J., & Fosu, G. K. (2022). A mini- ecosystem. Journal of Big Data, 6(1), 1-37.
review of machine learning in big data analytics: Applications,
challenges, and prospects. Big Data Mining and Analytics, 5(2), 81-97. [26] Mikalef, P., Boura, M., Lekakos, G., & Krogstie, J. (2019). Big data
analytics and firm performance: Findings from a mixed-method
[4] Hajjaji, Y., Boulila, W., Farah, I. R., Romdhani, I., & Hussain, A. approach. Journal of Business Research, 98, 261-276.
(2021). Big data and IoT-based applications in smart environments: A
systematic review. Computer Science Review, 39, 100318. [27] Dai, H. N., Wang, H., Xu, G., Wan, J., & Imran, M. (2020). Big data
analytics for manufacturing internet of things: opportunities,
[5] Mallappallil, M., Sabu, J., Gruessner, A., & Salifu, M. (2020). A review challenges and enabling technologies. Enterprise Information
of big data and medical research. SAGE open medicine, 8, Systems, 14(9-10), 1279-1303.
2050312120934839.
[28] ur Rehman, M. H., Yaqoob, I., Salah, K., Imran, M., Jayaraman, P. P.,
[6] Janev, V. (2021). Semantic intelligence in big data applications. Smart & Perera, C. (2019). The role of big data analytics in industrial Internet
Connected World: Technologies and Applications Shaping the Future, of Things. Future Generation Computer Systems, 99, 247-259.
71-89.
[29] Talha, M., Abou El Kalam, A., & Elmarzouqi, N. (2019). Big data:
[7] Lampropoulos, G. (2023). Artificial Intelligence, Big Data, and Trade-off between data quality and data security. Procedia Computer
Machine Learning in Industry 4.0. In Encyclopedia of Data Science Science, 151, 916-922.
and Machine Learning (pp. 2101-2109). IGI Global.
[30] Gupta, R., Jadav, N. K., Nair, A., Tanwar, S., & Shahinzadeh, H. (2022,
[8] Moradi, J., Shahinzadeh, H., Nafisi, H., Marzband, M., & September). Blockchain and AI-based Secure Onion Routing
Gharehpetian, G. B. (2019, December). Attributes of big data analytics Framework for Data Dissemination in IoT Environment Underlying 6G
for data-driven decision making in cyber-physical power systems. Networks. In 2022 Sixth International Conference on Smart Cities,
In 2020 14th international conference on protection and automation of Internet of Things and Applications (SCIoT) (pp. 1-6). IEEE.
power systems (IPAPS) (pp. 83-92). IEEE.
[31] Han, H., & Trimi, S. (2022). Towards a data science platform for
[9] Kabalcı, Y., & Ali, M. (2019, June). Energy Internet: A Novel Vision improving SME collaboration through Industry 4.0
for Next-Generation Smart Grid Communications. In 2019 1st Global technologies. Technological Forecasting and Social Change, 174,
Power, Energy and Communication Conference (GPECOM) (pp. 96- 121242.
100). IEEE.
[32] Rao, T. R., Mitra, P., Bhatt, R., & Goswami, A. (2019). The big data
[10] Pramanik, S., & Bandyopadhyay, S. K. (2023). Analysis of Big Data. system, components, tools, and technologies: a survey. Knowledge and
In Encyclopedia of Data Science and Machine Learning (pp. 97-115). Information Systems, 60, 1165-1245.
IGI Global.
[33] Pavithra, N., & Manasa, C. M. (2021, December). Big Data Analytics
[11] Pal, K. (2023). A Review of Big Data Analytics for the Internet of Tools: A Comparative Study. In 2021 IEEE International Conference
Things Applications in Supply Chain Management. Applied AI and on Computation System and Information Technology for Sustainable
Multimedia Technologies for Smart Manufacturing and CPS Solutions (CSITSS) (pp. 1-6). IEEE.
Applications, 221-245.
[34] Ikegwu, A. C., Nweke, H. F., Anikwe, C. V., Alo, U. R., & Okonkwo,
[12] Escobar, C. A., McGovern, M. E., & Morales-Menendez, R. (2021). O. R. (2022). Big data analytics for data-driven industry: a review of
Quality 4.0: a review of big data challenges in manufacturing. Journal data sources, tools, challenges, solutions, and research
of Intelligent Manufacturing, 32, 2319-2334. directions. Cluster Computing, 25(5), 3343-3387.
[13] Shahinzadeh, H., Zanjani, S. M., Moradi, J., Fayaz-dastgerdi, M. H., [35] Archana Acharya, T., & Veda Upasan, P. (2020). A stitch in time saves
Yaïci, W., & Benbouzid, M. (2022, October). The Transition Toward nine: a Big Data analytics perspective. In Smart Technologies in Data
Merging Big Data Analytics, IoT, and Artificial Intelligence with Science and Communication: Proceedings of SMART-DSC 2019 (pp.
Blockchain in Transactive Energy Markets. In 2022 Global Energy 227-243). Springer Singapore.
Conference (GEC) (pp. 241-246). IEEE.
[36] Abualigah, L., Diabat, A., & Elaziz, M. A. (2021). Intelligent workflow
[14] Sharma, A., Singh, G., & Rehman, S. (2020). A review of big data scheduling for Big Data applications in IoT cloud computing
challenges and preserving privacy in big data. Advances in Data and environments. Cluster Computing, 24(4), 2957-2976.
Information Sciences: Proceedings of ICDIS 2019, 57-65.
[37] Srinivas, J., Das, A. K., & Rodrigues, J. J. (2020). 2PBDC: privacy-
[15] Ahmed, H., & Ismail, M. A. (2020). Towards a novel framework for preserving bigdata collection in cloud environment. The Journal of
automatic big data detection. IEEE Access, 8, 186304-186322. Supercomputing, 76, 4772-4801.
[16] Zanjani, S. M., Zanjani, S. H., Shahinzadeh, H., Rezaei, Z., Kaviani- [38] Bagherzadeh, L., Shahinzadeh, H., Shayeghi, H., Dejamkhooy, A.,
Baghbaderani, B., & Moradi, J. (2022, November). Big Data Analytics Bayindir, R., & Iranpour, M. (2020, July). Integration of cloud
in IoT with the Approach of Storage and Processing in Blockchain. computing and IoT (CloudIoT) in smart grids: Benefits, challenges, and
In 2022 6th Iranian Conference on Advances in Enterprise solutions. In 2020 International Conference on Computational
Architecture (ICAEA) (pp. 1-6). IEEE. Intelligence for Smart Power System and Sustainable Energy
[17] Reda, O., Sassi, I., Zellou, A., & Anter, S. (2020, September). Towards (CISPSSE) (pp. 1-8). IEEE.
a data quality assessment in big data. In Proceedings of the 13th [39] Kabalci, Y., Kabalci, E., Padmanaban, S., Holm-Nielsen, J. B., &
International Conference on Intelligent Systems: Theories and Blaabjerg, F. (2019). Internet of things applications as energy internet
Applications (pp. 1-6). in smart grids and smart environments. Electronics, 8(9), 972.
[18] Shi, Y. (2022). Advances in big data analytics. Adv Big Data Anal. [40] Shahinzadeh, H., Mirhedayati, A. S., Shaneh, M., Nafisi, H.,
[19] Keskar, V., Yadav, J., & Kumar, A. (2022). Perspective of anomaly Gharehpetian, G. B., & Moradi, J. (2020, December). Role of joint 5G-
detection in big data for data quality improvement. Materials Today: IoT framework for smart grid interoperability enhancement. In 2020
Proceedings, 51, 532-537. 15th International Conference on Protection and Automation of Power
[20] Wang, J., Yang, Y., Wang, T., Sherratt, R. S., & Zhang, J. (2020). Big Systems (IPAPS) (pp. 12-18). IEEE.
data service architecture: a survey. Journal of Internet [41] Shahinzadeh, H., Moradi, J., Gharehpetian, G. B., Nafisi, H., & Abedi,
Technology, 21(2), 393-405. M. (2019, January). IoT architecture for smart grids. In 2019
[21] Ghani, N. A., Hamid, S., Hashem, I. A. T., & Ahmed, E. (2019). Social International Conference on Protection and Automation of Power
media big data analytics: A survey. Computers in Human System (IPAPS) (pp. 22-30). IEEE.
Behavior, 101, 417-428. [42] Moazzami, M., Sheini-Shahvand, N., Kabalci, E., Shahinzadeh, H.,
[22] Ren, S. (2022). Optimization of enterprise financial management and Kabalci, Y., & Gharehpetian, G. B. (2021, October). Internet of things
decision-making systems based on big data. Journal of architecture for intelligent transportation systems in a smart city.
Mathematics, 2022, 1-11. In 2021 3rd Global Power, Energy and Communication Conference
(GPECOM) (pp. 285-290). IEEE.
