0% found this document useful (0 votes)
55 views5 pages

Ijet V3i3p13

This document discusses big data analytics. It begins by defining big data as large volumes of data from various sources that is difficult to handle using traditional databases due to its size and variety. It describes the four V's of big data - volume, velocity, variety, and value. Examples of big data sources and analytics are provided, such as analyzing social media data and seismic data. Benefits of big data analytics include increased productivity, output, and decision making. Related works discussed include data warehousing and the need for a clear business need, strong sponsorship, and fact-based decision making culture for big data projects.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
55 views5 pages

Ijet V3i3p13

This document discusses big data analytics. It begins by defining big data as large volumes of data from various sources that is difficult to handle using traditional databases due to its size and variety. It describes the four V's of big data - volume, velocity, variety, and value. Examples of big data sources and analytics are provided, such as analyzing social media data and seismic data. Benefits of big data analytics include increased productivity, output, and decision making. Related works discussed include data warehousing and the need for a clear business need, strong sponsorship, and fact-based decision making culture for big data projects.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

International Journal of Engineering and Techniques - Volume 3 Issue 3, May-June 2017

RESEARCH ARTICLE OPEN ACCESS

Automated Predictive Big Data Analytics Using


Ontology
1
Based
2
Semantics
3 4
Karthikeyan R , Dr.T.Geetha , Vasanthakumar K , Vetrivel A
1,2
Asst.Prof, Dept of MCA, Gnanamani college of Technology, Namakkal, INDIA.
3,4
P.G.Scholar, Dept of MCA, Gnanamani college of Technology, Namakkal, INDIA.

Abstract:
Study the growth history of galaxies by following their 'merger trees' in large-scale astrophysical simulations. The
service uses the addition of lambda expressions and the Stream API in Java 8, Java has gained a powerful and
expressive query language that operates over in-memory collections of Java objects, making the transformation and
analysis of data more convenient. There is much enthusiasm currently about the possibilities created by new and more
extensive sources of data to better understand and manage cities. Here, I explore how big data can be useful in urban
planning by formalizing the planning process as a general In this paper we discuss Pig SPARQL, a competitive yet easy
to use SPARQL query processing system on Map Reduce that allows adhoc SPARQL query processing on large RDF
graphs out of the box. Instead of a direct mapping, Pig SPARQL uses the query.

INTRODUCTION
I.WHAT IS BIG-DATA?
Big data and analytics are vast topics in both One perspective is that big data is more and
the popular and business process. Today, many different kinds of data than is easily handled by
organizations are collecting, storing, and analyzing traditional relational database management systems
massive amounts of data. This data is commonly (RDBMSs). Some people consider 10 terabytes to be
referred to as “big data” because of its volume, the big data, but any numerical definition is likely to
velocity with which it arrives, and the variety of forms change over time as organizations collect, store, and
it takes. Big data is creating a new generation of analyze more data.
decision support data management Collecting and
storing big data creates little value.What is new is the
coming together of advances in computer technology
and software, new sources of data (e.g., social media),
and business opportunity.This confluence has created
the current interest and opportunities in big data
analytics. It is even spawning a new area of practice
and study called “data science” that encompasses the
techniques, tools, technologies, and processes for
making sense out of big data.

Describes the four Vs of big-data. Like as


Volume ,Velocity, Veriety,And Value.
A. Big-Data Analytics
By itself, stored data does not generate business
value, and this is true of traditional databases, data

ISSN: 2395-1303 http://www.ijetjournal.org Page 77


International Journal of Engineering and Techniques - Volume 3 Issue 3, May-June 2017

warehouses, and the new technologies such as Hadoop related to the condition of the trucks and their locations
for storing big data. Once the data is appropriately [Watson and Leonard, 2011]. This data is stored in the
stored, however, it can be analyzed, which can create cloud and analysed in various ways, with information
tremendous value. A variety of analysis technologies, delivered to various users, from drivers to senior
approaches, and products have emerged that are executives, on iPads and other tablet computers.
especially applicable to big data, such as in-memory
analytics, in-databaseanalytics, and appliances (all II BENEFITS OF BIG-DATA
discussed later).
ANALYTICS
Research shows the benefits of using data and
B. Big-Data Sources analytics in decision making. One study of 179 large
Big data has many sources. For example, every mouse publicly tradedfirms found that companies that have
click on a web sitecan be captured in Web log files and adopted data-driven decision making have output and
analyzed in order to better understand shoppers buying productivity that is 5%to 6% higher than that of other
behaviors and to influence their shopping by firms. The relationship extends to other performance
dynamically recommending products. Social measures such as asset utilization, return on equity, and
mediasources such as Facebook and Twitter generate market value. Imagine a world with an expanding
tremendous amounts of comments and tweets. This data population but a reduced strain on services and
can be captured and analyzed to understand, for infrastructure; dramatically improved health care
example, what people think about new product outcomes with greater efficiency and less investment;
introductions.Image, voice, and audiodata can be intensified threats to public safety and national borders,
analyzed for applications such as facial recognition but greater levels of security; more frequent and intense
systems in security systems. weather events, but greater accuracy in prediction and
management. Imagine a world with more cars, but less
II.EXAMPLES OF BIG-DATA congestion; more insurance claims but less fraud; fewer
ANALYTICS natural resources, but more abundant and less.
expensive energy. The impact of big data has the
potential to be asprofound as the development of
A. Introducing a New Coffee Product at theInternet itself.
Starbucks
Starbucks was introducing a new coffee III.RELATED WORKS
product but was concerned that customers would find cloud-based conference[1], Data warehousing
its taste too strong.The morning that the coffee was
[2],
rolled out, Starbucks monitored blogs, Twitter, and
niche coffee forum discussion groups to assess
customers reactions A. A Clear Business Need
It is common knowledge that projects should
B.Drilling for Oil at Chevron be business rather than technology driven. They
Each drilling miss in the Gulf of Mexico costs should address a business need such as solving a
Chevron upwards of $100 million. To improve its problem or seizing an opportunity
chances of finding oil,Chevronanalyzes 50 terabytes of Automobile insurance
seismic data.The geologists at Chevron took this time to Telecommunications
Manufacturing, distribution, and retail
seize the opportunity offered by advances in computing
Transportation and logistics
powerand storage capacity to refine their already Utilities
advanced computer models. Gaming
C.Monitoring Trucks at U.s.Xpress Law enforcement
U.S. Xpress is a transportation company. Its cabs In many organizations, the initial business case
for big data analytics focuses on customer-centric
continuously stream more than 900 pieces of data
objectives and uses existing and newly accessible

ISSN: 2395-1303 http://www.ijetjournal.org Page 78


International Journal of Engineering and Techniques - Volume 3 Issue 3, May-June 2017

internal sources of data. Big data analytics can be


especially helpful for companies that seek to understand
customers better, develop meaningful relationships with
customers, and improve operations that enhance the
customer experience.

B. Strong, Committed Sponsorship

Without solid sponsorship, it is difficult to


succeed with any IT project, and this includes big data
analytics projects. If the project is departmental,
sponsorship can reside at the departmental level.
However, projects that are more strategic and enterprise
wide should have senior management support.
Users and applications access the data from
C. A Fact-Based Decision-Making Culture the warehouse to support decision making. Data
To benefit from big data analytics, decisions warehouses are primarily designed for the storage and
must be based on “the facts” (generated by analytics) analysis of structured data—that is, data easily stored in
and there should be constant experimentation to see the rows and columns of relational databases. The data
what works best. Changing the organizational culture is used for queries, reporting, online Analytical
associated for how decisions are made can be more processing (OLAP), dashboards/scorecards, data
challenging than solving technical issues. As the CEO visualization, and regulatory and compliance
explained: “Their idea of marketing was giving requirements
balloons and suckers along the teller line and running
focus groups, but marking has become very A. In-Memory Analytics
analytical.”Senior management can do other things to In-memory technology comes in two forms:
change the culture either on the platform or with the BI tool. When
implemented on theplatform, the server stores the data
D.A Strong Data Infrastructure in-memory, and data is accessed by the BI tools. In-
Data is critically important to BI and analytics. memory BI tools also move data between disk (e.g., in
When a strong data infrastructure is in place, the data warehouse) and the local desktop memory so
applications can often be developed in days. Without a that the most frequently used data (so called “hot data")
strong data infrastructure, applications may never be is available in memory.
completed. IT understands the importance of the data
infrastructure, but the business units sometimes assume B. In-Database Analytics
it is a given and don’t fully appreciate what is required A change is taking place as to where analytics
to create and maintain it. is performed. In the past, data was moved to a server
(think of asandbox) and the analysis was performed
IV DATA WAREHOUSES there.
For many organizations, data warehouses
provide the single version (or source) of the truth for
decision support data. The data is extracted from source
systems (e.g., operational systems, ERPs), transformed
(e.g., consistent formats), integrated (e.g., around a
common key, such as a customer ID), and loaded into
the data warehouse. The data can be thought of as
“squeaky clean” because of the care taken to ensure its
accuracy.

ISSN: 2395-1303 http://www.ijetjournal.org Page 79


International Journal of Engineering and Techniques - Volume 3 Issue 3, May-June 2017

In describes for Data has been transferred from storage and access of any type of data (e.g., Web logs,
transactional analytics to database analytics to generate XML files) as long as the data can be put in a file and
a report. copied into HDFS.

V CLOUD-BASED SERVICES CONCLUSION


The cloud is now in the mainstream of
computing. The potential benefits of the cloud include
Should be effective and strong .
access to specialized resources, quick deployment, Summarize the main program.
easily expanded capacity, the ability to discontinue a Suggest future avenues of research.
cloud service when it is no longer needed, cost savings, Chance to give opinion.
and good backup and recovery. private clouds are With the advent of Hadoop the new release of
implemented within a company’s firewall. Concerns Hadoop known as Yet Another Resource thinking has
been solidified. As is explained in this chapte, Hadoop
about data security is a primary reason that private
YARN separates the resource scheduling part from the
clouds are sometimes preferred over public clouds. We MR paradigm. It should be noted that in the first-
will discuss public clouds—although the same generation Hadoop, the scheduling was tied with
approaches and technologies are used with private implying that the only processing that was possible on
clouds. Cloud services are available as software-as-a- data was the MR type or its orchestrations.
service (SaaS), platform-as-a-service (PaaS), or
infrastructure-as-aservice(IaaS), depending on what
software is provided. REFERENCES

VI .HADOOP/MAP REDUCE 1. Y. Feng, B. Li and B. Li, "Airlift": Video


Apache Hadoop is a software framework for conferencing as a cloud service using inter-
processing large amounts of data across potentially datacenter networks, in Proceedings of the
IEEE International Conference on Network
massively parallel clusters of servers.
Protocols(ICNP'12), (2012).
2. R.Karthikeyan,” Improved Apriori Algorithm
for Mining Rules” in the International Journal
of Advanced Research in biology Engineering
science and Technology Volume 11, Issue 4,
April 2016, Page No:71-77.
3. R.Karthikeyan,Dr.T.Geetha “Honeypots for
Network Security”, International journal for
Research & Development in
Technology.Volume 7.Issue 2 ,Jan 2017,Page
No.:62-66 ISSN:2349-3585
4. R.Karthikeyan,”A Survey on Position Based
Routing in Mobile Adhoc Networks” in the
international journal of P2P Network Trends
and Technology, Volume 3 Issue 7 2013,
ISSN:2249-2615
. 5. R.Karthikeyan,”A Survey on Sensor
To illustrate, Yahoo has over 42,000 servers in Networks” in the International Journal for
its Hadoop installation. The key component of Hadoop Research & Development in Technology
is the Hadoop Distributed File System (HDFS), which Volume 7, Issue 1, Jan 2017, Page No:71-77
manages the data spread across the various servers. It is 6. R.Karthikeyan,Dr.T.Geetha “Web Based
Honeypots Network”,in the International
because of HDFS that so many servers can be managed
journal for Research & Development in
in parallel. HDFS is file based and does not need a data Technology.Volume 7.Issue 2 ,Jan 2017,Page
model to store and process data. It can store data of any No.:67-73 ISSN:2349-3585.
structure, but is not a RDBMS. HDFS can manage the

ISSN: 2395-1303 http://www.ijetjournal.org Page 80


International Journal of Engineering and Techniques - Volume 3 Issue 3, May-June 2017

7. R.Karthikeyan,Dr.T.Geetha,“A Simple
Transmit Diversity Technique for Wireless
Communication”,in the International journal
for Engineering and Techniques. Volume 3.
Issue 1, Feb 2017, Page No.:56-61 ISSN:2395-
1303.
8. C.Ganesh,B.Sathyabhama,Dr.T.Geetha “ Fast
Frequent Pattern Mining using Vertical Data
Format for Knowledge Discovery
“International Journal of Engineering Research
in Management & Technology. Vol.5,Issue-
5,Pages:141-149.
9. R.Karthikeyan,Dr.T.Geetha “Strategy of
Trible – E on Solving Trojan Defense in Cyber
Crime Cases”, International journal for
Research & Development in
Technology.Volume 7.Issue 1 ,Jan 2017,Page
No.:167-171
10. R.Karthikeyan,”A Survey on Position Based
Routing in Mobile Adhoc Networks” in the
international journal of P2P Network Trends
and Technology, Volume 3 Issue 7 2013,
ISSN:2249-2615.
11. K.Ramya and K.Pavithradevi “Effective
Wireless Communication”,International
journal of Advanced Research, Vol 4(12),
pp.1599-1562 dec 2016.
12. R.Karthikeyan,Dr.T.Geetha ”FLIP-OFDM for
Optical Wireless Communications” in the
international journal of Engineering and
Techniques, Volume 3 Issue 1, Jan - Feb 2017,
ISSN:2395-1303,PP No.:115-120.
13. R.Karthikeyan,Dr.T.Geetha”Application
Optimization in Mobile Cloud Computing” in
the international journal of Engineering and
Techniques, Volume 3 Issue 1, Jan - Feb 2017,
ISSN:2395-1303,PP No.:121-125.
14. "Eckerson", W. (2004) “Gauge Your Data
Warehousing Maturity", DM Review, (14)11,
pp. 34.
15. R.Karthikeyan,Dr.T.Geetha”Estimating
Driving Behavior by a smart phone” in the
international journal of Engineering and
Techniques, Volume 3 Issue 2, March 2017,
ISSN:2395-1303,PP No.:84-91.
16. R.Karthikeyan,Dr.T.Geetha”Advanced Honey
Pot Architecture for Network Threats
Quantification” in the international journal of
Engineering and Techniques, Volume 3 Issue
2, March 2017, ISSN:2395-1303, PP No.:92-
96.

ISSN: 2395-1303 http://www.ijetjournal.org Page 81

You might also like