Zookeeper
1. Name service
It is a service that maps a name to the data associated with that name.
4. Configuration management
ZooKeeper can be used to store and manage the configuration information of a
distributed system centrally. This means that a node joining the system
receives the up-to-date configuration information from ZooKeeper.
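The idea of a joining node receiving the latest configuration can be illustrated with a small in-memory sketch. It does not talk to a real ZooKeeper server; the names `ConfigStore`, `Node` and the `db.url` key are hypothetical and chosen only for illustration.

```python
class ConfigStore:
    """In-memory stand-in for a centrally managed configuration store (hypothetical)."""

    def __init__(self):
        self._data = {}
        self._version = 0

    def set(self, key, value):
        # Every change to the central configuration bumps the version.
        self._data[key] = value
        self._version += 1

    def snapshot(self):
        # Return the current version and a copy of the configuration.
        return self._version, dict(self._data)


class Node:
    """A node of the distributed system (hypothetical)."""

    def __init__(self, name):
        self.name = name
        self.config = {}
        self.config_version = 0

    def join(self, store):
        # On joining, the node fetches whatever configuration is current.
        self.config_version, self.config = store.snapshot()


store = ConfigStore()
store.set("db.url", "jdbc:example://db1")  # initial value (made-up)
store.set("db.url", "jdbc:example://db2")  # later update

late_node = Node("node-3")
late_node.join(store)  # joins after the update, so it sees the new value
```

Here the node that joins after the second `set` call sees only the updated value, which is the behaviour the paragraph above describes.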
Big Data Analytics
HDFS, HIVE and HIVEQL, HBASE
This lets you change the state of the distributed system by changing the
centralized configuration through ZooKeeper.
5. Leader election
If a node fails in a distributed system, automatic failover should be
implemented. ZooKeeper supports failover through leader election.
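The text does not say how the election itself is carried out. As a rough illustration, the sketch below simulates one commonly used approach in which every participant receives an increasing sequence number and the live participant with the lowest number is treated as leader. It is an in-memory model only; `ElectionGroup` and the node names are assumptions made for this example, not part of any ZooKeeper API.

```python
import itertools


class ElectionGroup:
    """In-memory model of a 'lowest sequence number wins' election (hypothetical)."""

    def __init__(self):
        self._counter = itertools.count()
        self._members = {}  # sequence number -> node name

    def join(self, name):
        # Each joining node gets the next sequence number.
        seq = next(self._counter)
        self._members[seq] = name
        return seq

    def fail(self, seq):
        # A failed node simply disappears from the group.
        self._members.pop(seq, None)

    def leader(self):
        # The live node with the smallest sequence number is the leader.
        if not self._members:
            return None
        return self._members[min(self._members)]


group = ElectionGroup()
seq_a = group.join("node-a")
seq_b = group.join("node-b")
seq_c = group.join("node-c")

first_leader = group.leader()   # node-a joined first
group.fail(seq_a)               # the leader fails
second_leader = group.leader()  # failover: node-b takes over
```

When the current leader fails, the next node in sequence order takes over without any manual step, which is the automatic failover the section refers to.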
• ZooKeeper is itself a distributed application, as well as a coordination
service for distributed systems.
ZooKeeper uses a client-server model: the nodes that use the service are
clients, and the nodes that provide the service are servers. A client pings
the server to which it is connected. The ZooKeeper server responds by
acknowledging the ping, indicating that the server is also alive.
If the client does not receive the acknowledgement from the server within the
specified time, it connects to another server in the ensemble. In such a case
the client's session is transferred to the new server.
Fig. 3.3.1 shows the client-server architecture of ZooKeeper.
Fig. 3.3.1 : Client-server architecture of ZooKeeper (figure labels: Zookeeper service, Leader)
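The ping and failover behaviour described above can be sketched with a small simulation. It uses no networking and no timers: a missing acknowledgement is modelled by `ping()` returning `False`. The classes `FakeServer` and `Client` and the session id are hypothetical names used only for this illustration.

```python
class FakeServer:
    """Stand-in for one server of the ensemble (hypothetical)."""

    def __init__(self, name, alive=True):
        self.name = name
        self.alive = alive
        self.sessions = set()

    def ping(self):
        # True models an acknowledgement; False models no reply in time.
        return self.alive


class Client:
    """Client that keeps its session on whichever server acknowledges pings."""

    def __init__(self, session_id, ensemble):
        self.session_id = session_id
        self.ensemble = list(ensemble)
        self.current = self.ensemble[0]
        self.current.sessions.add(session_id)

    def heartbeat(self):
        # Normal case: the connected server acknowledges the ping.
        if self.current.ping():
            return self.current.name
        # No acknowledgement: try the other servers in the ensemble.
        for server in self.ensemble:
            if server is not self.current and server.ping():
                # Transfer the session to the server that answered.
                self.current.sessions.discard(self.session_id)
                server.sessions.add(self.session_id)
                self.current = server
                return server.name
        raise ConnectionError("no server in the ensemble acknowledged the ping")


servers = [FakeServer("server-1"), FakeServer("server-2"), FakeServer("server-3")]
client = Client("session-42", servers)

before = client.heartbeat()  # acknowledged by server-1
servers[0].alive = False     # server-1 stops responding
after = client.heartbeat()   # client moves to server-2
```

A real client would measure the time until an acknowledgement arrives. The boolean return value here only stands in for that timeout.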
ZooKeeper responds to a small set of commands, where every command is made up
of four letters. These commands are issued to ZooKeeper via telnet or nc at the
client port. Following are a few of the commands used to monitor a ZooKeeper
cluster:
1) stat
7) envi
11) wchs
Lists details on watches for the server.
12) wchc
Displays detailed information about the watches for the server, by session. It
displays a list of sessions with their related watches.
13) wchp
Displays detailed information about the watches for the server, by path. It
displays a list of paths with their related sessions.
Example of ruok command:
$ echo ruok | nc 127.0.0.1 5111
imok
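The same exchange can be done from a program instead of nc. The following is a minimal sketch using only the Python standard library; the helper name `four_letter_word` is an assumption made for this example, and it relies on the server closing the connection after it has sent its reply.

```python
import socket


def four_letter_word(host, port, command, timeout=2.0):
    """Send a four-letter command and return the server's reply as text.

    Hypothetical helper: connects, writes the command, then reads until
    the server closes the connection.
    """
    if len(command) != 4:
        raise ValueError("command must be exactly four letters")
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(command.encode("ascii"))
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # empty read: the server closed the connection
                break
            chunks.append(data)
    return b"".join(chunks).decode("ascii", errors="replace")


# Usage, with the host and port taken from the example above
# (requires a server listening there):
#   reply = four_letter_word("127.0.0.1", 5111, "ruok")
```

Recent ZooKeeper releases only answer four-letter commands that have been enabled in the server configuration, so an empty or refused reply does not necessarily mean the server is down.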
3.3.2 HBase Uses ZooKeeper
• HBase, a distributed NoSQL database that runs on top of HDFS, is used for
real-time read and write access to huge datasets.
• HBase follows a master-slave architecture in which the HBase master acts as
the server and the region servers act as slaves.
• Installation of the HBase distributed application depends on a running
ZooKeeper cluster.
• HBase uses ZooKeeper to track the status of data distributed across the
master and the slaves. This is done using the centralized configuration and
distributed mutex techniques.
• Following are a few use cases of HBase:
Telecom:
The telecom industry stores billions of call records, and accessing such huge
data in real time is very difficult. HBase can be used to process such large
data in real time, efficiently and easily.
Social network:
Social networking sites such as Facebook, LinkedIn and Twitter receive a large
amount of data in the form of posts created by users. HBase can be used to
discover interesting facts and recent trends.
3.3.3 Building Application with Zookeeper