Cluster Computing
Unit 1
Dr Hitesh Mohapatra
Associate Professor
School of Computer Engineering
KIIT University
Introduction :
• Cluster computing is a collection of tightly or
loosely connected computers that work
together so that they act as a single entity.
• The connected computers execute operations
all together thus creating the idea of a single
system.
• The clusters are generally connected through
fast local area networks (LANs)
Why is Cluster Computing important?
[Link] computing gives a relatively inexpensive, unconventional to the
large server or mainframe computer solutions.
[Link] resolves the demand for content criticality and process services in a
faster way.
[Link] organizations and IT companies are implementing cluster computing
to augment their scalability, availability, processing speed and resource
management at economic prices.
[Link] ensures that computational power is always available.
[Link] provides a single general strategy for the implementation and application
of parallel high-performance systems independent of certain hardware
vendors and their product decisions.
Cluster computing layout
Types of Cluster computing :
• High performance (HP) clusters :
• HP clusters use computer clusters and supercomputers to solve advanced
computational problems.
• These are used to perform functions that need nodes to communicate as they
perform their jobs.
• These are designed to take benefit of the parallel processing power of several
nodes.
Cont.…
• Load-balancing clusters :
• Incoming requests are distributed for resources among several nodes running
similar programs or having similar content.
• This prevents any single node from receiving a disproportionate amount of
tasks.
• This type of distribution is generally used in a web-hosting environment.
Cont.…
• High Availability (HA) Clusters :
• HA clusters are designed to maintain redundant nodes that can act as backup
systems in case any failure occurs.
• Consistent computing services like business activities, complicated databases,
customer services like e-websites and network file distribution are provided.
• They are designed to give uninterrupted data availability to the customers.
Classification of Cluster :
• Open Cluster :
• IPs are needed by every node and those are accessed only through the
internet or the web. This type of cluster causes enhanced security concerns.
• Close Cluster :
• The nodes are hidden behind the gateway node, and they provide increased
protection. They need fewer IP addresses and are good for computational
tasks.
Cluster Computing Architecture :
• It is designed with an array of interconnected individual computers
and computer systems operating collectively as a single standalone
system.
• It is a group of workstations or computers working together as a
single, integrated computing resource connected via high-speed
interconnects.
• A node – Either a single or multiprocessor network having memory,
input and output functions and an operating system.
• Two or more nodes are connected on a single line or every node
might be connected individually through a LAN connection.
Cluster Computing Architecture Diagram
Components of a Cluster Computer :
[Link] Nodes
[Link] Operating System
[Link] switch or node interconnect
[Link] switching hardware
Component Diagram
Advantages of Cluster Computing :
• High Performance :
• The systems offer better and enhanced performance than that of mainframe computer networks.
• Easy to manage :
• Cluster Computing is manageable and easy to implement.
• Scalable :
• Resources can be added to the clusters accordingly.
• Expandability :
• Computer clusters can be expanded easily by adding additional computers to the network. Cluster
computing is capable of combining several additional resources or networks to the existing
computer system.
• Availability :
• The other nodes will be active when one node gets failed and will function as a proxy for the failed
node. This makes sure for enhanced availability.
• Flexibility :
• It can be upgraded to the superior specification or additional nodes can be added.
Disadvantages of Cluster Computing :
• High cost :
• It is not so much cost-effective due to its high hardware and its design.
• Problem in finding fault :
• It is difficult to find which component has a fault.
• More space is needed :
• Infrastructure may increase as more servers are needed to manage and
monitor.
Applications of Cluster Computing :
• Various complex computational problems can be solved.
• It can be used in the applications of aerodynamics, astrophysics and
in data mining.
• Weather forecasting.
• Image Rendering.
• Various e-commerce applications.
• Earthquake Simulation.
• Petroleum reservoir simulation.
Any questions?
Reference:
• V. Koukis, C. Venetsanopoulos and N. Koziris, "~okeanos: Building a Cloud, Cluster by Cluster,"
in IEEE Internet Computing, vol. 17, no. 3, pp. 67-71, May-June 2013, doi:
10.1109/MIC.2013.43.
• R. L. Grossman, "The Case for Cloud Computing," in IT Professional, vol. 11, no. 2, pp. 23-27,
March-April 2009, doi: 10.1109/MITP.2009.40.
• J. Riley, J. Noss, W. Dillingham, J. Cuff and I. M. Llorente, "A High-Availability Cloud for
Research Computing," in Computer, vol. 50, no. 6, pp. 92-95, 2017, doi:
10.1109/MC.2017.182.
• M. Parashar, M. AbdelBaky, I. Rodero and A. Devarakonda, "Cloud Paradigms and Practices for
Computational and Data-Enabled Science and Engineering," in Computing in Science &
Engineering, vol. 15, no. 4, pp. 10-18, July-Aug. 2013, doi: 10.1109/MCSE.2013.49.
• D. A. Popescu and A. W. Moore, "Network Latency and Application Performance Aware
Cluster Scheduling in Data Centers," in IEEE Network, vol. 36, no. 2, pp. 58-65, March/April
2022, doi: 10.1109/MNET.001.2100414.