UNIT-1: Overview of Grid Computing
CASE STUDIES:
➢ Case study-1: The Next-Generation Fabric
Intel® Omni-Path Architecture (Intel® OPA), an element of Intel® Scalable System Framework, delivers the
performance for tomorrow’s high performance computing (HPC) workloads and the ability to scale to tens of
thousands of nodes—and eventually more—at a price competitive with today’s fabrics. The Intel OPA 100 Series
product line is an end-to-end solution of PCIe* adapters, silicon, switches, cables, and management software. As the
successor to Intel® True Scale Fabric, this optimized HPC fabric is built upon a combination of enhanced IP and
Intel® technology.
For software applications, Intel OPA will maintain consistency and compatibility with existing Intel True Scale
Fabric and InfiniBand* APIs by working through the open source OpenFabrics Alliance (OFA) software stack on
leading Linux* distribution releases. Intel True Scale Fabric customers will be able to migrate to Intel OPA through
an upgrade program.
The Future of High Performance Fabrics
Current standards-based high performance fabrics, such as InfiniBand*, were not originally designed for HPC,
resulting in performance and scaling weaknesses that are currently impeding the path to Exascale computing. Intel®
Omni-Path Architecture is being designed specifically to address these issues and scale cost-effectively from entry
level HPC clusters to larger clusters with 10,000 nodes or more. To improve on the InfiniBand specification and
design, Intel is using the industry’s best technologies including those acquired from QLogic and Cray alongside
Intel® technologies.
While both Intel OPA and InfiniBand Enhanced Data Rate (EDR) will run at 100Gbps, there are many differences.
The enhancements of Intel OPA will help enable the progression towards Exascale while cost-effectively supporting
clusters of all sizes with optimization for HPC applications at both the host and fabric levels for benefits that are not
possible with the standard InfiniBand-based designs.
Intel OPA is designed to provide:
• Features and functionality at both the host and fabric levels to greatly raise levels of scaling
• CPU and fabric integration necessary for the increased computing density, improved reliability, reduced power, and
lower costs required by significantly larger HPC deployments
• Fabric tools to readily install, verify, and manage fabrics at this level of complexity
➢ Case study-2: Optimal Workload Performance Meets Intelligent Orchestration
The powerful new Intel® Xeon® processor E5-2600 v4 product family offers versatility across diverse workloads.
These processors are designed for architecting next-generation data centers running on software-defined
infrastructure, supercharged for efficiency, performance, and agile services delivery across cloud-native and
traditional applications. They support workloads for cloud, high-performance computing, networking, and storage.
The Intel® Xeon® processor E5-4600 v4 product family delivers the compute horsepower in a 4-socket-based dense
form factor. This processor product family provides high-density, energy-efficient compute resources to support
larger workloads and high virtual machine densities in your data center or cloud. These 4-socket server platforms
give you more options and greater flexibility for scaling your infrastructure and growing your business.
The Intel® Xeon® processor E5-1600 v4 product family provides a professional, high-
performance workstation platform ideal for efficient multitasking, advanced model generation, and complex
applications.
➢ Case study-3: IBM Elastic Storage Server (ESS)
IBM Elastic Storage Server is a modern implementation of software defined cluster storage, combining IBM
Spectrum Scale™ software with POWER8 servers and disk arrays. Deploy petascale class high-speed storage
quickly with pre-assembled and optimized servers, storage and software.
➢ Case study-4: IBM Power System S822LC for high performance computing
The IBM Power System S822LC is built on industry standards and incorporates innovation from the OpenPOWER
Foundation ecosystem, including up to 2 NVIDIA® Tesla® GPU Accelerators and Mellanox® InfiniBand. The
Power S822LC delivers faster time to insight by pairing the built-for-big-data architecture of POWER8 and
accelerator performance. Also available without GPU accelerators as IBM Power System S822LC for commercial
computing.
➢ Case study-5: In India, NETWEB Technology and HP/Wipro are working in HPC.
4. CLUSTER COMPUTING
A computer cluster consists of a set of loosely or tightly connected computers that work together so that, in many
respects, they can be viewed as a single system. Unlike grid computers, computer clusters have each node set to
perform the same task, controlled and scheduled by software.
The components of a cluster are usually connected to each other through fast local area networks ("LAN"), with
each node (computer used as a server) running its own instance of an operating system. In most circumstances, all of
the nodes use the same hardware and the same operating system, although in some setups (e.g. using Open Source
Cluster Application Resources (OSCAR)), different operating systems can be used on each computer, and/or
different hardware.
They are usually deployed to improve performance and availability over that of a single computer, while typically
being much more cost-effective than single computers of comparable speed or availability.
Computer clusters emerged as a result of convergence of a number of computing trends including the availability of
low-cost microprocessors, high speed networks, and software for high-performance distributed computing. They
have a wide range of applicability and deployment, ranging from small business clusters with a
handful of nodes to some of the fastest supercomputers in the world such as IBM's Sequoia.
The desire to get more computing power and better reliability by orchestrating a number of low-cost commercial
off-the-shelf computers has given rise to a variety of architectures and configurations.
The computer clustering approach usually (but not always) connects a number of readily available computing nodes
(e.g. personal computers used as servers) via a fast local area network. The activities of the computing nodes are
orchestrated by "clustering middleware", a software layer that sits atop the nodes and allows the users to treat the
cluster as by and large one cohesive computing unit, e.g. via a single system image concept.
Computer clustering relies on a centralized management approach which makes the nodes available as orchestrated
shared servers. It is distinct from other approaches such as peer to peer or grid computing which also uses many
nodes, but with a far more distributed nature.
A computer cluster may be a simple two-node system which just connects two personal computers, or may be a very
fast supercomputer. A basic approach to building a cluster is that of a Beowulf cluster which may be built with a few
personal computers to produce a cost-effective alternative to traditional high performance computing. An early
project that showed the viability of the concept was the 133-node Stone Supercomputer. The developers used Linux,
the Parallel Virtual Machine toolkit and the Message Passing Interface library to achieve high performance at a
relatively low cost.
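To make the message-passing idea above concrete, here is a minimal sketch of an MPI job on a Beowulf-style cluster. It assumes the mpi4py package and an MPI runtime (e.g. Open MPI) are installed on the nodes; the partial-sum computation is purely illustrative and is not taken from the Stone Supercomputer project.

```python
# A minimal message-passing sketch for a Beowulf-style cluster, assuming the
# mpi4py package and an MPI runtime are available on every node.
from mpi4py import MPI

comm = MPI.COMM_WORLD          # all processes launched by mpirun
rank = comm.Get_rank()         # this process's id within the cluster job
size = comm.Get_size()         # total number of processes

# Each process computes a partial sum; rank 0 gathers and reduces the results.
partial = sum(range(rank * 1000, (rank + 1) * 1000))
total = comm.reduce(partial, op=MPI.SUM, root=0)

if rank == 0:
    print(f"{size} processes computed total = {total}")
```

Launched with something like `mpirun -np 4 python partial_sum.py`, each node works on its own slice of the problem and only the reduced result crosses the network, which is the pattern PVM and MPI made practical on commodity clusters.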
5. PEER-TO-PEER COMPUTING
Peer-to-peer (P2P) computing or networking is a distributed application architecture that partitions tasks or
workloads between peers. Peers are equally privileged, equipotent participants in the application. They are said to
form a peer-to-peer network of nodes.
Peers make a portion of their resources, such as processing power, disk storage or network bandwidth, directly
available to other network participants, without the need for central coordination by servers or stable hosts. [1] Peers
are both suppliers and consumers of resources, in contrast to the traditional client-server model in which the
consumption and supply of resources is divided. Emerging collaborative P2P systems are going beyond the era of
peers doing similar things while sharing resources, and are looking for diverse peers that can bring in unique
resources and capabilities to a virtual community thereby empowering it to engage in greater tasks beyond those that
can be accomplished by individual peers, yet that are beneficial to all the peers.
While P2P systems had previously been used in many application domains, the architecture was popularized by the
file sharing system Napster, originally released in 1999. The concept has inspired new structures and philosophies in
many areas of human interaction. In such social contexts, peer-to-peer as a meme refers to the egalitarian social
networking that has emerged throughout society, enabled by Internet technologies in general.
While P2P networks open a new channel for efficient downloading and sharing of files and data, users need to be
fully aware of the security threats associated with this technology. Security measures and adequate prevention
should be implemented to avoid any potential leakage of sensitive and/or personal information, and other security
breaches. Before deciding to open firewall ports to allow for peer-to-peer traffic, system administrators should
ensure that each request complies with the corporate security policy and should only open a minimal set of firewall
ports needed to fulfill P2P needs. For end users, including home users, care must be taken to avoid any possible
spread of viruses over the peer-to-peer network.
MANAGEMENT CONSIDERATIONS
TRENDS AND IMPACT
The first appearance of publicly available P2P systems such as Napster in 1999 radically changed file-sharing mechanisms.
The traditional client-server file sharing and distribution approach using protocols like FTP (File Transfer
Protocol) was supplemented with a new alternative — P2P networks. At the time, Napster was used extensively
for the sharing of music files. Napster was shut down in mid-2001 due to legal action by the major record labels.
The shutting of Napster did not stop the growth of P2P applications. A number of publicly available P2P systems
have appeared in the past few years, including Gnutella, KaZaA, WinMX and BitTorrent, to name but a few.
Analysis of P2P traffic in 2007 showed that BitTorrent was still the most popular file sharing protocol, accounting
for 50-75% of all P2P traffic and roughly 40% of all Internet traffic.
P2P technology is not just used for media file sharing. For example, in the bioinformatics research community, a
P2P service called Chinook has been developed to facilitate the exchange of analysis techniques. The technology is
also used in other areas including IP-based telephone networks, such as Skype, and television networks, such as
PPLive. Skype allows people to chat, make phone calls or make video calls. When launched, each Skype client
acts as a peer, building and refreshing a table of reachable nodes in order to communicate for chat, phone
calls or video calls. PPLive shares live television content. Each peer downloads and redistributes live television
content from and to other peers.
GOVERNANCE AND REGULATIONS
In the U.S., a number of politicians have raised concerns about possible threats to national security due to P2P
network technology. The possibility of accidental leaks of classified information by government officers to
foreign governments, terrorists or organized crime via P2P file sharing programs has prompted a view that “new
laws and rules should be enacted to protect personal information held by federal agencies and other
organizations”. The proposal does not restrict P2P networks as a whole, but attempts to strike “a balance that
protects sensitive government, personal and corporate information and copyright laws”.
A P2P network itself is only a form of technology, and is not related to disputes over content and intellectual
property rights. However, there have been court cases in Hong Kong against illegal P2P activities. In 2005, a
Hong Kong resident was convicted of breaching the Copyright Ordinance by uploading illegal copies of
copyrighted works to the Internet using the BitTorrent peer-to-peer file sharing program, and making files
available for download by other Internet users.
SECURITY CONSIDERATIONS
CLASSIFICATION OF P2P NETWORKS
P2P networks can be roughly classified into two types — “pure P2P networks” and “hybrid P2P networks”. In a
pure P2P network, all participating peers are equal, and each peer plays both the role of client and of server. The
system does not rely on a central server to help control, coordinate, or manage the exchanges among the peers.
Gnutella and Freenet are examples of pure P2P networks.
In a hybrid P2P network, a central server exists to perform certain “administrative” functions to facilitate P2P
services. For example, in Napster, a server helps peers to “search for particular files and initiate a direct transfer
between the clients”. Only a catalogue of available files is kept on the server, while the actual files are scattered
across the peers on the network. Another example is BitTorrent (BT), where a central server called a tracker helps
coordinate communication among BT peers in order to complete a download.
The central distinction between the two types of P2P network is that hybrid P2P networks have a central entity to
perform certain administrative functions while there is no such server in pure P2P networks. Compared to the
hybrid P2P architecture, the pure P2P architecture is simpler and has a higher level of fault tolerance. On the other
hand, the hybrid P2P architecture consumes less network resources and is more scalable than the pure P2P
approach.
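The hybrid case can be sketched in a few lines: a central index (playing the role of Napster's catalogue or a BitTorrent tracker) stores only metadata about which peers hold which files, while the files themselves stay on the peers. The class, file names and peer addresses below are hypothetical and are not taken from any real implementation.

```python
# A minimal sketch of a hybrid P2P central index: the server keeps only a
# catalogue of file locations, never the file contents. All names are made up.

class CentralIndex:
    """Catalogue mapping file names to the peers that hold them."""
    def __init__(self):
        self.catalogue = {}                      # filename -> set of peer addresses

    def announce(self, filename, peer_addr):
        """A peer registers a file it is willing to share."""
        self.catalogue.setdefault(filename, set()).add(peer_addr)

    def lookup(self, filename):
        """Return the peers that can serve the file; the transfer itself is peer-to-peer."""
        return sorted(self.catalogue.get(filename, set()))

index = CentralIndex()
index.announce("genome.dat", "10.0.0.5:6881")
index.announce("genome.dat", "10.0.0.9:6881")
print(index.lookup("genome.dat"))    # ['10.0.0.5:6881', '10.0.0.9:6881']
```

In a pure P2P network there is no such index; queries are resolved among the peers themselves (for example by flooding, as in early Gnutella), which removes the single point of failure at the cost of extra network traffic.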
SECURITY THREATS
A P2P network treats every user as a peer. In file sharing protocols such as BT, each peer contributes to service
performance by uploading files to other peers while downloading. This opens a channel for files stored in the user
machine to be uploaded to other foreign peers.
The potential security risks include:
1. TCP ports issues: Usually, P2P applications need the firewall to open a number of ports in order to function
properly. BitTorrent, for example, will use TCP ports 6881-6889 (prior to version 3.2). The range of TCP ports
has been extended to 6881-6999 as of 3.2 and later. Each open port in the firewall is a potential avenue that
attackers might use to exploit the network. It is not a good idea to open a large number of ports in order to allow
for P2P networks.
2. Propagation of malicious code such as viruses: As P2P networks facilitate file transfer and sharing, malicious
code can exploit this channel to propagate to other peers. For example, a worm called VBS.Gnutella was detected
in 2000 which propagated across the Gnutella file sharing network by making and sharing a copy of itself in the
Gnutella program directory. Trojan horses have also been found over P2P networks. An example is W32/Inject-H,
which contained an IRC backdoor Trojan that utilized P2P networks to propagate itself. The Trojan would open a
backdoor in a user’s Windows PC to allow a remote intruder access and control of the computer. Theoretically
speaking, sensitive and personal information stored in the infected computer could be copied to other machines on
the P2P network.
3. Risks of downloaded content: When a file is downloaded using the P2P software, it is not possible to know
who created the file or whether it is trustworthy. In addition to the risks of viruses or malicious code associated
with the file, the person downloading the file might also be exposed to criminal and/or civil litigation if any illegal
content is downloaded to a company machine. Also, when downloading via a P2P network, it is not possible to
know which peers are connected at any one time or whether these peers are trustworthy. Untrusted sources
pose an additional security threat.
4. Vulnerability in P2P software: Like any software, P2P software is vulnerable to bugs. As each peer is both a
client and a server, it constantly receives requests from other peers, and if the server component of the P2P
software is buggy, it could introduce certain vulnerabilities to a user’s machine. Intruders could exploit this to
spread viruses, hack into a machine, or even launch a denial of service attack. It was reported in 2003 that a bug in
the P2P software Kazaa Media Desktop could cause a denial of service attack, or allow a remote attacker to
execute arbitrary code.
In addition to general security risks, the use of P2P applications in a company network situation could generate an
unnecessarily large amount of network traffic, monopolizing network bandwidth that should be available for other
business applications. The time spent by employees in dealing with the effects of P2P download or upload will
affect employee productivity and the organization’s bottom line.
6. INTERNET COMPUTING
What exactly is Internet computing? Many people think they know, but they are surprised to learn that they don't.
Are you one of the select few who has a grasp of the subject?
Retailers today face a competitive marketplace with unprecedented challenges and opportunities: increasing labor
costs, blurring of market segmentation, reduced customer loyalty. You know the list. But now there is a whole
new set of challenges brought on by the Internet:
▪ Retailers are being "dot-commed" right out of their markets.
▪ Price visibility is allowing customers instant access to the lowest cost merchant.
▪ Manufacturers and new competitors are removing some retailers from the supply chain altogether.
▪ Online auctions have fundamentally changed the way merchandise is sold and purchased. This list continues to
grow as people think of more and more ways to leverage the Internet.
E-Business Or Out Of Business
It is a new world. For brick-and-mortar retailers in particular, the Internet is creating enormous disruption. But, it is also
presenting unprecedented opportunities for those who understand the use, implications, and terminology of Internet
technologies, and for those who move quickly and intelligently to become an e-business themselves. Increasingly, the
choice facing retailers is simple: it's e-business or out of business. Unfortunately, it's not as simple as deciding to
become an e-business. Terms like e-business, e-commerce, Web-deployed, Internet-enabled, customer relationship
management, and the like all seem to have different meanings to each retailer and software vendor. For retailers, one
fundamental term that must be clearly understood to succeed is the true meaning of the words "Internet computing."
Why? Because the differences between true Internet computing, and the faux offerings that mimic the look of true
Internet computing, are subtle to the untrained eye. However, they are dramatic in the capabilities and benefits they
provide.
Defining Internet Computing
Internet computing is the foundation on which e-business runs. It is the only architecture that can run all facets of
business, from supplier collaboration and merchandise purchasing, to distribution and store operations, to customer
sales and service. Internet computing is the only architecture that supports all information flows and processes over the
Internet — providing access to all applications. With Internet computing, all a user needs is a standard Web browser
and security clearance. The Internet computing model represents a fundamental shift from the traditional client/server
enterprise application model. The four-walled efficiency that was once the goal of monolithic enterprise resource
planning implementations — known as business process redesign (BPR) — has been replaced. The new environment is
one in which economic gains are a result of systems efficiencies and collaboration across the extended network of
customers, retailers, manufacturers, and suppliers.
Shift In Focus
There are three tiers in true Internet computing. These three tiers provide the benefit of centralized data that supports a
unified view of the retailer's financial, human resources, inventory, logistics, trading partner, and customer information.
The business logic at the next layer accesses and transacts the data. The user interface is a simple, non-proprietary Web
browser. No complexity resides on the users' device, which can be anything from a PC to a mobile phone, or even a
uniquely purposed mobile unit. (Note: A "tool set" that supports writing in multiple languages allows Web
deployment functionality to occur within the application server.)
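As a rough illustration of this three-tier split, the sketch below uses only the Python standard library: an in-memory dictionary stands in for the centralized data tier, one function plays the business-logic tier, and any standard Web browser pointed at the URL is the client tier. All names, data and the port number are invented for illustration and do not describe any particular retail system.

```python
# A hedged sketch of the three-tier model: data tier, business-logic tier, and
# a plain browser as the client tier. Names and data are hypothetical.
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Tier 1: centralized data (stand-in for the retailer's database).
INVENTORY = {"sku-1001": {"item": "jacket", "stock": 42},
             "sku-1002": {"item": "boots", "stock": 7}}

# Tier 2: business logic that accesses and transacts the data.
def low_stock(threshold=10):
    return {sku: rec for sku, rec in INVENTORY.items() if rec["stock"] < threshold}

# Tier 3: the user interface is just a browser hitting this HTTP endpoint.
class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        body = json.dumps(low_stock()).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("localhost", 8080), Handler).serve_forever()
```

The point of the sketch is that no application logic lives on the user's device; the browser only renders what the middle tier returns.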
1. Increasing Capabilities: Computing power has a doubling time of about 18 months, storage device capacity
doubles every 12 months and communication speed doubles every 9 months (a worked example of these rates
follows this list). Because these rates of increase differ, significant power is available in every area, but the
traditional implementation trade-offs keep changing. This increasing capability has resulted in work to define new
ways to interconnect and manage computing resources.
2. New Application Requirements: New applications are being developed in physics, biology, astronomy,
visualization, digital image processing, meteorology, etc. Many of the anticipated applications have
communication requirements among only a small number of sites. This is a very different requirement from that of
“typical” Internet use, where communication is spread among many sites. Applications can be divided into
three classes: 1) lightweight “classical” Internet applications (mail, browsing), 2) medium applications (business,
streaming, VPN) and 3) heavyweight applications (e-science, computing, data grids, and virtual presence). The
total bandwidth estimate for all users of each class of network application is 20 Gb/sec for the lightweight
Internet, 40 Gb/sec for all users of the intermediate class of applications and 100 Gb/sec for the heavyweight
applications. Note that the heavyweight applications use significantly more bandwidth than the total bandwidth of
all applications on the classical Internet. Different application types value different capabilities: lightweight
applications value interconnectivity, middleweight applications value throughput and QoS, and heavyweight
applications value throughput and raw performance.
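As the worked example promised in point 1 above, the short sketch below turns the quoted doubling times into growth factors over a 36-month period; the arithmetic follows directly from the stated doubling times and nothing else is assumed.

```python
# Growth factors over 36 months for the doubling times quoted above:
# compute 18 months, storage 12 months, communication speed 9 months.
months = 36
for resource, doubling_time in [("compute", 18), ("storage", 12), ("communication", 9)]:
    growth = 2 ** (months / doubling_time)
    print(f"{resource:14s} grows {growth:.0f}x in {months} months")

# compute        grows 4x in 36 months
# storage        grows 8x in 36 months
# communication  grows 16x in 36 months
```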
CONGESTION CONTROL
Moving bulk data quickly over high-speed data networks is a requirement for many applications. These
applications require high-bandwidth links between network nodes. To maintain the stability of the Internet, all
applications should be subject to congestion control. TCP is a well-developed, extensively used and widely
available Internet transport protocol. TCP is fast, efficient and responsive to network congestion conditions, but
one objection to using TCP congestion control is that TCP’s AIMD (additive increase, multiplicative decrease)
back-off algorithm is too abrupt in decreasing the window size, and thus hurts the data rate.
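To illustrate the AIMD behaviour objected to above, the following sketch simulates a congestion window that grows by one segment per round trip and is halved on each loss. The loss pattern is artificial and the model deliberately ignores slow start, timeouts and all other TCP details.

```python
# A simplified AIMD sketch: additive increase of one segment per RTT,
# multiplicative decrease (halving) on loss. The loss rounds are artificial.
def aimd(rounds, loss_rounds, cwnd=1.0):
    history = []
    for r in range(rounds):
        if r in loss_rounds:
            cwnd = max(1.0, cwnd / 2)   # the abrupt back-off that hurts the data rate
        else:
            cwnd += 1.0                  # one extra segment per round trip
        history.append(cwnd)
    return history

print(aimd(rounds=12, loss_rounds={6, 9}))
```

The sawtooth shape of the resulting window is why a single loss on a long, fast path can take many round trips to recover from.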
The performance of the congestion control system, the TCP algorithm and the link congestion signalling algorithm
has many facets, and these variables can impact the Quality of Service (QoS).
A variety of metrics are used to describe this performance:
Fairness: The Jain index is a popular fairness metric that measures how equally the sources share a single
bottleneck link. A value of 1 indicates perfectly equal sharing and smaller values indicate worse fairness.
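For reference, the Jain index of a set of rates x1..xn is (sum of the rates) squared divided by n times the sum of the squared rates. The sketch below is a direct implementation with made-up rate values.

```python
# Jain fairness index: J = (sum x_i)^2 / (n * sum x_i^2), for source rates
# sharing one bottleneck. The example rates are invented.
def jain_index(rates):
    n = len(rates)
    return sum(rates) ** 2 / (n * sum(r * r for r in rates))

print(jain_index([10, 10, 10, 10]))   # 1.0   -> perfectly equal sharing
print(jain_index([25, 5, 5, 5]))      # ~0.57 -> one source dominates
```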
Throughput: Throughput is simply the data rate, typically in Mbps, delivered to the application. For a single
source this should be close to the capacity of the link. When the BDP is high, that is, when the link capacity or
RTT or both are high, some protocols are unable to achieve good throughput.
Stability: The stability metric measures the variations of the source rate and/or the queue length in the router
around the mean values when everything else in the network is held fixed. Stability is typically measured as the
standard deviation of the rate around the mean rate, so a lower value indicates better performance. If a
protocol is unstable, the rate oscillates around the link capacity, at times exceeding it, resulting in poor delay
jitter and throughput performance.
Responsiveness: measures how fast a protocol reacts to a change in network operating conditions. If the source
rates take a long time to converge to a new level, say after the capacity of the link changes, either the link may
become underutilized or the buffer may overflow. The responsiveness metric measures the time or the number of
round trips to obtain the right rate.
Queuing delay: Once the congestion window is greater than the BDP, the link is well utilized; however, if the
congestion window is increased further, queuing delay builds up. Different TCP and AQM protocol combinations
differ in how they try to minimize queuing delay.
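As a worked example of the bandwidth-delay product used above (link capacity multiplied by round-trip time), the short sketch below uses illustrative numbers only.

```python
# Bandwidth-delay product: the amount of data that must be "in flight" to keep
# a link full. Numbers below are illustrative.
def bdp_bytes(bandwidth_bps, rtt_seconds):
    return bandwidth_bps * rtt_seconds / 8      # bits in flight -> bytes

# A 1 Gb/s path with 100 ms RTT needs about 12.5 MB of outstanding data;
# a smaller congestion window leaves the link under-utilized, while a much
# larger one only builds queuing delay.
print(bdp_bytes(1e9, 0.100) / 1e6, "MB")        # 12.5 MB
```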
Loss recovery: Packet loss can result from overflowing buffers, which indicates network congestion, or from
transmission errors, such as bit errors over a wireless channel. It is desirable that, when packet loss occurs due to
transmission error, the source continues to transmit uninterrupted; however, when the loss is due to congestion, the
source should slow down. Loss recovery is typically measured as the throughput that can be sustained under a
certain random packet loss rate caused by transmission error. Loss-based protocols typically cannot distinguish
between congestion losses and transmission-error losses.
In the past few years, a number of TCP variants have been developed that address the under-utilization problem,
most notably the slow growth of the TCP congestion window, which makes standard TCP unfavorable for
high-BDP networks.
UDP-based protocols provide much better portability and are easy to install. Although user-level protocol
implementations need less time to test and debug than kernel implementations, it is difficult to make them as
efficient: because user-level implementations cannot modify kernel code, there may be additional context switches
and memory copies. At high transfer speeds, these operations strongly affect CPU utilization and protocol
performance. In fact, one of the purposes of the standard UDP protocol is to allow new transport protocols to be
built on top of it; for example, the RTP protocol is built on top of UDP and supports streaming multimedia. In this
section we study some UDP-based transport protocols for data-intensive grid applications.
NETBLT
Bulk data transmission is needed by more and more applications in various fields and is a must for grid
applications. The major performance concern of a bulk data transfer protocol is high throughput. In reality,
achievable end-to-end throughput over high-bandwidth channels is often an order of magnitude lower than the
provisioned bandwidth, because it is limited by the transport protocol’s mechanisms; it is especially difficult to
achieve high throughput and reliable data transmission across long-delay, unreliable network paths.
NETBLT works by opening a connection between two clients (the sender and the receiver) transferring data in a
series of large numbered blocks (buffers), and then closing the connection. NETBLT transfer works as follows: the
sending client provides a buffer of data for the NETBLT layer to transfer. NETBLT breaks the buffer up into
packets and sends them as Internet datagrams. The receiving NETBLT layer loads these packets into a
matching buffer provided by the receiving client. When the last packet in that buffer has arrived, the receiving
NETBLT part will check to see if all packets in buffer have been correctly received or if some packets are missing.
If any packets are missing, the receiver requests that they be resent. When the buffer has been completely
transmitted, the receiving client is notified by its NETBLT layer. The receiving client disposes of the buffer and
provides a new buffer to receive more data. The receiving NETBLT notifies the sender that the new buffer is created
for receiving and this continues until all the data has been sent.
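The buffer/packet bookkeeping described above can be sketched as follows. This is a simplified model of the NETBLT flow only, not the actual protocol implementation: the UDP datagrams, timers and connection management are omitted, and all sizes and names are arbitrary.

```python
# A simplified sketch of NETBLT-style bookkeeping: a buffer is split into
# numbered packets, the receiver fills a matching buffer, and only the missing
# packet numbers are re-requested after the last packet of the buffer arrives.
PACKET_SIZE = 1024

def split_buffer(buffer):
    """Sender side: break one large client buffer into numbered packets."""
    return {i: buffer[i * PACKET_SIZE:(i + 1) * PACKET_SIZE]
            for i in range((len(buffer) + PACKET_SIZE - 1) // PACKET_SIZE)}

def receive_buffer(expected_packets, delivered):
    """Receiver side: load arriving packets, then report any missing numbers."""
    received = dict(delivered)
    missing = [num for num in expected_packets if num not in received]
    return received, missing

sender_packets = split_buffer(b"x" * 5000)                         # 5 packets
first_pass = {n: d for n, d in sender_packets.items() if n != 2}   # pretend packet 2 was lost
received, missing = receive_buffer(sender_packets, first_pass)
print("resend requested for packets:", missing)                    # [2]
```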
Reliable Blast UDP (RBUDP)
RBUDP is designed for extremely high-bandwidth, dedicated or quality-of-service-enabled networks, for which
high-speed bulk data transfer is an important requirement. RBUDP has two goals: i) keeping the network buffer full
during data transfer and ii) avoiding TCP’s per-packet interaction by sending acknowledgements only at the end of a
transmission.
There are 3 versions of RBUDP available:
i) Version 1: without scatter/gather optimization - this is a naive implementation of RBUDP in which each
incoming packet is examined and then moved.
ii) Version 2: with scatter/gather optimization - this implementation takes advantage of the fact that most
incoming packets are likely to arrive in order, and that if transmission rates are below the maximum throughput
of the network, packets are unlikely to be lost.
iii) Version 3: Fake RBUDP - this implementation is the same as the scheme without the scatter/gather
optimization, except that the incoming data is never moved to application memory.
Implementation results show that RBUDP performs very efficiently over high-speed, high-bandwidth,
Quality-of-Service-enabled networks such as optically switched networks. Mathematical modeling and experiments
have also shown that RBUDP effectively utilizes the available bandwidth for reliable data transfer.
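The control logic behind RBUDP's two goals can be sketched as below: data is blasted once without per-packet acknowledgements, and the receiver's end-of-transmission report of missing packets drives further rounds. In the real protocol the data travels over UDP and the missing-packet bitmap over TCP; the random loss model and function names here are illustrative assumptions, not the actual RBUDP code.

```python
# A sketch of the RBUDP control loop: blast everything once, collect the
# receiver's "missing" report, and resend only what is missing. Illustrative only.
import random

def blast(packet_ids, loss_rate=0.1):
    """One UDP 'blast': every packet is sent once; some are randomly lost."""
    return {p for p in packet_ids if random.random() > loss_rate}

def rbudp_transfer(total_packets):
    outstanding = set(range(total_packets))
    rounds = 0
    while outstanding:
        arrived = blast(outstanding)
        outstanding -= arrived        # the receiver's missing-packet bitmap drives resends
        rounds += 1
    return rounds

print("transfer completed in", rbudp_transfer(1000), "blast round(s)")
```

Because there are no per-packet acknowledgements, the sender can keep the link full for the whole blast, which is exactly the property that suits dedicated, high-bandwidth paths.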
TSUNAMI
A reliable transfer protocol, Tsunami, is designed for transferring large files fast over high-speed networks. Tsunami
uses inter-packet delay to adjust its sending rate instead of a sliding window mechanism.
UDP is used for sending data and TCP for sending control data. The goal of Tsunami is to increase the speed of file
transfer over high-speed networks compared with standard TCP.
During a file transfer, the client has two running threads. The network thread handles all network communication,
maintains the retransmission queue, and places blocks that are ready for disk storage into a ring buffer. The disk
thread simply moves blocks from the ring buffer to the destination file on disk. The server creates a single thread in
response to each client connection that handles all disk and network activity. The client initiates a Tsunami session
by connecting to the server’s TCP port. Upon connection, the server sends random data to the client. The client
XORs the random data with a shared secret key, calculates an MD5 checksum and transmits it to the server. The
server performs the same operation and compares the checksums; if they match, the connection is up. After the
authentication and connection steps, the client sends the name of the file to the server. The server checks whether
the file is available and, if so, sends a positive message to the client. After receiving the positive message from the
server, the client sends its block size, transfer rate and error threshold value. The server responds with the receiver
parameters and sends a time-stamp. After receiving the timestamp, the client opens a port for receiving the file from
the server, and the server sends the file to the receiver.
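The challenge-response step described above can be sketched as follows, assuming a shared secret known to both sides: the server issues random bytes, each side XORs them with the secret and compares MD5 digests. This mirrors the description only; it is not the actual Tsunami wire format, and the secret value is invented.

```python
# A sketch of the Tsunami-style challenge-response authentication described
# above. The shared secret and challenge size are illustrative assumptions.
import hashlib
import os

SECRET = b"shared-secret-key"

def response(challenge, secret):
    """XOR the challenge with the shared secret, then hash it with MD5."""
    xored = bytes(c ^ secret[i % len(secret)] for i, c in enumerate(challenge))
    return hashlib.md5(xored).hexdigest()

# Server side: generate and send a random challenge.
challenge = os.urandom(64)

# Client side: compute the digest and send it back.
client_digest = response(challenge, SECRET)

# Server side: recompute and accept the connection only if the digests match.
assert response(challenge, SECRET) == client_digest
print("connection is up")
```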
Tsunami improves on Reliable Blast UDP in two ways. First, the Tsunami receiver makes retransmission requests
periodically (every 50 packets) rather than waiting until all data transfer has finished; it calculates the current error
rate and sends it to the sender. Second, Tsunami uses rate-based congestion control. Tsunami performs best over
limited distances; on long-distance networks, bandwidth utilization drops, the absence of flow control affects its
performance, and issues such as fairness and TCP friendliness still have to be studied.
8. Types of grids:
Based on the different levels of complexity for the enterprise, grids can be categorized as follows:
• Infra-Grid – this type of grid architecture allows optimizing resource sharing within a single division or
department of the organization. An infra-grid forms a tightly controlled environment with well defined business
policies, integration and security.
• Intra-Grid – a more complex implementation than the previous type because it focuses on integrating various
resources of several departments and divisions of an enterprise. These grids require complex security policies for
sharing resources and data. However, because the resources belong to the same enterprise, the focus is on the
technical implementation of those policies.
• Extra-Grid – unlike an intra-grid, this type of grid refers to resource sharing to/from external partners with which
certain relationships are established. These grids extend beyond the administrative management of an enterprise’s
local resources, and therefore mutual conventions on managing access to resources are necessary.
• Inter-Grid – this kind of grid computing technology enables the sharing of compute and storage resources and data
over the Web, enabling collaboration between various companies and organizations. The complexity of the grid
comes from the special requirements of service levels, security and integration. This type of grid involves most of
the mechanisms found in the three previous types of grid.
Regardless of its type, every grid must provide a set of common management functions:
• Resource management: a grid must be aware of what resources are available for different tasks
• Security management: the grid needs to take care that only authorized users can access and use the
available resources
• Data management: data must be transported, cleansed, parceled and processed
• Services management: users and applications must be able to query the grid in an effective and efficient
manner
More specifically, grid computing environment can be viewed as a computing setup constituted by a number of
logical hierarchical layers. Figure 1 represents these layers. They include grid fabric resources, grid security
infrastructure, core grid middleware, user level middleware and resource aggregators, grid programming
environment and tools and grid applications.
The major constituents of a grid computing system can be grouped into various categories from different
perspectives as follows:
• functional view
• physical view
• service view
Basic constituents of a grid from a functional view are decided depending on the grid design and its expected use.
Some of the functional constituents of a grid are
1. Security (in the form of grid security infrastructure)
2. Resource Broker
3. Scheduler
4. Data Management
5. Job and resource management
6. Resources
A resource is an entity that is to be shared; this includes computers, storage, data and software. A resource need not
be a physical entity. Normally, a grid portal acts as a user interaction mechanism which is application specific and
can take many forms. A user-security functional block usually exists in the grid environment and is a key
requirement for grid computing. In a grid environment, there is a need for mechanisms to provide authentication,
authorization, data confidentiality, data integrity and availability, particularly from a user’s point of view. In the case
of inter-domain grids, there is also a requirement to support security across organizational boundaries. This makes a
centrally managed security system impractical. The grid security infrastructure (GSI) provides a “single sign-on”,
run anywhere authentication service with support for local control over access rights and mapping from global to
local identities.
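As an illustration of the global-to-local identity mapping mentioned above, Globus-style deployments have traditionally kept such mappings in a "grid-mapfile" whose lines pair a quoted certificate subject (the global identity) with a local account name. The parser below is a minimal sketch; the file contents, subjects and usernames shown are invented, and the exact format and location of the file in a given installation may differ.

```python
# A minimal sketch of parsing a grid-mapfile-style mapping from global
# certificate subjects to local usernames. The example entries are invented.
import shlex

def parse_grid_mapfile(text):
    """Return {certificate_subject: local_username} from grid-mapfile text."""
    mapping = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        subject, local_user = shlex.split(line)   # handles the quoted subject
        mapping[subject] = local_user
    return mapping

example = '''
"/O=Grid/OU=ExampleLab/CN=Alice Smith" alice
"/O=Grid/OU=ExampleLab/CN=Bob Jones" bjones
'''
print(parse_grid_mapfile(example))
```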
➢ HPC AND THE GRID
The latest fashion in some academic IT circles is “The Grid”, and many people have a quite incorrect view that
“The Grid” in some way is deeply connected to high performance computing, or even that it is high performance
computing in its latest guise.
“The Grid” is not related to high performance computing.
High performance computing and supercomputing have been around for tens of years without “The Grid” and they
will continue to be around for tens of years without it. The other view, which is also quite incorrect, is that there is
just one “Grid”, which is also “The Grid”, and that this grid is based on “Globus”, which is a collection of utilities
and libraries developed by various folks, but mostly by folks from the University of Chicago and the Argonne
National Laboratory. Many people who actually know something about distributed computing have pointed out that
what is called “The Grid” nowadays was called “Distributed Computing” only a decade ago. It is often the case in
Information Technology, especially in academia, that old washed-out ideas are given new names and flogged
off yet again by the same people who failed to sell them under the old names.
There are some successful examples of grids in place today. The most successful one, and probably the only one that
will truly flourish in years to come, is the Microsoft “.NET Passport” program. It works like this: when you start
up your PC running Windows, “MSN Messenger” logs you in with the “.NET Passport”. This way you acquire
credentials, which are then passed to all other WWW sites that participate in the “.NET Passport” program. For
example, once I have been authenticated to “.NET Passport”, I can connect to [Link], Nature, Science,
Monster, The New York Times, and various other well known sites, which recognize me instantaneously and
provide me with customized services.
Another example of a grid is AFS, the Andrew File System. AFS is a world-wide file system, which, when mounted
on a client machine, provides its user with transparent access to file systems at various institutions. It can be
compared to the World Wide Web, but unlike WWW, AFS provides access to files on the kernel and file system
level. You don't need to use a special tool such as a WWW browser. If you have AFS mounted on your computer,
you can use native OS methods in order to access, view, modify, and execute files that live at other institutions. User
authentication and verification is based on MIT Kerberos and user authorization is based on AFS Access Control
Lists (ACLs).