Networking Academy
InfiniBand Key Features
Oded Paz, Sr. Technologies Instructor
Outline
InfiniBand Key Features Overview
InfiniBand Key Features
Simplified Management
High Bandwidth
CPU Offloads
Ultra-Low Latency
Easy Network Scale-out
Quality of Service
Fabric Resiliency
Optimal Load Balancing with Adaptive Routing
MPI Super Performance with SHARP
InfiniBand Topologies
Summary
InfiniBand Key Features – Overview
InfiniBand Interconnect Technology
NVIDIA Mellanox InfiniBand interconnect delivers high-speed, extremely low-latency, and scalable solutions.
InfiniBand technology enables supercomputing, Artificial Intelligence (AI), and cloud data centers
to operate at any scale, while reducing operational costs and infrastructure complexity.
InfiniBand Interconnect Technology
InfiniBand is the interconnect technology of choice for AI, Deep Learning, Data Science and
many other accelerated computing applications.
InfiniBand Key Features
Simplified Management
Simplified Management
InfiniBand is the first architecture to truly implement the vision of SDN - Software Defined Network
An InfiniBand network is managed by a Subnet Manager.
The Subnet Manager
The Subnet Manager (SM) is a program that runs and manages the entire network.
The SM provides centralized routing management, which enables plug-and-play of all the nodes in the network.
Every InfiniBand subnet has its own master SM and, in order to ensure resiliency, a second SM that functions as a standby.
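As a conceptual illustration only (not the actual Subnet Manager code), the sketch below models how the master SM is elected among candidates: the higher SM priority wins, and ties are broken by the lower port GUID, with the remaining SMs staying in standby. The struct, names, and values are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical model of a Subnet Manager candidate. */
struct sm_candidate {
    const char *name;
    uint8_t     priority;   /* higher priority wins              */
    uint64_t    port_guid;  /* ties broken by the lower port GUID */
};

/* Pick the master SM; all other candidates remain in standby. */
static const struct sm_candidate *
elect_master(const struct sm_candidate *sm, size_t n)
{
    const struct sm_candidate *master = &sm[0];
    for (size_t i = 1; i < n; i++) {
        if (sm[i].priority > master->priority ||
            (sm[i].priority == master->priority &&
             sm[i].port_guid < master->port_guid))
            master = &sm[i];
    }
    return master;
}

int main(void)
{
    struct sm_candidate sms[] = {
        { "sm-a", 14, 0x0002c90300a1b2c3ULL },
        { "sm-b", 14, 0x0002c90300a1b2c0ULL },  /* same priority, lower GUID */
    };
    printf("master SM: %s\n", elect_master(sms, 2)->name);
    return 0;
}
```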
High Bandwidth
InfiniBand Bandwidth
The InfiniBand architecture began its journey in 2002 at a speed of 10 gigabits per second,
and since then it has provided the highest-bandwidth, non-blocking, bidirectional links.
CPU Offloads
CPU Offloads
The InfiniBand architecture supports data transfer with minimal CPU intervention.
This is achievable thanks to:
Hardware-based transport protocol
Kernel bypass or zero copy
Remote Direct Memory Access (RDMA) - RDMA allows direct memory access from the memory of
one node into that of another, without involving either node's CPU (a minimal code sketch follows below).
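To make the RDMA idea concrete, here is a hedged, minimal sketch using the libibverbs API. It assumes a protection domain pd and an already-connected reliable-connection queue pair qp (connection setup, completion polling, and memory deregistration are omitted), and that the remote buffer's address and rkey were exchanged out of band; the function name is illustrative.

```c
#include <infiniband/verbs.h>
#include <stdint.h>
#include <string.h>

/* Post a one-sided RDMA WRITE: the local buffer is placed directly into
 * remote memory by the HCA, without involving the remote CPU.
 * Assumes 'pd' and a connected RC queue pair 'qp' already exist, and that
 * 'remote_addr'/'rkey' were exchanged out of band. */
int rdma_write_example(struct ibv_pd *pd, struct ibv_qp *qp,
                       void *buf, size_t len,
                       uint64_t remote_addr, uint32_t rkey)
{
    /* Register the local buffer so the HCA can access it directly. */
    struct ibv_mr *mr = ibv_reg_mr(pd, buf, len, IBV_ACCESS_LOCAL_WRITE);
    if (!mr)
        return -1;

    struct ibv_sge sge = {
        .addr   = (uintptr_t)buf,
        .length = (uint32_t)len,
        .lkey   = mr->lkey,
    };

    struct ibv_send_wr wr, *bad_wr = NULL;
    memset(&wr, 0, sizeof(wr));
    wr.opcode              = IBV_WR_RDMA_WRITE;   /* one-sided write      */
    wr.sg_list             = &sge;
    wr.num_sge             = 1;
    wr.send_flags          = IBV_SEND_SIGNALED;   /* request a completion */
    wr.wr.rdma.remote_addr = remote_addr;
    wr.wr.rdma.rkey        = rkey;

    /* Hand the work request to the HCA; the transfer itself is performed
     * entirely in hardware (kernel bypass, zero copy). */
    return ibv_post_send(qp, &wr, &bad_wr);
}
```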
GPUDirect
Compute offloading is implemented by NVIDIA GPUs as well.
GPUDirect allows direct data transfer from the memory of one GPU to the memory of another.
It enables lower latency and improved performance for GPU-based computation.
Low Latency
Low Latency
Extremely low latency is achieved by a combination of hardware offloading and acceleration
mechanisms unique to the InfiniBand architecture.
As a result, the end-to-end latency of RDMA sessions can be as low as 1,000 nanoseconds, or
1 microsecond.
Easy Network Scale-Out
Network Scale-Out
One of InfiniBand’s main advantages is the capability to deploy up to 48,000 nodes in a single subnet.
Multiple InfiniBand subnets can be interconnected using InfiniBand routers in order to
scale easily beyond 48,000 nodes.
Quality of Service
Quality of Service
Quality of Service is the ability to assign different priorities to different:
Applications
Users
Data flows
Applications that require a higher priority are mapped to a different port queue, and their packets are
sent first to the next element in the network.
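As a conceptual model only (not the switch implementation), the sketch below illustrates strict-priority queuing: the highest-priority non-empty queue is always served first, so higher-priority packets leave the port ahead of lower-priority ones. Queue numbering and depths are illustrative assumptions.

```c
#include <stdio.h>

#define NUM_QUEUES 4   /* queue 0 = highest priority (assumption of this model) */

/* Per-port queue occupancy: how many packets wait in each priority queue. */
static int queue_depth[NUM_QUEUES] = { 0, 3, 0, 7 };

/* Strict-priority arbitration: always serve the highest-priority
 * non-empty queue first. */
static int next_queue_to_serve(void)
{
    for (int q = 0; q < NUM_QUEUES; q++)
        if (queue_depth[q] > 0)
            return q;
    return -1;  /* nothing to send */
}

int main(void)
{
    int q = next_queue_to_serve();
    if (q >= 0)
        printf("dequeue next packet from queue %d\n", q);
    return 0;
}
```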
Fabric Resiliency
Fabric Resiliency
One of the main features that customers require
is a stable network without link failures.
Yet, when a link does fail, traffic must resume very quickly.
When traffic re-routing depends solely on the
Subnet Manager routing algorithm,
traffic recovery can take around five seconds.
NVIDIA Self-Healing Networking
Self-Healing Networking is a hardware-based capability of NVIDIA switches.
NVIDIA Self-Healing Networking enables link fault recovery that is 5,000x faster than Subnet Manager-based re-routing.
This means the recovery time takes only one millisecond!
Load-Balancing
Load-Balancing
Another requirement that must be addressed in a modern high-performance data center is
how to best utilize and optimize the network.
One way to achieve this is a load-balancing scheme.
Load balancing is a routing strategy that allows traffic to be distributed over multiple available paths.
Adaptive Routing
Adaptive Routing is a feature that equalizes the amount of traffic sent on each of the switch ports.
Adaptive Routing is enabled in NVIDIA switch hardware and
managed by the Adaptive Routing Manager.
QM8700 InfiniBand Switch System
Adaptive Routing
When Adaptive Routing is enabled, the switch
Queue Manager constantly compares the queue
volume levels across all of the group's exit ports.
The Queue Manager continuously balances the
queues' load, redirecting flows and packets to
an alternative, less-utilized port.
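Conceptually, the port-selection step can be pictured as choosing the least-loaded exit port in the group. The sketch below is a hypothetical model only, not the switch firmware; the port group size, load values, and function name are illustrative.

```c
#include <stdio.h>

#define GROUP_SIZE 4   /* number of valid exit ports toward the destination */

/* Current queue occupancy (e.g., buffered bytes) per exit port in the group. */
static unsigned port_load[GROUP_SIZE] = { 512, 64, 960, 128 };

/* Adaptive routing, conceptually: steer the next flow/packet to the
 * least-utilized exit port in the group. */
static int pick_exit_port(void)
{
    int best = 0;
    for (int p = 1; p < GROUP_SIZE; p++)
        if (port_load[p] < port_load[best])
            best = p;
    return best;
}

int main(void)
{
    printf("forward via exit port %d\n", pick_exit_port());
    return 0;
}
```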
Adaptive Routing
To sum up:
Adaptive routing may be activated on all fabric switches.
Adaptive Routing supports dynamic load balancing, avoiding
in-network congestion and optimizing network bandwidth utilization.
SHARP
SHARP
SHARP - Scalable Hierarchical Aggregation and Reduction Protocol
SHARP is a mechanism based on NVIDIA switch hardware and a central management package.
SHARP offloads collective operations from the hosts' CPUs or GPUs to the network switches and
eliminates the need to send data multiple times between endpoints.
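To illustrate the idea (this is a conceptual model, not the SHARP API), the sketch below shows one switch-level reduction step: the switch sums the partial results arriving on its child ports and forwards a single aggregated value upstream, so far less data traverses the network than in a host-based exchange. Names and values are illustrative.

```c
#include <stdio.h>

#define NUM_CHILDREN 4   /* child ports feeding this switch in the reduction tree */

/* Partial sums arriving from the hosts or switches below this switch. */
static double child_partial[NUM_CHILDREN] = { 1.5, 2.0, 0.5, 4.0 };

/* In-network aggregation, conceptually: the switch reduces the incoming
 * partial results and sends one value up the tree instead of NUM_CHILDREN. */
static double aggregate_at_switch(void)
{
    double sum = 0.0;
    for (int c = 0; c < NUM_CHILDREN; c++)
        sum += child_partial[c];
    return sum;
}

int main(void)
{
    printf("forward aggregated value upstream: %g\n", aggregate_at_switch());
    return 0;
}
```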
SHARP
SHARP - Scalable Hierarchical Aggregation and Reduction Protocol
SHARP decreases the amount of data traversing the network.
As a result, SHARP dramatically improves the performance of accelerated-computing,
MPI-based applications by up to 10x.
InfiniBand Topologies
InfiniBand Topologies
Fat Tree
Torus
Dragonfly+
Hypercube
HyperX
InfiniBand Topologies
By supporting a wide variety of topologies, InfiniBand addresses different
customer requirements, such as:
Easy network scale-out
Minimum latency between fabric nodes
Reduced total cost of ownership
Maximum distance
Maximum blocking ratio
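As a worked sizing example under a common assumption (a non-blocking two-level fat tree built from radix-k switches, with each leaf dedicating k/2 ports to hosts and k/2 to uplinks), the maximum host count is k*k/2. The sketch below computes this for a 40-port switch such as the QM8700; the function name is illustrative.

```c
#include <stdio.h>

/* Maximum hosts in a non-blocking two-level fat tree built from
 * radix-k switches: k leaf switches x (k/2) host ports each = k*k/2. */
static unsigned two_level_fat_tree_hosts(unsigned radix)
{
    return radix * radix / 2;
}

int main(void)
{
    unsigned radix = 40;  /* e.g., a 40-port HDR switch such as the QM8700 */
    printf("radix %u -> up to %u hosts\n", radix, two_level_fat_tree_hosts(radix));
    return 0;
}
```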
Summary
Summary
In this session, we described the key features that make InfiniBand the highest-performing,
most effective, resilient, and best-utilized interconnect technology for accelerated
computing applications in modern data centers.
Summary
Simplified management by the Subnet Manager
High Bandwidth
CPU Offloads and RDMA
End-to-end, best in class latency
Fabric Scalability
Quality of Service – Traffic prioritization capability
Resiliency and fast re-routing of traffic in case of a port failure
Adaptive Routing, providing optimal load balancing and avoiding in-network congestion
In-network computing based on NVIDIA's SHARP mechanism
A wide variety of supported topologies