Apache Knox - Load Balancing
Overview
Load Balancing HTTP
Apache Knox and Load Balancing
Considerations
Example Implementation
Overview
Architecture
Technology Used
References
Overview
Load balancing HTTP services can be quite involved. This document covers some specifics about HTTP load
balancing and Apache Knox. It is meant as a guide to what to consider and does not necessarily cover all
cases.
Load Balancing HTTP
There are quite a few things to be aware of when load balancing HTTP. Each of these can drastically affect the
backend service depending on how the backend HTTP service and load balancer are configured. Related
topics:
● HTTP vs TCP load balancing
● TLS termination for HTTP load balancing
● DNS
○ Domains/Subdomains
○ TTL for DNS changes
● URL rewriting
● Health checks
● Caching
Apache Knox and Load Balancing
There are many different load balancers and many ways to configure them. There is no recommended/supported
setup, and definitely not one that ships as part of HDP. There is a rough guide to using
Apache HTTPD with Apache Knox.
It also depends on which features you are looking to load balance: APIs vs. UIs vs. KnoxSSO. HTTP can be
load balanced in a variety of ways, and it matters whether the context path is changed, how caching is handled,
how cookies are handled, which domains are used, etc.
Considerations
● HTTP vs TCP load balancing
○ Affects SPNEGO authentication, since the service principal is tied to the hostname
● Apache Knox doesn’t load balance backends
○ It will fail over if necessary, but all traffic goes to a single backend instance until that instance dies
○ Putting a load balancer behind Knox can be tricky due to SPNEGO
● Load balancing Kerberos services without Knox, or behind Knox, is technically possible but painful
○ [Link]
● HiveServer2 is stateful
○ Make sure that all Knox instances point to the same HS2 instance if load balancing
○ Ensure clients don’t get redirected to a random HS2 instance
○ Be careful when doing rolling restarts of Knox/HS2
● Non-stateful services (WebHDFS, HBase) are much easier to load balance
○ No need to worry about sticky sessions or about clients going to the same backend
● HTTP clients need to be able to handle cookies/redirects if doing load balancing (see the sketch after this list)
○ Be aware of 3xx and 4xx responses that could be caused by redirect/cookie handling
○ curl has "-L", "--location-trusted", "-c", and "-b" flags to help
● DNS/URL rewriting - changing the context path
○ Will cause issues with UIs that do not handle the rewriting correctly
○ Can also cause cookie issues if cookies are bound to a domain/path
● DNS changes between LB/Knox instances
○ Can cause issues with KnoxSSO cookies, depending on how the DNS is set up
○ Cookies are typically bound to domains
● Load balancing KnoxSSO
○ Make sure the same signing key is used across nodes; otherwise the JWT cannot be verified and users
hit spurious “signed out” issues
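To make the client-side cookie/redirect handling above concrete, below is a minimal Python sketch using the requests library to call WebHDFS through a load-balanced Knox endpoint. The hostname, topology name, credentials, and file path are hypothetical, and the sketch assumes the topology uses HTTP Basic authentication; adjust for your environment. Note that, much like curl without --location-trusted, requests drops the Authorization header when a redirect crosses hosts.

```python
# Minimal sketch: an HTTP client that handles cookies and redirects when
# calling WebHDFS through a load-balanced Knox endpoint.
# Hostname, topology ("default"), credentials, and file path are hypothetical.
import requests

KNOX_BASE = "https://knox.example.com:8443/gateway/default"  # hypothetical LB hostname

# A Session persists cookies across requests, so load-balancer sticky-session
# cookies (and any Knox-issued cookies) are sent back on later calls --
# the equivalent of curl's "-c"/"-b" flags.
session = requests.Session()
session.auth = ("someuser", "somepassword")          # assumes HTTP Basic auth in the topology
session.verify = "/etc/pki/tls/certs/ca-bundle.crt"  # trust store for the LB/Knox TLS cert

# WebHDFS OPEN replies with a redirect; allow_redirects=True follows it
# (the equivalent of curl's "-L"). Unexpected 3xx/4xx responses usually
# point at redirect or cookie handling problems.
resp = session.get(
    f"{KNOX_BASE}/webhdfs/v1/tmp/example.txt",
    params={"op": "OPEN"},
    allow_redirects=True,
    timeout=30,
)
resp.raise_for_status()
print(resp.status_code, len(resp.content))
```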
Example Implementation
Overview
At my last job, we load balanced 4 Knox instances and handled ~10,000 requests per minute across
HBase/WebHDFS/Hive services on ~3 clusters. Most were HBase calls concentrated on the production cluster.
We expanded to multiple data centers and used DNS/load balancing to handle failover when necessary. This
configuration worked well and allowed for zero-downtime upgrades, failover, and load balancing for maintenance.
We only focused on API calls - no UIs or SSO (we started with HDP 2.5.x, which didn’t support Knox proxying
UIs/SSO). We would have had issues with UIs/SSO if we had gone that route, given how we had configured
DNS and the multiple LBs.
All client access to the clusters was over HTTP; there was no SSH access. To reach the HDP clusters, all
traffic was funneled through HTTP and Knox: Spark via Livy, HBase REST, WebHDFS read/write, Hive queries,
and Sqoop via Oozie to ingest RDBMS tables.
Architecture
Technology Used
● Load balancing
○ F5 hardware HTTP LB with sticky sessions
■ Provided hardware failover and was part of the “corporate LB”
■ Avoided TCP load balancing since HTTP was the “default” setup
■ Provided HTTP TLS termination that matched corporate standards
■ Pointed to 4 software load balancers (Traefik)
■ Health checks removed instances that were down
○ Traefik software HTTP LB with sticky sessions
■ The HDP team controlled Traefik
● Didn’t need to wait on the F5 team for LB changes
■ Traefik and Knox were colocated on the same servers
■ Traefik load balanced across the 4 Knox instances
■ Health checks to see if Knox nodes were up/responding
■ TLS termination at Traefik
● Stricter than corporate standards, since the F5 was the only client
● Wildcard TLS certificate to easily LB different domains
■ URL rewriting (only to have “nice” URLs)
● Example rewriting:
○ [Link]
○ [Link]
● Apache Knox
○ Each instance provided access to multiple clusters
■ Allowed for the failure of all but one Knox instance
○ LDAP bind accounts/passwords were per host, not per cluster
■ Enabled changing the LDAP password with zero downtime
● DNS
○ Delegated subdomain for F5, Traefik, and other uses
■ *.[Link]
■ Low TTL on delegated subdomain
■ Used dnsmasq reading from /etc/hosts on multiple hosts
○ Setup details
■ [Link] pointed to the F5 in the main data center
● In a major data center failure, [Link] would be repointed to the F5 in the secondary
data center
● In reality, it was easy for specific applications to fail over to [Link]
○ Not all applications needed to fail over
■ [Link] pointed to the F5 in the main data center
● dc2 pointed to the backup data center
● Could easily test “failover” by pointing to dc2
■ [Link] pointed to Traefik 1 in the main data center
● Repeat for N Traefik nodes
● Allowed pinpointing a single Traefik instance for testing
○ Cookies could be used to pinpoint a specific Traefik instance through the F5 (via F5 cookies) or a
specific backend behind Traefik (via Traefik cookies); see the sketch after this list
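To make the cookie-based pinpointing concrete, here is a minimal Python sketch reusing the same hypothetical hostname and credentials as the earlier example. The cookie names are assumptions: F5 persistence cookies are commonly named BIGipServer<pool>, and the Traefik sticky cookie name depends on how stickiness is configured.

```python
# Minimal sketch: use sticky-session cookies to pin requests to a specific
# backend through the F5 -> Traefik -> Knox chain described above.
# Hostname, credentials, and cookie names/values are hypothetical.
import requests

KNOX_BASE = "https://knox.example.com:8443/gateway/default"  # hypothetical

session = requests.Session()
session.auth = ("someuser", "somepassword")

# First request: the load balancers set their sticky/persistence cookies.
resp = session.get(
    f"{KNOX_BASE}/webhdfs/v1/",
    params={"op": "GETFILESTATUS"},
    timeout=30,
)
resp.raise_for_status()

# Inspect which cookies came back. Reusing the same Session resends them,
# so later calls land on the same F5 pool member and Traefik backend.
for cookie in session.cookies:
    print(f"{cookie.name} = {cookie.value}")

# To deliberately target a particular backend for testing, replay a captured
# cookie value (name and value here are placeholders).
pinned = requests.Session()
pinned.auth = session.auth
pinned.cookies.set("BIGipServerknox_pool", "captured-value-from-above")
```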
References
● [Link] (Using Apache Knox with Apache HTTP Server + mod_proxy + mod_proxy_balancer)
● [Link]
● LB and HiveServer2
○ Cookies - [Link]
○ SPNEGO - [Link]
● [Link]
● [Link]