SlideShare a Scribd company logo
© 2019 Ververica
Till Rohrmann, Engineering Lead
@stsffap
Future of Apache Flink Deployments: Containers,
Kubernetes and More
© 2019 Ververica2
Advancements of APIs & Supported Use Cases
Batch Processing
process static and
historic data
Data Stream
Processing
realtime results
from data streams
Event-driven
Applications
data-driven actions
and services
• Deployment & operations features did not keep up with APIs
© 2019 Ververica3
Flink as a Library
• Deploying Flink applications should be as easy as starting a process
• Application-oriented instead of cluster-oriented
• Start multiple processes which form distributed execution
• Removing the cluster out of the equation
P1
P2 P3 P4
New process
© 2019 Ververica4
Resource Management & Autoscaling
• Multiple short lived stages
• Different resource requirements per
stage
• Efficient execution requires control
over resources
• Flink allocates actively resources
• Long running, continuously processing
operators
• Constrained by external systems and SLAs
• External system with holistic view over
resources better suited for scaling decisions
• Flink reacts to available resources
Real-time &
data-driven
Applications
Batch Analytics
Continuous
Processing
© 2019 Ververica5
Reactive vs. Active Execution Mode
• Active mode
– Flink is aware of underlying cluster framework
– Flink allocate resources
– E.g. existing YARN and Mesos integration
• Reactive mode
– Flink is oblivious to its runtime environment
– External system allocates and releases resources
– Flink scales with respect to available resources
– Relevant for environments: Kubernetes, Docker, as a library
© 2019 Ververica6
Auto Scaling
• Latency
• Throughput
• Connector signals
• Available resources
Scale up
© 2019 Ververica7
Containerized Deployments
• Popular to deploy Flink in containerized environments
• Why Flink on containers?
– Self-contained runtime environment that includes application code, libraries, dependencies, and
configuration files
– Ease of operations by clean separation of concerns
– Offers dynamic resource allocation
• Reactive mode suits containerized deployments well
• Demand for active mode to run batch workloads
© 2019 Ververica8
Containerized Deployments (cont.)
• De-facto industry standard for container orchestration
• Resource-oriented with declarative configuration
– You tell K8s the desired state, and a background process asynchronously makes it happen
– “3 replicas of this container should be kept running”
– “a load balancer should exist, listening on port 443, backed by containers with this label”
• Core resource types:
– Pod: a group of one or more containers running on a node
– Deployment: set of n identical pods
Brief Introduction to Kubernetes
© 2019 Ververica
Active & Reactive Mode
© 2019 Ververica10
Revamped Distributed Architecture (FLIP-6)
– Better separation of concerns
– RESTful API for client-cluster communication
– Full resource elasticity
Dispatcher
Client
ResourceManager ClusterManager
JobManager TaskManager
1. Submit job
2. Start job
3. Request slots
4. Allocate resources
5. Start TaskManager
6. Execute job
© 2019 Ververica11
Active K8s Integration
K8s deployment
controller
Client
TaskManager
JobManager
K8sResMngr
ApplicationMaster
TaskManager
(3) Submit job
(1) Submit AM deployment
(4) Start JM
(5) Request slots
(7) Start TM pod
Dispatcher
© 2019 Ververica12
Reactive Mode
• Relies on external system to start/release TaskManagers
– Kubernetes Horizontal Pod Autoscaler
– GCP Autoscaling
– AWS Auto Scaling Group
• Re-scale job if resources are added/removed
• By definition works with all cluster managers
Flink cluster
JM TM
TM
ASG
Start new TM
if CPU% >
threshold
Register &
offer slots
Event rate over time
© 2019 Ververica
Does Any of This Work?
• Demo: Active K8s Integration & Reactive Container Mode
© 2019 Ververica14
Current State
• Recommendations on how to deploy job and session cluster on K8s
– Docker image build script
– K8s resource configs
• StandaloneJobClusterEntryPoint to start a (containerized) job cluster
• Work in progress:
– Active K8s integration
• https://issues.apache.org/jira/browse/FLINK-9953
• Kudos to Chunhui-Shi
• git clone -b native-k8s git@github.com:chunhui-shi/flink.git
– Reactive Container Mode with automatic rescaling
• https://issues.apache.org/jira/browse/FLINK-10407
© 2019 Ververica15
TL; DL
• Reactive container mode
– Support applications as first class citizen
– Ease of operations
• Active K8s integration coming soon
• Join the community!
– Subscribe to mailing lists
– Participate in Flink development
– Design the future
© 2019 Ververica
Thank you! Questions?
© 2019 Ververica
www.ververica.com @VervericaDatatill@ververica.com

More Related Content

PPTX
Redesigning Apache Flink's Distributed Architecture @ Flink Forward 2017
Till Rohrmann
 
PDF
Flink Forward San Francisco 2018: Jörg Schad and Biswajit Das - "Operating Fl...
Flink Forward
 
PDF
Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp
Flink Forward
 
PPTX
Apache flink 1.7 and Beyond
Till Rohrmann
 
PDF
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Flink Forward
 
PDF
Flink Forward Berlin 2017: Patrick Lucas - Flink in Containerland
Flink Forward
 
PDF
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Ververica
 
PDF
Elastic Streams at Scale @ Flink Forward 2018 Berlin
Till Rohrmann
 
Redesigning Apache Flink's Distributed Architecture @ Flink Forward 2017
Till Rohrmann
 
Flink Forward San Francisco 2018: Jörg Schad and Biswajit Das - "Operating Fl...
Flink Forward
 
Kubernetes + Operator + PaaSTA = Flink @ Yelp - Antonio Verardi, Yelp
Flink Forward
 
Apache flink 1.7 and Beyond
Till Rohrmann
 
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Flink Forward
 
Flink Forward Berlin 2017: Patrick Lucas - Flink in Containerland
Flink Forward
 
Webinar: 99 Ways to Enrich Streaming Data with Apache Flink - Konstantin Knauf
Ververica
 
Elastic Streams at Scale @ Flink Forward 2018 Berlin
Till Rohrmann
 

What's hot (20)

PDF
Flink Forward San Francisco 2019: Building production Flink jobs with Airstre...
Flink Forward
 
PDF
Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul
HostedbyConfluent
 
PPTX
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
Flink Forward
 
PDF
Virtual Flink Forward 2020: How Streaming Helps Your Staging Environment and ...
Flink Forward
 
PDF
Utilizing Kafka Connect to Integrate Classic Monoliths into Modern Microservi...
HostedbyConfluent
 
PPTX
Flink Forward San Francisco 2018 keynote: Srikanth Satya - "Stream Processin...
Flink Forward
 
PPTX
Gatekeeper: API gateway
ChengHui Weng
 
PPTX
Portable Streaming Pipelines with Apache Beam
confluent
 
PDF
2020-05-06 Apache Flink Meetup London: The Easiest Way to Get Operational wit...
Ververica
 
PDF
Streaming your Lyft Ride Prices - Flink Forward SF 2019
Thomas Weise
 
PDF
Kafka on Kubernetes: Keeping It Simple (Nikki Thean, Etsy) Kafka Summit SF 2019
confluent
 
PDF
Beaming flink to the cloud @ netflix ff 2016-monal-daxini
Monal Daxini
 
PDF
Introducing Confluent labs Parallel Consumer client | Anthony Stubbes, Confluent
HostedbyConfluent
 
PPTX
Deploying and Operating KSQL
confluent
 
PPTX
Kubernetes fundamentals
Victor Morales
 
PPTX
Monitoring and Resiliency Testing our Apache Kafka Clusters at Goldman Sachs ...
HostedbyConfluent
 
PDF
Kafka on Kubernetes: Does it really have to be "The Hard Way"? (Viktor Gamov,...
confluent
 
PDF
Flink Connector Development Tips & Tricks
Eron Wright
 
PDF
Flink Forward Berlin 2017: Steffen Hausmann - Build a Real-time Stream Proces...
Flink Forward
 
PDF
Flink Forward SF 2017: Scott Kidder - Building a Real-Time Anomaly-Detection ...
Flink Forward
 
Flink Forward San Francisco 2019: Building production Flink jobs with Airstre...
Flink Forward
 
Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul
HostedbyConfluent
 
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Ver...
Flink Forward
 
Virtual Flink Forward 2020: How Streaming Helps Your Staging Environment and ...
Flink Forward
 
Utilizing Kafka Connect to Integrate Classic Monoliths into Modern Microservi...
HostedbyConfluent
 
Flink Forward San Francisco 2018 keynote: Srikanth Satya - "Stream Processin...
Flink Forward
 
Gatekeeper: API gateway
ChengHui Weng
 
Portable Streaming Pipelines with Apache Beam
confluent
 
2020-05-06 Apache Flink Meetup London: The Easiest Way to Get Operational wit...
Ververica
 
Streaming your Lyft Ride Prices - Flink Forward SF 2019
Thomas Weise
 
Kafka on Kubernetes: Keeping It Simple (Nikki Thean, Etsy) Kafka Summit SF 2019
confluent
 
Beaming flink to the cloud @ netflix ff 2016-monal-daxini
Monal Daxini
 
Introducing Confluent labs Parallel Consumer client | Anthony Stubbes, Confluent
HostedbyConfluent
 
Deploying and Operating KSQL
confluent
 
Kubernetes fundamentals
Victor Morales
 
Monitoring and Resiliency Testing our Apache Kafka Clusters at Goldman Sachs ...
HostedbyConfluent
 
Kafka on Kubernetes: Does it really have to be "The Hard Way"? (Viktor Gamov,...
confluent
 
Flink Connector Development Tips & Tricks
Eron Wright
 
Flink Forward Berlin 2017: Steffen Hausmann - Build a Real-time Stream Proces...
Flink Forward
 
Flink Forward SF 2017: Scott Kidder - Building a Real-Time Anomaly-Detection ...
Flink Forward
 
Ad

Similar to Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink Forward 2019 SF (20)

PDF
Deploying Flink on Kubernetes - David Anderson
Ververica
 
PDF
Creating Real-Time Data Streaming powered by SQL on Kubernetes - Albert Lewan...
GetInData
 
PDF
Stream Processing Solution for the Enterprise
HostedbyConfluent
 
PPTX
Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...
Flink Forward
 
PDF
Apache Flink
Mike Frampton
 
PPTX
Autoscaling Flink with Reactive Mode
Flink Forward
 
PPT
Apache flink-crash-course-by-slim-baltagi-and-srini-palthepu-150817191850-lva...
Yun Lung Li
 
PPT
Apache Flink Crash Course by Slim Baltagi and Srini Palthepu
Slim Baltagi
 
PPTX
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
Flink Forward
 
PPTX
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Flink Forward
 
PPTX
KEYNOTE Flink Forward San Francisco 2019: From Stream Processor to a Unified ...
Flink Forward
 
PDF
How to build a tool for operating Flink on Kubernetes
AndreaMedeghini
 
PPT
Step-by-Step Introduction to Apache Flink
Slim Baltagi
 
PDF
Kubernetes and real-time analytics - how to connect these two worlds with Apa...
GetInData
 
PDF
Flink Forward Berlin 2017: Dominik Bruhn - Deploying Flink Jobs as Docker Con...
Flink Forward
 
PDF
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward
 
PDF
Improvements to Flink & it's Applications in Alibaba Search
DataWorks Summit/Hadoop Summit
 
PPTX
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica
 
PPTX
Stephan Ewen - Running Flink Everywhere
Flink Forward
 
PPTX
Apache Flink in the Cloud-Native Era
Flink Forward
 
Deploying Flink on Kubernetes - David Anderson
Ververica
 
Creating Real-Time Data Streaming powered by SQL on Kubernetes - Albert Lewan...
GetInData
 
Stream Processing Solution for the Enterprise
HostedbyConfluent
 
Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...
Flink Forward
 
Apache Flink
Mike Frampton
 
Autoscaling Flink with Reactive Mode
Flink Forward
 
Apache flink-crash-course-by-slim-baltagi-and-srini-palthepu-150817191850-lva...
Yun Lung Li
 
Apache Flink Crash Course by Slim Baltagi and Srini Palthepu
Slim Baltagi
 
Virtual Flink Forward 2020: Integrate Flink with Kubernetes natively - Yang Wang
Flink Forward
 
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Flink Forward
 
KEYNOTE Flink Forward San Francisco 2019: From Stream Processor to a Unified ...
Flink Forward
 
How to build a tool for operating Flink on Kubernetes
AndreaMedeghini
 
Step-by-Step Introduction to Apache Flink
Slim Baltagi
 
Kubernetes and real-time analytics - how to connect these two worlds with Apa...
GetInData
 
Flink Forward Berlin 2017: Dominik Bruhn - Deploying Flink Jobs as Docker Con...
Flink Forward
 
Flink Forward San Francisco 2019: Managing Flink on Kubernetes - FlinkK8sOper...
Flink Forward
 
Improvements to Flink & it's Applications in Alibaba Search
DataWorks Summit/Hadoop Summit
 
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica
 
Stephan Ewen - Running Flink Everywhere
Flink Forward
 
Apache Flink in the Cloud-Native Era
Flink Forward
 
Ad

More from Till Rohrmann (16)

PDF
Scaling stream data pipelines with Pravega and Apache Flink
Till Rohrmann
 
PDF
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
Till Rohrmann
 
PDF
Apache Flink Meets Apache Mesos And DC/OS @ Mesos Meetup Berlin
Till Rohrmann
 
PDF
Apache Flink® Meets Apache Mesos® and DC/OS
Till Rohrmann
 
PPTX
From Apache Flink® 1.3 to 1.4
Till Rohrmann
 
PDF
Apache Flink and More @ MesosCon Asia 2017
Till Rohrmann
 
PDF
Gilbert: Declarative Sparse Linear Algebra on Massively Parallel Dataflow Sys...
Till Rohrmann
 
PDF
Dynamic Scaling: How Apache Flink Adapts to Changing Workloads (at FlinkForwa...
Till Rohrmann
 
PDF
Streaming Analytics & CEP - Two sides of the same coin?
Till Rohrmann
 
PDF
Apache Flink: Streaming Done Right @ FOSDEM 2016
Till Rohrmann
 
PDF
Streaming Data Flow with Apache Flink @ Paris Flink Meetup 2015
Till Rohrmann
 
PDF
Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015
Till Rohrmann
 
PDF
Interactive Data Analysis with Apache Flink @ Flink Meetup in Berlin
Till Rohrmann
 
PDF
Computing recommendations at extreme scale with Apache Flink @Buzzwords 2015
Till Rohrmann
 
PDF
Machine Learning with Apache Flink at Stockholm Machine Learning Group
Till Rohrmann
 
PDF
Introduction to Apache Flink - Fast and reliable big data processing
Till Rohrmann
 
Scaling stream data pipelines with Pravega and Apache Flink
Till Rohrmann
 
Modern Stream Processing With Apache Flink @ GOTO Berlin 2017
Till Rohrmann
 
Apache Flink Meets Apache Mesos And DC/OS @ Mesos Meetup Berlin
Till Rohrmann
 
Apache Flink® Meets Apache Mesos® and DC/OS
Till Rohrmann
 
From Apache Flink® 1.3 to 1.4
Till Rohrmann
 
Apache Flink and More @ MesosCon Asia 2017
Till Rohrmann
 
Gilbert: Declarative Sparse Linear Algebra on Massively Parallel Dataflow Sys...
Till Rohrmann
 
Dynamic Scaling: How Apache Flink Adapts to Changing Workloads (at FlinkForwa...
Till Rohrmann
 
Streaming Analytics & CEP - Two sides of the same coin?
Till Rohrmann
 
Apache Flink: Streaming Done Right @ FOSDEM 2016
Till Rohrmann
 
Streaming Data Flow with Apache Flink @ Paris Flink Meetup 2015
Till Rohrmann
 
Fault Tolerance and Job Recovery in Apache Flink @ FlinkForward 2015
Till Rohrmann
 
Interactive Data Analysis with Apache Flink @ Flink Meetup in Berlin
Till Rohrmann
 
Computing recommendations at extreme scale with Apache Flink @Buzzwords 2015
Till Rohrmann
 
Machine Learning with Apache Flink at Stockholm Machine Learning Group
Till Rohrmann
 
Introduction to Apache Flink - Fast and reliable big data processing
Till Rohrmann
 

Recently uploaded (20)

PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Software Development Methodologies in 2025
KodekX
 
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 

Future of Apache Flink Deployments: Containers, Kubernetes and More - Flink Forward 2019 SF

  • 1. © 2019 Ververica Till Rohrmann, Engineering Lead @stsffap Future of Apache Flink Deployments: Containers, Kubernetes and More
  • 2. © 2019 Ververica2 Advancements of APIs & Supported Use Cases Batch Processing process static and historic data Data Stream Processing realtime results from data streams Event-driven Applications data-driven actions and services • Deployment & operations features did not keep up with APIs
  • 3. © 2019 Ververica3 Flink as a Library • Deploying Flink applications should be as easy as starting a process • Application-oriented instead of cluster-oriented • Start multiple processes which form distributed execution • Removing the cluster out of the equation P1 P2 P3 P4 New process
  • 4. © 2019 Ververica4 Resource Management & Autoscaling • Multiple short lived stages • Different resource requirements per stage • Efficient execution requires control over resources • Flink allocates actively resources • Long running, continuously processing operators • Constrained by external systems and SLAs • External system with holistic view over resources better suited for scaling decisions • Flink reacts to available resources Real-time & data-driven Applications Batch Analytics Continuous Processing
  • 5. © 2019 Ververica5 Reactive vs. Active Execution Mode • Active mode – Flink is aware of underlying cluster framework – Flink allocate resources – E.g. existing YARN and Mesos integration • Reactive mode – Flink is oblivious to its runtime environment – External system allocates and releases resources – Flink scales with respect to available resources – Relevant for environments: Kubernetes, Docker, as a library
  • 6. © 2019 Ververica6 Auto Scaling • Latency • Throughput • Connector signals • Available resources Scale up
  • 7. © 2019 Ververica7 Containerized Deployments • Popular to deploy Flink in containerized environments • Why Flink on containers? – Self-contained runtime environment that includes application code, libraries, dependencies, and configuration files – Ease of operations by clean separation of concerns – Offers dynamic resource allocation • Reactive mode suits containerized deployments well • Demand for active mode to run batch workloads
  • 8. © 2019 Ververica8 Containerized Deployments (cont.) • De-facto industry standard for container orchestration • Resource-oriented with declarative configuration – You tell K8s the desired state, and a background process asynchronously makes it happen – “3 replicas of this container should be kept running” – “a load balancer should exist, listening on port 443, backed by containers with this label” • Core resource types: – Pod: a group of one or more containers running on a node – Deployment: set of n identical pods Brief Introduction to Kubernetes
  • 9. © 2019 Ververica Active & Reactive Mode
  • 10. © 2019 Ververica10 Revamped Distributed Architecture (FLIP-6) – Better separation of concerns – RESTful API for client-cluster communication – Full resource elasticity Dispatcher Client ResourceManager ClusterManager JobManager TaskManager 1. Submit job 2. Start job 3. Request slots 4. Allocate resources 5. Start TaskManager 6. Execute job
  • 11. © 2019 Ververica11 Active K8s Integration K8s deployment controller Client TaskManager JobManager K8sResMngr ApplicationMaster TaskManager (3) Submit job (1) Submit AM deployment (4) Start JM (5) Request slots (7) Start TM pod Dispatcher
  • 12. © 2019 Ververica12 Reactive Mode • Relies on external system to start/release TaskManagers – Kubernetes Horizontal Pod Autoscaler – GCP Autoscaling – AWS Auto Scaling Group • Re-scale job if resources are added/removed • By definition works with all cluster managers Flink cluster JM TM TM ASG Start new TM if CPU% > threshold Register & offer slots Event rate over time
  • 13. © 2019 Ververica Does Any of This Work? • Demo: Active K8s Integration & Reactive Container Mode
  • 14. © 2019 Ververica14 Current State • Recommendations on how to deploy job and session cluster on K8s – Docker image build script – K8s resource configs • StandaloneJobClusterEntryPoint to start a (containerized) job cluster • Work in progress: – Active K8s integration • https://issues.apache.org/jira/browse/FLINK-9953 • Kudos to Chunhui-Shi • git clone -b native-k8s [email protected]:chunhui-shi/flink.git – Reactive Container Mode with automatic rescaling • https://issues.apache.org/jira/browse/FLINK-10407
  • 15. © 2019 Ververica15 TL; DL • Reactive container mode – Support applications as first class citizen – Ease of operations • Active K8s integration coming soon • Join the community! – Subscribe to mailing lists – Participate in Flink development – Design the future
  • 16. © 2019 Ververica Thank you! Questions?