Performance Monitoring Alwayson Availability Groups: Anthony E. Nocentino

The document discusses monitoring performance in AlwaysOn availability groups. It covers how data is replicated between nodes, how network latency can impact availability, and key metrics to monitor like the send queue and redo queue sizes using dynamic management views. Monitoring replication latency is important to understand how much data could potentially be lost and how long a failover would take. The document provides recommendations on dealing with slow replication, such as reducing log generation, increasing bandwidth, or upgrading SQL Server versions.


Performance Monitoring AlwaysOn Availability Groups
Anthony E. Nocentino
[email protected]
Anthony E. Nocentino

• Consultant and Trainer

• Founder and President of Centino Systems

• Specialize in system architecture and performance

• Master’s in Computer Science (almost a PhD)

• Friend of Redgate - 2015/2016

• Microsoft Certified Professional

• email: [email protected]

• Twitter: @nocentino

• Blog: www.centinosystems.com/blog

• Pluralsight Author: www.pluralsight.com

Overview

• Motivation
• How availability groups move data
• Impact of replication latency on availability
• Monitoring techniques
• Demo
• Dealing with replication latency
Why is this important?
• Recovery Objectives
• Recovery Point Objective - RPO
• Recovery Time Objective - RTO
• Availability
• How much data can we lose?
• How fast will the system fail over?
• Monitoring and Trending
• Establish a baseline for analysis - are we meeting those objectives?
• Impact on resources
• Ownership
• All of the components are monitored by the DBA
Data Movement In Availability Groups

• Transaction log blocks are replicated to secondaries

• Replication mode
• Synchronous
• Asynchronous
• Database mirroring endpoint
Network Based Replication

• Strong working relationship with network team

• Maintenance - patching, network outages, database
• Network conditions can impact your AG’s availability
• Latency - how long it takes for a packet of data to traverse the network from source to destination
• Bandwidth - how much data can be moved in a time interval
Network Latency

• Often measured in milliseconds, sometimes microseconds

• Directly impacts network throughput
• TCP sliding window
• ping isn’t your best measure of latency: by default it doesn’t include any load, so measure under your actual workload
• It’s often up to us to PROVE to the network team there is an issue
• Pinging 192.168.2.1 with 32 bytes of data:
• Reply from 192.168.2.1: bytes=32 time=1001ms TTL=128
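A rough sketch of why latency directly impacts throughput under a TCP sliding window: a sender can have at most one window of unacknowledged data in flight per round trip, so throughput is capped at window size divided by round-trip time. The 64 KiB window and the function name are illustrative assumptions, not values from the talk; real network stacks usually negotiate larger windows through window scaling.

```python
def max_tcp_throughput_bytes_per_sec(window_bytes: int, rtt_ms: float) -> float:
    """Upper bound on single-connection throughput: one window per round trip."""
    if rtt_ms <= 0:
        raise ValueError("rtt_ms must be positive")
    return window_bytes / (rtt_ms / 1000.0)


if __name__ == "__main__":
    window = 64 * 1024  # hypothetical 64 KiB window, no window scaling
    for rtt in (1, 10, 100, 1001):  # 1001 ms matches the ping reply above
        mb_per_sec = max_tcp_throughput_bytes_per_sec(window, rtt) / 1_000_000
        print(f"RTT {rtt:>4} ms -> at most {mb_per_sec:.3f} MB/sec")
```

The same window that moves tens of megabytes per second at 1 ms moves only tens of kilobytes per second at one second of latency, which is the kind of number that helps when making the case to the network team.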
Availability Group Flow Control

• Used in response to network and system conditions

• Log blocks exchange sequence numbers
• The AG will enter flow control mode IF:
• The primary detects too many unacknowledged messages; the primary then stops sending messages
• The secondary needs the primary to back off, likely due to resource constraints; it sends a flow control message to the primary
• Primary polls every 1000ms for a change in flow control state
• Secondary will message primary to leave flow control mode

From: SQL Server PFE Blog - http://bit.ly/1ZpGyIL
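The flow control rules above can be pictured with a toy model. This is only a sketch of the behavior described on the slide, not SQL Server's implementation: the class, its method names, and the threshold of 3 unacknowledged messages are all hypothetical, and the assumption that the 1000 ms poll releases a primary-side throttle once acknowledgements catch up is a simplification.

```python
class PrimarySender:
    """Toy model of the primary's side of AG flow control (illustrative only)."""

    def __init__(self, max_unacked: int):
        self.max_unacked = max_unacked    # hypothetical threshold
        self.unacked = 0                  # messages sent but not yet acknowledged
        self.primary_throttled = False    # primary saw too many unacknowledged messages
        self.secondary_throttled = False  # secondary asked the primary to back off

    @property
    def in_flow_control(self) -> bool:
        return self.primary_throttled or self.secondary_throttled

    def try_send(self) -> bool:
        """Send one log block message unless flow control is active."""
        if self.in_flow_control:
            return False
        self.unacked += 1
        if self.unacked >= self.max_unacked:
            self.primary_throttled = True  # stop sending further messages
        return True

    def on_ack(self, count: int = 1) -> None:
        """The secondary acknowledged `count` messages."""
        self.unacked = max(0, self.unacked - count)

    def on_secondary_message(self, back_off: bool) -> None:
        """Flow control message from the secondary: enter (True) or leave (False)."""
        self.secondary_throttled = back_off

    def poll(self) -> None:
        """Called on the 1000 ms timer: re-check whether sending can resume."""
        if self.unacked < self.max_unacked:
            self.primary_throttled = False


if __name__ == "__main__":
    sender = PrimarySender(max_unacked=3)
    print([sender.try_send() for _ in range(5)])  # the last sends are refused
    sender.on_ack(2)
    sender.poll()
    print(sender.try_send())                      # sending resumes after the poll
```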

Database Synchronization States

• Not synchronizing
• Synchronized
• Synchronizing
• Reverting
• Initializing

https://msdn.microsoft.com/en-us/library/ff877972.aspx
Failover Modes

• Automatic
• Synchronous mode only
• Commonly used within a data center
• Synchronization state must be synchronized
• Manual
• Synchronous or Asynchronous
• Commonly used between data centers

https://msdn.microsoft.com/en-us/library/hh213151.aspx
Send Queue

• Queues log blocks to be sent to the secondaries
• Each replica maintains its own view of the send queue
• Queued data is at risk of loss in the event of a primary failure
• The send queue can grow due to an unreachable secondary, a network outage, network latency, or a large amount of data change
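Because queued log is what is lost if the primary fails, the send queue can be translated into a rough "seconds of log at risk" figure and compared against the RPO. This is a minimal sketch that assumes a steady log generation rate; the function names and sample numbers are hypothetical, and the queue size is taken in KB to match log_send_queue_size.

```python
def seconds_of_log_at_risk(log_send_queue_kb: float,
                           log_generation_kb_per_sec: float) -> float:
    """Approximate how many seconds of generated log are still unsent."""
    if log_generation_kb_per_sec <= 0:
        raise ValueError("log_generation_kb_per_sec must be positive")
    return log_send_queue_kb / log_generation_kb_per_sec


def meets_rpo(log_send_queue_kb: float,
              log_generation_kb_per_sec: float,
              rpo_seconds: float) -> bool:
    """True when the unsent log represents no more time than the RPO allows."""
    at_risk = seconds_of_log_at_risk(log_send_queue_kb, log_generation_kb_per_sec)
    return at_risk <= rpo_seconds


if __name__ == "__main__":
    # Hypothetical sample: 500 MB queued while generating 2 MB of log per second.
    queue_kb, rate_kb = 512_000, 2_048
    print(seconds_of_log_at_risk(queue_kb, rate_kb), "seconds of log unsent")
    print("meets a 60 second RPO:", meets_rpo(queue_kb, rate_kb, 60))
```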
Redo Queue

• Queues log blocks received on the secondary
• Each replica has its own redo queue
• On failover, the redo queue must be completely processed
• The redo queue can grow due to a slow disk subsystem, resource contention, or a sustained outage and subsequent reconnection of a secondary
Send Queue Impact on Availability
• When log generation on the primary exceeds the rate at which log blocks can be sent to the secondaries…
• No automatic failover
• Data loss
• Stale data for reporting from secondaries
• Stale data for off-loaded backups on secondaries
• Off-loaded log backups can fail
• Transaction delay
• Fill up transaction logs
• Even in synchronous mode!
Redo Queue Impact on Availability
• When log blocks arrive on the secondary faster than the redo thread can process them…
• Delayed failover
• Detect failure
• Process Redo Queue
• Crash recover database
• Stale data for reporting from secondaries
• Stale data for off-loaded backups on secondaries
• Off-loaded log backups can fail
• Transaction delay
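The three steps of a delayed failover listed above (detect failure, process redo queue, crash recover database) add up to a rough failover time that can be compared against the RTO. A minimal sketch, assuming the redo rate stays constant while the queue drains; the function name and the sample values are hypothetical, with the queue in KB and the rate in KB/sec.

```python
def estimated_failover_seconds(detect_sec: float,
                               redo_queue_kb: float,
                               redo_rate_kb_per_sec: float,
                               recovery_sec: float) -> float:
    """Detection time + time to drain the redo queue + crash recovery time."""
    if redo_rate_kb_per_sec <= 0:
        raise ValueError("redo_rate_kb_per_sec must be positive")
    redo_drain_sec = redo_queue_kb / redo_rate_kb_per_sec
    return detect_sec + redo_drain_sec + recovery_sec


if __name__ == "__main__":
    # Hypothetical sample: 10 s to detect, 1,024,000 KB queued at 51,200 KB/sec, 5 s recovery.
    total = estimated_failover_seconds(10, 1_024_000, 51_200, 5)
    print(f"estimated failover time: {total:.0f} seconds")
```

With a large redo queue, the drain term dominates the total, which is why the redo queue is worth trending.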
Log Backups
Transaction Delay

• In synchronous mode, when secondaries are behind, queries on the primary can be delayed
• HADR_SYNC_COMMIT
• HADR_SYNCHRONIZING_THROTTLE - replica back online
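One way to put a number on transaction delay is the average HADR_SYNC_COMMIT wait between two snapshots of sys.dm_os_wait_stats, using its waiting_tasks_count and wait_time_ms columns. The sketch below only does the arithmetic on two snapshots that have already been collected; the WaitSnapshot type, the function name, and the sample values are hypothetical.

```python
from typing import NamedTuple, Optional


class WaitSnapshot(NamedTuple):
    """One reading of a single wait type from sys.dm_os_wait_stats."""
    waiting_tasks_count: int
    wait_time_ms: int


def avg_wait_ms(before: WaitSnapshot, after: WaitSnapshot) -> Optional[float]:
    """Average wait per task between two snapshots, or None if nothing waited.

    The counters are cumulative, so a negative delta (for example after an
    instance restart) is also reported as None rather than a bogus average.
    """
    tasks = after.waiting_tasks_count - before.waiting_tasks_count
    if tasks <= 0:
        return None
    return (after.wait_time_ms - before.wait_time_ms) / tasks


if __name__ == "__main__":
    earlier = WaitSnapshot(waiting_tasks_count=1_000, wait_time_ms=5_000)
    later = WaitSnapshot(waiting_tasks_count=1_500, wait_time_ms=9_000)
    print("average HADR_SYNC_COMMIT wait (ms):", avg_wait_ms(earlier, later))
```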
Maintenance Events That Can Impact Availability

• Bulk data modifications

• Database maintenance
• Network or server maintenance

• Carefully plan maintenance

• Collaborate with other teams!
Monitoring AG Performance

• Dynamic Management Views

• sys.dm_hadr_database_replica_states
• Perfmon Counters
• SQL Server:Availability Replica
• Replication data - messages sent, bytes sent, flow control
• SQL Server:Database Replica
• Database data - log bytes sent, queue sizes, transaction delay per database
Measuring Replication Latency
• sys.dm_hadr_database_replica_states
• log_send_queue_size
• log_send_rate
• redo_queue_size
• redo_rate
• On the primary there’s a row for each database on each replica
• On the secondaries there’s a row for each database on that replica
• Pull replication
• Log send queue is from primary to secondary
• Offline
• log_send_queue_size changes to NULL
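A minimal sketch of reading those columns and handling the NULL case. The query text uses only columns of sys.dm_hadr_database_replica_states; running it (driver, connection string) is left out, and the summarize function, its output wording, and the choice to skip the local row are assumptions made for this illustration.

```python
# T-SQL intended to be run on the primary; queue sizes are reported in KB.
REPLICA_STATE_QUERY = """
SELECT DB_NAME(database_id) AS database_name,
       replica_id,
       is_local,
       synchronization_state_desc,
       log_send_queue_size,
       redo_queue_size
FROM sys.dm_hadr_database_replica_states;
"""


def summarize(rows):
    """Turn DMV rows (as dicts) into one status line per remote database replica."""
    lines = []
    for row in rows:
        if row["is_local"]:
            continue  # assumption: this sketch only reports on the secondaries
        send_kb = row["log_send_queue_size"]
        redo_kb = row["redo_queue_size"]
        if send_kb is None:
            status = "send queue is NULL; replica may be offline"
        else:
            redo_text = "unknown" if redo_kb is None else f"{redo_kb} KB"
            status = f"send queue {send_kb} KB, redo queue {redo_text}"
        lines.append(f'{row["database_name"]} on {row["replica_id"]}: {status}')
    return lines


if __name__ == "__main__":
    sample = [  # hypothetical rows, shaped like the query's result set
        {"database_name": "Sales", "replica_id": "r1", "is_local": True,
         "log_send_queue_size": None, "redo_queue_size": None},
        {"database_name": "Sales", "replica_id": "r2", "is_local": False,
         "log_send_queue_size": 0, "redo_queue_size": 12},
        {"database_name": "Sales", "replica_id": "r3", "is_local": False,
         "log_send_queue_size": None, "redo_queue_size": None},
    ]
    print("\n".join(summarize(sample)))
```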
Measuring Replication Latency - ugh!!!
• Well, it looks like sys.dm_hadr_database_replica_states doesn’t report the correct values for log_send_rate and redo_rate
• Documented as KB
• Reported on Connect
• https://connect.microsoft.com/SQLServer/Feedback/Details/928582
• Known bug in SQL Server 2012 or 2014
• https://support.microsoft.com/en-us/kb/3012182
• Cumulative Update 5 or better
• Observed in SQL 2016
• Perfmon!
Measuring Latency with Perfmon

• Primary
• SQLServer:Databases - Log Bytes Flushed/sec
• SQLServer:Availability Replica - Bytes Sent to Replica/sec (compressed)
• Network Interface - Bytes Sent/sec

• Secondaries
• SQLServer:Availability Replica - Bytes Received from Replica/sec (compressed)
• SQLServer:Database Replica - Log Bytes Received/sec (log send rate/decompressed)
• SQLServer:Database Replica - Redone Bytes/sec (log redo rate)
• Network Interface - Bytes Received/sec
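Pairing Log Bytes Flushed/sec on the primary with Log Bytes Received/sec on a secondary gives a simple running estimate of how far that secondary is falling behind. A minimal sketch, assuming both counters were sampled at the same fixed interval and both describe uncompressed log; the function name and the sample values are hypothetical.

```python
def backlog_history_bytes(flushed_per_sec, received_per_sec, interval_sec):
    """Running estimate of unsent log implied by paired counter samples.

    Each sample adds (flushed - received) * interval to the backlog, and the
    backlog is never allowed to drop below zero.
    """
    backlog = 0.0
    history = []
    for flushed, received in zip(flushed_per_sec, received_per_sec):
        backlog = max(0.0, backlog + (flushed - received) * interval_sec)
        history.append(backlog)
    return history


if __name__ == "__main__":
    # Hypothetical samples (bytes/sec) taken every 10 seconds during a bulk load.
    flushed = [1_000, 5_000, 5_000, 500]
    received = [1_000, 2_000, 2_000, 2_000]
    print(backlog_history_bytes(flushed, received, interval_sec=10))
```

The backlog grows while the primary flushes faster than the secondary receives, and shrinks again once the load ends and the secondary catches up.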
Measuring Latency with Extended Events
• In SQL 2014 - SP2
• Each event has a measured duration
• Primary
• hadr_log_block_group_commit
• log_block_pushed_to_logpool
• log_flush_start
• hadr_log_block_compression
• hadr_capture_log_block
• ucs_connection_send_msg
• hadr_log_block_send_complete
• log_flush_complete
• hadr_receive_harden_lsn_message
• hadr_db_commit_mgr_harden
• Synchronous Secondary
• hadr_transport_receive_log_block_message
• hadr_log_block_decompression
• hadr_apply_log_block
• log_block_pushed_to_logpool
• log_flush_start
• log_flush_complete
• hadr_send_harden_lsn_message
• ucs_connection_send_msg
• hadr_lsn_send_complete
Monitoring Tools

• Build your own

• AlwaysOn Dashboard
• Third Party Tool
• SQL Sentry Performance Advisor
• Redgate SQL Monitor
Demo
Real World Example

109GB
248GB
Dealing With Slow Replication Latency
• Identify your bottleneck and mitigate it
• Minimize log generation
• Use smart index maintenance/Better Indexes
• More bandwidth
• Perhaps a dedicated network connection
• Better hardware
• Log throughput on the secondaries needs to be equal to that of the primary
• Upgrade SQL Server
• 2012 single threaded redo - ~45MB/sec
• 2016 multi-threaded redo - ~600MB/sec
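To put the two redo rates in perspective, the arithmetic below shows how long a backlog would take to redo at each quoted figure. The 100 GB backlog is a hypothetical value chosen for illustration, and the rates are the approximate numbers from the slide rather than guarantees for any particular system.

```python
# Approximate redo throughput figures quoted above, in MB/sec.
REDO_RATES_MB_PER_SEC = {
    "2012 single-threaded redo": 45,
    "2016 multi-threaded redo": 600,
}


def redo_drain_minutes(backlog_gb: float, rate_mb_per_sec: float) -> float:
    """Minutes needed to redo `backlog_gb` of log at a constant rate."""
    if rate_mb_per_sec <= 0:
        raise ValueError("rate_mb_per_sec must be positive")
    return backlog_gb * 1024 / rate_mb_per_sec / 60


if __name__ == "__main__":
    backlog_gb = 100  # hypothetical redo backlog
    for label, rate in REDO_RATES_MB_PER_SEC.items():
        minutes = redo_drain_minutes(backlog_gb, rate)
        print(f"{label}: about {minutes:.1f} minutes for {backlog_gb} GB")
```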
Key Takeaways
• It is imperative to track and trend replication latency in your Availability Groups so you can answer the questions
• How much data will I lose?
• How long will it take to fail over?
• Monitor and trend log_send_queue_size and redo_queue_size in sys.dm_hadr_database_replica_states on replicas to measure availability impact
• Understand how much log is generated in your databases
• Understand your system’s operations, consider downtime for patching and network maintenance
Key Takeaways

• Plan database maintenance

• Use a smart index maintenance strategy!
• Offloaded backups
• If availability is most important, back up on the primary
Need more data?

http://www.centinosystems.com/blog/talks/
Links to resources
Demos
Presentation

Free SQL Monitor!

Send me a tweet @nocentino @redgate #sqlsatsac

#SQLMONITOR

What was that “one thing” you walked away with from today’s talk?

THIS IS AN AMAZ…ING DEAL WORTH $3000!!!
Yours for the cost of a Tweet :)
Thank You!
http://www.sqlsaturday.com/540/sessions/sessionevaluation.aspx

Thanks to the SQLSaturday Sacramento Team!

Questions?
References

• http://www.centinosystems.com/blog/sql/designing-for-offloaded-backups-in-alwayson-availability-groups/
• http://www.centinosystems.com/blog/sql/designing-for-offloaded-log-backups-in-alwayson-availability-groups-monitoring/
• http://www.centinosystems.com/blog/sql/monitoring-availability-groups-with-redgates-sql-monitor
• https://msdn.microsoft.com/en-us/library/ff878537.aspx
• https://msdn.microsoft.com/en-us/library/ff877972.aspx
