©2008–18 New Relic, Inc. All rights reserved
©2008–18 New Relic, Inc. All rights reserved
Tori Wieldt, Sr. Solutions Manager
SRE-iously!
Defining the Principles, Habits, and Practices of
Site Reliability Engineering
©2008–18 New Relic, Inc. All rights reserved
This document and the information herein (including any information that may be incorporated by reference) is provided for informational purposes only and should
not be construed as an offer, commitment, promise or obligation on behalf of New Relic, Inc. (“New Relic”) to sell securities or deliver any product, material, code,
functionality, or other feature. Any information provided hereby is proprietary to New Relic and may not be replicated or disclosed without New Relic’s express written
permission.
Such information may contain forward-looking statements within the meaning of federal securities laws. Any statement that is not a historical fact or refers to
expectations, projections, future plans, objectives, estimates, goals, or other characterizations of future events is a forward-looking statement. These forward-looking
statements can often be identified as such because the context of the statement will include words such as “believes,” “anticipates,” “expects” or words of similar
import.
Actual results may differ materially from those expressed in these forward-looking statements, which speak only as of the date hereof, and are subject to change at
any time without notice. Existing and prospective investors, customers and other third parties transacting business with New Relic are cautioned not to place undue
reliance on this forward-looking information. The achievement or success of the matters covered by such forward-looking statements are based on New Relic’s
current assumptions, expectations, and beliefs and are subject to substantial risks, uncertainties, assumptions, and changes in circumstances that may cause the
actual results, performance, or achievements to differ materially from those expressed or implied in any forward-looking statement. Further information on factors that
could affect such forward-looking statements is included in the filings we make with the SEC from time to time. Copies of these documents may be obtained by
visiting New Relic’s Investor Relations website at ir.newrelic.com or the SEC’s website at www.sec.gov.
New Relic assumes no obligation and does not intend to update these forward-looking statements, except as required by law. New Relic makes no warranties,
expressed or implied, in this document or otherwise, with respect to the information provided.
©2008–18 New Relic, Inc. All rights reserved
THE SRE HANDBOOK
But what if you’re 

not Google?
©2008–18 New Relic, Inc. All rights reserved
A LITTLE BACKGROUND
Ruby Monolith
Siloed teams
Infrequent Releases
TO
300+ Microservices
50+ Engineering Teams with embedded SREs
20-70 Deploys a Day
Huuuge Kafka Cluster
*with embedded SREs
FROM
©2008–18 New Relic, Inc. All rights reserved
HOW IT WAS
*with embedded SREs
On-Premises
On Premises 

Relational Data
Customers
NoSQL 

Data Store
Public Cloud
Micro Services
API
Mobile
Apps
Browser
©2008–18 New Relic, Inc. All rights reserved
WE ASKED OUR STAKEHOLDERS
©2008–18 New Relic, Inc. All rights reserved
WE ASKED OUR STAKEHOLDERS
Why do we have SREs at New Relic?
What’s the vision for our SRE team?
How can SREs most effectively contribute 

to the future of our platform?
©2008–18 New Relic, Inc. All rights reserved 8©2008–18 New Relic, Inc. All rights reserved
©2008–18 New Relic, Inc. All rights reserved
Continuously improve 

the reliability of systems
in the 

New Relic platform
ONE GOAL
8©2008–18 New Relic, Inc. All rights reserved
©2008–18 New Relic, Inc. All rights reserved
TWO ROLES
9
“Pure” SRE
Build and support our
core internal platform:
Container Fabric
Networking Systems
Partner with Eng Teams
Domain Experts in:
Reliability
Tooling
Scaling
Embedded SRE
©2008–18 New Relic, Inc. All rights reserved
THREE SPHERES
10
RELIABILITYSTABILITY ENGINEERING
©2008–18 New Relic, Inc. All rights reserved 11
• Fix team problems manually.

• Track down hardware defects on servers.

• Provision _______________.
TOIL
©2008–18 New Relic, Inc. All rights reserved 11
• Fix team problems manually.

• Track down hardware defects on servers.

• Provision _______________.
TOIL
Repetitive work that scales linearly.
©2008–18 New Relic, Inc. All rights reserved 12
Encourage Best Practices
• Audit runbooks

• Hold “game days”
• Meet with product architects 

• Create team risk matrices
Stay Current
©2008–18 New Relic, Inc. All rights reserved 13
Reduce Sprawl
• Migrate teams to our 

code pipeline
• Build tools
• Clean up alerts
• Build an integration
Improve Monitoring
©2008–18 New Relic, Inc. All rights reserved
KEYS TO SRE SUCCESS
14
Reliability 

is a feature

Reliability depends 

on shared
understanding
SRE is a challenging, 

cross-disciplinary
practice
©2008–18 New Relic, Inc. All rights reserved
KEYS TO SRE SUCCESS
14
Reliability 

is a feature

Query your stakeholders
Reliability depends 

on shared
understanding
SRE is a challenging, 

cross-disciplinary
practice
©2008–18 New Relic, Inc. All rights reserved
KEYS TO SRE SUCCESS
14
Reliability 

is a feature

Query your stakeholders
Reliability depends 

on shared
understanding
Develop clear, 

specific guidelines
SRE is a challenging, 

cross-disciplinary
practice
©2008–18 New Relic, Inc. All rights reserved
KEYS TO SRE SUCCESS
14
Reliability 

is a feature

Query your stakeholders
Reliability depends 

on shared
understanding
Develop clear, 

specific guidelines
SRE is a challenging, 

cross-disciplinary
practice
Build a strong 

SRE community
©2008–18 New Relic, Inc. All rights reserved
A TEMPLATE FOR SUCCESS
15
Determine Your Goal
Example:
Continuously improve the
reliability of the systems of our
company’s platform
Establish Roles
Example:
Pure SRE

Embedded SRE
21 3
Focus Areas
Examples:
Stability

Reliability

Engineering
©2008–18 New Relic, Inc. All rights reserved
DETAILS IN OUR SRE EBOOK
16
bit.ly/NewRelicSRE @ToriWieldt
©2008–18 New Relic, Inc. All rights reserved
FIRESIDE CHAT
17
TIM O’BRIEN
Managing Director, 

Cloud Services and Information Security
©2008–18 New Relic, Inc. All rights reserved
FIRESIDE CHAT
17
TIM O’BRIEN
Managing Director, 

Cloud Services and Information Security
©2008–18 New Relic, Inc. All rights reserved
THANK YOU!
@ToriWieldt
linkedin.com/in/toriwieldt
©2008–18 New Relic, Inc. All rights reserved
WHAT SRES DO
19
Champion reliability best practices.

Guide designs and processes with an eye toward resilience and low toil.

Reduce technical complexity and sprawl.

Drive the usage of tooling and common components.

Implement software and tooling to improve resilience and automate operations.
©2008–18 New Relic, Inc. All rights reserved

More Related Content

PDF
FS18 Chicago Keynote
PDF
Ground Rules for Code Reviews
PPTX
Monitor all your Kubernetes and EKS stack with New Relic
PPTX
Host for the Most: Cloud Cost Optimization
PPTX
New Relic Infrastructure in the Real World: AWS
PPTX
Monitoring is Not Just for Production!
PPTX
Keeping Modern Applications Performing
PDF
10 Things You Can Do With New Relic - Number 9 Will Shock You
FS18 Chicago Keynote
Ground Rules for Code Reviews
Monitor all your Kubernetes and EKS stack with New Relic
Host for the Most: Cloud Cost Optimization
New Relic Infrastructure in the Real World: AWS
Monitoring is Not Just for Production!
Keeping Modern Applications Performing
10 Things You Can Do With New Relic - Number 9 Will Shock You

What's hot (18)

PPTX
DevOps without Measurement is a Fail
PPTX
SRE-iously! Reliability!
PPTX
Best Practices for Measuring your Code Pipeline
PPTX
Fail Better
PDF
Microservices Practitioner Summit Jan '15 - Designing APIs with Customers in ...
PDF
Engineering and Autonomy in the Age of Microservices - Nic Benders, New Relic
PPTX
Cloud Migration Acceptance Testing - Prove Success
PPTX
Measureable Cloud Migration
PPTX
re:Thinking the Cloud
PPTX
Cloud Adoption Best Practices with New Relic
PPTX
Three Monitoring Mistakes and How to Avoid Them
PPTX
7 Tips & Tricks to Having Happy Customers at Scale
PPTX
FutureStack'19 Closing Keynote
PPTX
How to Lower or Justify your Cloud Spend
PPTX
Rock Stars, Builders, and Janitors: You're Doing It Wrong, New Relic [FutureS...
PDF
Inversion of Control: How New Relic’s Engineers Picked Their Own Jobs and Bui...
PPTX
Our Evolution to GraphQL: Unifying our API Strategy
PPTX
Top Three Mistakes People Make with Monitoring
DevOps without Measurement is a Fail
SRE-iously! Reliability!
Best Practices for Measuring your Code Pipeline
Fail Better
Microservices Practitioner Summit Jan '15 - Designing APIs with Customers in ...
Engineering and Autonomy in the Age of Microservices - Nic Benders, New Relic
Cloud Migration Acceptance Testing - Prove Success
Measureable Cloud Migration
re:Thinking the Cloud
Cloud Adoption Best Practices with New Relic
Three Monitoring Mistakes and How to Avoid Them
7 Tips & Tricks to Having Happy Customers at Scale
FutureStack'19 Closing Keynote
How to Lower or Justify your Cloud Spend
Rock Stars, Builders, and Janitors: You're Doing It Wrong, New Relic [FutureS...
Inversion of Control: How New Relic’s Engineers Picked Their Own Jobs and Bui...
Our Evolution to GraphQL: Unifying our API Strategy
Top Three Mistakes People Make with Monitoring
Ad

Similar to SRE-iously (20)

PPTX
SRE-iously! Defining the Principles, Habits, and Practices of Site Reliabilit...
PPTX
SRE-iously: Defining the Principles, Habits, and Practices of Site Reliabilit...
PDF
The SRE Report 2024 - Great Findings for the teams
PDF
Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
PPTX
What is Site Reliability Engineering (SRE)
PDF
Essential_Skills_of_a_Site_Reliability_E.pdf
PDF
How We Try to Make a Lion Bulletproof; Setting up SRE in a Global Financial O...
PDF
Site Reliability Engineering slide deck 101
PDF
Upskill Yourself With GSDC Site Reliability Engineering Certification
PDF
Site-Reliability-Engineering-v2[6241].pdf
PDF
Changing The Laws Of Engineering With Github Pull Requests
PPTX
Lew Cirne, FS16 Keynote [FutureStack16]
PDF
Getting started with Site Reliability Engineering (SRE)
PDF
Sre summary
PPTX
DevOps Torino Meetup - SRE Concepts
PPTX
SRE (service reliability engineer) on big DevOps platform running on the clou...
PDF
Explore the Future of Digital Success with Site Reliability Engineering
PDF
SRE Model: You Should be aware you want to know
PPTX
Track Welcome: New Relic 101 [FutureStack16]
PPTX
ADDO_2022_SRE Architectural Patterns_Nov10.pptx
SRE-iously! Defining the Principles, Habits, and Practices of Site Reliabilit...
SRE-iously: Defining the Principles, Habits, and Practices of Site Reliabilit...
The SRE Report 2024 - Great Findings for the teams
Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
What is Site Reliability Engineering (SRE)
Essential_Skills_of_a_Site_Reliability_E.pdf
How We Try to Make a Lion Bulletproof; Setting up SRE in a Global Financial O...
Site Reliability Engineering slide deck 101
Upskill Yourself With GSDC Site Reliability Engineering Certification
Site-Reliability-Engineering-v2[6241].pdf
Changing The Laws Of Engineering With Github Pull Requests
Lew Cirne, FS16 Keynote [FutureStack16]
Getting started with Site Reliability Engineering (SRE)
Sre summary
DevOps Torino Meetup - SRE Concepts
SRE (service reliability engineer) on big DevOps platform running on the clou...
Explore the Future of Digital Success with Site Reliability Engineering
SRE Model: You Should be aware you want to know
Track Welcome: New Relic 101 [FutureStack16]
ADDO_2022_SRE Architectural Patterns_Nov10.pptx
Ad

More from New Relic (13)

PPTX
7 Tips & Tricks to Having Happy Customers at Scale
PDF
New Relic University at Future Stack Tokyo 2019
PDF
FutureStack Tokyo 19 -[事例講演]株式会社リクルートライフスタイル:年間9300万件以上のサロン予約を支えるホットペッパービューティ...
PDF
FutureStack Tokyo 19 -[New Relic テクニカル講演]モニタリングと可視化がデジタルトランスフォーメーションを救う! - サ...
PDF
FutureStack Tokyo 19 -[特別講演]システム開発によろこびと驚きの連鎖を
PDF
FutureStack Tokyo 19 -[パートナー講演]アマゾン ウェブ サービス ジャパン株式会社: New Relicを活用したAWSへのアプリ...
PDF
FutureStack Tokyo 19_インサイトとデータを組織の力にする_株式会社ドワンゴ 池田 明啓 氏
PPTX
Intro to Multidimensional Kubernetes Monitoring
PPTX
Understanding Microservice Latency for DevOps Teams: An Introduction to New R...
PPTX
Kubernetes in the Wild: Best Practices for Monitoring
PPTX
Kick Ass Data Exploration through Dashboards
PPTX
Ground Rules for Code Reviews: Improving development velocity and team commun...
PPTX
You’re ready to migrate, but how will you prove success?
7 Tips & Tricks to Having Happy Customers at Scale
New Relic University at Future Stack Tokyo 2019
FutureStack Tokyo 19 -[事例講演]株式会社リクルートライフスタイル:年間9300万件以上のサロン予約を支えるホットペッパービューティ...
FutureStack Tokyo 19 -[New Relic テクニカル講演]モニタリングと可視化がデジタルトランスフォーメーションを救う! - サ...
FutureStack Tokyo 19 -[特別講演]システム開発によろこびと驚きの連鎖を
FutureStack Tokyo 19 -[パートナー講演]アマゾン ウェブ サービス ジャパン株式会社: New Relicを活用したAWSへのアプリ...
FutureStack Tokyo 19_インサイトとデータを組織の力にする_株式会社ドワンゴ 池田 明啓 氏
Intro to Multidimensional Kubernetes Monitoring
Understanding Microservice Latency for DevOps Teams: An Introduction to New R...
Kubernetes in the Wild: Best Practices for Monitoring
Kick Ass Data Exploration through Dashboards
Ground Rules for Code Reviews: Improving development velocity and team commun...
You’re ready to migrate, but how will you prove success?

Recently uploaded (20)

PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PPTX
future_of_ai_comprehensive_20250822032121.pptx
PDF
Transform-Quality-Engineering-with-AI-A-60-Day-Blueprint-for-Digital-Success.pdf
PDF
NewMind AI Weekly Chronicles – August ’25 Week IV
PDF
LMS bot: enhanced learning management systems for improved student learning e...
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PPTX
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
PPTX
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PPTX
MuleSoft-Compete-Deck for midddleware integrations
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PDF
Rapid Prototyping: A lecture on prototyping techniques for interface design
PDF
Advancing precision in air quality forecasting through machine learning integ...
PDF
Co-training pseudo-labeling for text classification with support vector machi...
PDF
Data Virtualization in Action: Scaling APIs and Apps with FME
DOCX
Basics of Cloud Computing - Cloud Ecosystem
PDF
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
PDF
4 layer Arch & Reference Arch of IoT.pdf
PDF
giants, standing on the shoulders of - by Daniel Stenberg
PPTX
SGT Report The Beast Plan and Cyberphysical Systems of Control
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
future_of_ai_comprehensive_20250822032121.pptx
Transform-Quality-Engineering-with-AI-A-60-Day-Blueprint-for-Digital-Success.pdf
NewMind AI Weekly Chronicles – August ’25 Week IV
LMS bot: enhanced learning management systems for improved student learning e...
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
Improvisation in detection of pomegranate leaf disease using transfer learni...
MuleSoft-Compete-Deck for midddleware integrations
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
Rapid Prototyping: A lecture on prototyping techniques for interface design
Advancing precision in air quality forecasting through machine learning integ...
Co-training pseudo-labeling for text classification with support vector machi...
Data Virtualization in Action: Scaling APIs and Apps with FME
Basics of Cloud Computing - Cloud Ecosystem
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
4 layer Arch & Reference Arch of IoT.pdf
giants, standing on the shoulders of - by Daniel Stenberg
SGT Report The Beast Plan and Cyberphysical Systems of Control

SRE-iously

  • 1. ©2008–18 New Relic, Inc. All rights reserved
  • 2. ©2008–18 New Relic, Inc. All rights reserved Tori Wieldt, Sr. Solutions Manager SRE-iously! Defining the Principles, Habits, and Practices of Site Reliability Engineering
  • 3. ©2008–18 New Relic, Inc. All rights reserved This document and the information herein (including any information that may be incorporated by reference) is provided for informational purposes only and should not be construed as an offer, commitment, promise or obligation on behalf of New Relic, Inc. (“New Relic”) to sell securities or deliver any product, material, code, functionality, or other feature. Any information provided hereby is proprietary to New Relic and may not be replicated or disclosed without New Relic’s express written permission. Such information may contain forward-looking statements within the meaning of federal securities laws. Any statement that is not a historical fact or refers to expectations, projections, future plans, objectives, estimates, goals, or other characterizations of future events is a forward-looking statement. These forward-looking statements can often be identified as such because the context of the statement will include words such as “believes,” “anticipates,” “expects” or words of similar import. Actual results may differ materially from those expressed in these forward-looking statements, which speak only as of the date hereof, and are subject to change at any time without notice. Existing and prospective investors, customers and other third parties transacting business with New Relic are cautioned not to place undue reliance on this forward-looking information. The achievement or success of the matters covered by such forward-looking statements are based on New Relic’s current assumptions, expectations, and beliefs and are subject to substantial risks, uncertainties, assumptions, and changes in circumstances that may cause the actual results, performance, or achievements to differ materially from those expressed or implied in any forward-looking statement. Further information on factors that could affect such forward-looking statements is included in the filings we make with the SEC from time to time. Copies of these documents may be obtained by visiting New Relic’s Investor Relations website at ir.newrelic.com or the SEC’s website at www.sec.gov. New Relic assumes no obligation and does not intend to update these forward-looking statements, except as required by law. New Relic makes no warranties, expressed or implied, in this document or otherwise, with respect to the information provided.
  • 4. ©2008–18 New Relic, Inc. All rights reserved THE SRE HANDBOOK But what if you’re 
 not Google?
  • 5. ©2008–18 New Relic, Inc. All rights reserved A LITTLE BACKGROUND Ruby Monolith Siloed teams Infrequent Releases TO 300+ Microservices 50+ Engineering Teams with embedded SREs 20-70 Deploys a Day Huuuge Kafka Cluster *with embedded SREs FROM
  • 6. ©2008–18 New Relic, Inc. All rights reserved HOW IT WAS *with embedded SREs On-Premises On Premises 
 Relational Data Customers NoSQL 
 Data Store Public Cloud Micro Services API Mobile Apps Browser
  • 7. ©2008–18 New Relic, Inc. All rights reserved WE ASKED OUR STAKEHOLDERS
  • 8. ©2008–18 New Relic, Inc. All rights reserved WE ASKED OUR STAKEHOLDERS Why do we have SREs at New Relic? What’s the vision for our SRE team? How can SREs most effectively contribute 
 to the future of our platform?
  • 9. ©2008–18 New Relic, Inc. All rights reserved 8©2008–18 New Relic, Inc. All rights reserved
  • 10. ©2008–18 New Relic, Inc. All rights reserved Continuously improve 
 the reliability of systems in the 
 New Relic platform ONE GOAL 8©2008–18 New Relic, Inc. All rights reserved
  • 11. ©2008–18 New Relic, Inc. All rights reserved TWO ROLES 9 “Pure” SRE Build and support our core internal platform: Container Fabric Networking Systems Partner with Eng Teams Domain Experts in: Reliability Tooling Scaling Embedded SRE
  • 12. ©2008–18 New Relic, Inc. All rights reserved THREE SPHERES 10 RELIABILITYSTABILITY ENGINEERING
  • 13. ©2008–18 New Relic, Inc. All rights reserved 11 • Fix team problems manually.
 • Track down hardware defects on servers.
 • Provision _______________. TOIL
  • 14. ©2008–18 New Relic, Inc. All rights reserved 11 • Fix team problems manually.
 • Track down hardware defects on servers.
 • Provision _______________. TOIL Repetitive work that scales linearly.
  • 15. ©2008–18 New Relic, Inc. All rights reserved 12 Encourage Best Practices • Audit runbooks
 • Hold “game days” • Meet with product architects 
 • Create team risk matrices Stay Current
  • 16. ©2008–18 New Relic, Inc. All rights reserved 13 Reduce Sprawl • Migrate teams to our 
 code pipeline • Build tools • Clean up alerts • Build an integration Improve Monitoring
  • 17. ©2008–18 New Relic, Inc. All rights reserved KEYS TO SRE SUCCESS 14 Reliability 
 is a feature
 Reliability depends 
 on shared understanding SRE is a challenging, 
 cross-disciplinary practice
  • 18. ©2008–18 New Relic, Inc. All rights reserved KEYS TO SRE SUCCESS 14 Reliability 
 is a feature
 Query your stakeholders Reliability depends 
 on shared understanding SRE is a challenging, 
 cross-disciplinary practice
  • 19. ©2008–18 New Relic, Inc. All rights reserved KEYS TO SRE SUCCESS 14 Reliability 
 is a feature
 Query your stakeholders Reliability depends 
 on shared understanding Develop clear, 
 specific guidelines SRE is a challenging, 
 cross-disciplinary practice
  • 20. ©2008–18 New Relic, Inc. All rights reserved KEYS TO SRE SUCCESS 14 Reliability 
 is a feature
 Query your stakeholders Reliability depends 
 on shared understanding Develop clear, 
 specific guidelines SRE is a challenging, 
 cross-disciplinary practice Build a strong 
 SRE community
  • 21. ©2008–18 New Relic, Inc. All rights reserved A TEMPLATE FOR SUCCESS 15 Determine Your Goal Example: Continuously improve the reliability of the systems of our company’s platform Establish Roles Example: Pure SRE
 Embedded SRE 21 3 Focus Areas Examples: Stability
 Reliability
 Engineering
  • 22. ©2008–18 New Relic, Inc. All rights reserved DETAILS IN OUR SRE EBOOK 16 bit.ly/NewRelicSRE @ToriWieldt
  • 23. ©2008–18 New Relic, Inc. All rights reserved FIRESIDE CHAT 17 TIM O’BRIEN Managing Director, 
 Cloud Services and Information Security
  • 24. ©2008–18 New Relic, Inc. All rights reserved FIRESIDE CHAT 17 TIM O’BRIEN Managing Director, 
 Cloud Services and Information Security
  • 25. ©2008–18 New Relic, Inc. All rights reserved THANK YOU! @ToriWieldt linkedin.com/in/toriwieldt
  • 26. ©2008–18 New Relic, Inc. All rights reserved WHAT SRES DO 19 Champion reliability best practices.
 Guide designs and processes with an eye toward resilience and low toil.
 Reduce technical complexity and sprawl.
 Drive the usage of tooling and common components.
 Implement software and tooling to improve resilience and automate operations.
  • 27. ©2008–18 New Relic, Inc. All rights reserved