SlideShare a Scribd company logo
Measure All The Things!
Gary Dusbabek
Rackspace
@gdusbabek
Motivation
What You Really Want
Kinds of Metrics
How To Do It
Prognostication
Motivation
It’s all
about
the data
We are generating data at an insane rate.
We are generating data at an insane rate.
2006 
IDC estimates 161 Exabytes of
data on the Internet

That is 161 MM 1T drives
2009
988 Exabytes of data
6x growth in 4 years
Almost 1B 1T drives
A zetabyte 21 zeroes

Source http://www.emc.com/collateral/analyst-reports/expanding-digital-idc-white-paper.pdf
Measure All the Things! - Austin Data Day 2014
2012
Internet was estimated to be shipping
roughly 2.5 exabytes of data daily.


Daily

Not counting the NSA
Transferring
Data

Generates
Data
Metadata!
Secondary
Information
A by-product
Example 1
Cloud Monitoring


Is the website up?

GET HTTP/1.1
Status=200
Bytes=432
Time to connect=15ms
Time to first byte=21ms
Duration=28ms
Example 2
Netflix


You want to watch an
episode of Buffy
Observations



What titles you click on
What time of day you started watching
When you paused
Parts you re-watched
When you finished (if you finished)
Useless to people
consuming the
primary data.
Priceless when you’re
trying to understand
behavior.
behavior
Understanding = Knowledge
In these cases all the
data generated is
time-series
Time Series Data


Related events 
sorted by time 
of occurrence
Example
0600 – Wake up
0601 – Checked Hacker News
0605 – Shower
0630 – Breakfast
0630 – Checked Hacker News
0700 – Left for work
0730 – Arrived at work
Etc…
Think about how you’d
store something like this if
you were building a
backend system
Relational Database Much?
You
When
0600

What
Wake
up

0601

Checked
Hacker
News

0605

Shower

0630 Breakfast
0630
0700
0730
0731

Checked
Hacker
News

Left for
work
Arrive
at work
Checked
Hacker
News
Who

When

You

0600

What
Wake
up

You

0601

Checked
Hacker
News

You

0605

Shower

You

0630 Breakfast

You

0630

You

0700

You

0730

You

0731

Checked
Hacker
News

Left for
work
Arrive
at work
Checked
Hacker
News
Who

When

You

0600

You

0601

Friend 0603
Friend 0604

What
Wake
up

Checked
Hacker
News

Wake
up

Checked
Hacker
News

You

0605

You

0630 Breakfast

You

0630

You

0700

Friend 0715

You
You

0730
0731

Shower

Checked
Hacker
News

Left for
work
Left for
work
Arrive
at work
Checked
Hacker
News
Other Ways?
Measure All the Things! - Austin Data Day 2014
Less Appealing
Column Oriented
0600

Wake
up

0601

Checked
Hacker
News

0605

Shower

Friend 0603

Wake
up

0604

Checked
Hacker
News

0715

Left for
work

You

0630 Breakfast 0630

Checked
Hacker
News

0700

Left for
work

0730

Arrive
at work

0731

Checked
Hacker
News
What You Really Want
You
run a
business
You
want to
make
money
You
want to
make
money

Show me the
money!
You need to
make
decisions
You need to
make the right
decisions
How do you do that?
With
your gut
With data
Example
API responses
are taking a
long time.
It’s probably
the database.
You add a few indexes.
You allocate more memory.
You get faster disks.
You get bigger processors.
Maybe it’s the
network…
You replace ethernet adapters.

You get faster switches.
You replace the cabling.
Crap!
Trace it!
500 ms for entire request



15 ms on the wire getting there.
200 ms to auth
50 ms looking up account
50 ms looking up other stuff
15 ms on the wire getting back.
170 ms rendering in the browser
500 ms for entire request



15 ms on the wire getting there.
200 ms to auth
50 ms looking up account
50 ms looking up other stuff
15 ms on the wire getting back.
170 ms rendering in the browser
Make the right
decisions with
data.
You need a metrics system
Take these things into account:

Availability
Redundancy
Accuracy
And your budget
Example: Pretty Graphs
If graphs go away, do
you lose money?
The CEO likes
them.
Do graphs help
you make
decisions?
Example:
Usage Billing
Will losing data
cost you money?
Data Lifecycle
When can I throw it away?
How much work is
throwing it away?
How much work is
throwing it away?
More work
means it probably
won’t happen.
Kinds of Metrics
{Volume, Frequency}

⨯ {Low, High}
Low Volume, High Frequency
5,6,5,6
Things observed infrequently
Almost always changes
Low storage overhead
Bulk operations are easy
Usually uninteresting
Low Volume, Low Frequency
5,5,5,6
Roughly the same as LVHF
High Volume, Low Frequency
5,5,5,5,5,5,5,5,5,6,6,6,6,6,6,6,6,6,6,6,6,7,7

Constantly observed
But doesn’t change much
Optimizations!
Detect and record only level changes
Requires caching
High Volume, High Frequency


34,4,7,345,6,4,2,54,67,5,6,55,74,5,3,2,5,6745…
High Volume, High Frequency


34,4,7,345,6,4,2,54,67,5,6,55,74,5,3,2,5,6745…
Numeric vs String
Most will be numeric
Some are strings
Usually low frequency
Special handling
Numeric vs String
High frequency strings are
a sign you’re doing
something wrong or need a
different system.
Gauges
Current value of something
Operation: snapshot
Speedometer
Thermometer
CPU utilization
Counter
Exists as a set of operations
–  Operation: increment
–  Operation: decrement


Read by selecting over time and
summing


Example: hits on a website
Different than unique hits
Set
statsD
Number of uniquely seen items
Think: Conditional counter
Example: number of unique
visitors
Timer
How long something takes


Statistics (mean, median,
min, max, percentiles)

How many times it has
happened


Rate at which it is
happened
Uses a sliding window
Histograms
Distribution of data


Example: when people
visit your site
Measure All the Things! - Austin Data Day 2014
Measure All the Things! - Austin Data Day 2014
How Do You Do It?
If you make software

Instrument it!
Java?
https://github.com/codahale/metrics


Node.js?
https://github.com/mikejihbe/metrics

Others?
Of course
If you run systems


Instrument them!
Get data via agent
Get data via pollers
Considerations: inside or outside of
your network
StatsD
https://github.com/etsy/statsd
Ingests, aggregates, flushes

Use a client to send your data
Pushes aggregations
Graphite
Databases
Flat files of JSON
Wherever
Graphite
http://graphite.wikidot.com
Makes graphs
Pluggable backends (NEW!!!11)
Scaling problems
Buy Enterprise Software
These exist, but I’m an open
source hacker and can’t say
much about them.
Roll Your Own
Easier than you think
Harder than you think
Roll Your Own
Three components
Ingestion
Aggregation/Rollup
Query/Graphing
Avoid Pileups



1 sample per second
3,600 samples per hour
86,400 samples per day
31,536,000 samples per year
1k of storage?
(roughly) 32 gigabytes
Measure All the Things! - Austin Data Day 2014
No!
Measure all the
right things!
Does this measurement matter?
You don’t care about it when it changes


You aren’t doing anything with it


You can’t figure out what actions to take
from it 




(it’s meaningless)
Recent data
will almost
always be
most
important.
Monitoring vs Aggregation
Graphite collects data that is
already aggregated. 
You are observing history
Looking for patterns
No alerting
Where Things Are Going
Complex Event Analysis
ESPER (my favorite).
– Mostly open source.


Not enough projects though L
Data Intelligence
You need this if you don’t know
what questions you ought to
ask

Correlating signals in order to
make useful conclusions
Thanks!
@gdusbabek
Photos from the Flickr CC collection

train 
data

dump

truck 
traffic 
byproduct
watching 
numbers 
birds 
moons 
cake

business 
guts

data 2 
choices 
flowers 
metrics 
gauge 
counter 
marbles 
timer 
windmils 
logs

train 
tower 

h"p://www.flickr.com/photos/vxla/4673817364/sizes/z/	
  
h"p://www.flickr.com/photos/tensafefrogs/3649985674/sizes/z/	
  
h"p://www.flickr.com/photos/seanhobson/3906189027/sizes/l/	
  
h"p://www.flickr.com/photos/shankaronline/7291507876/sizes/l/	
  
h"p://www.flickr.com/photos/honou/3350764803/sizes/l/	
  
h"p://www.flickr.com/photos/jdickert/2152739544/sizes/l/	
  
h"p://www.flickr.com/photos/28misguidedsouls/6517859113/sizes/z/	
  
h"p://www.flickr.com/photos/55176801@N02/7911595842/sizes/o/	
  
h"p://www.flickr.com/photos/johnkay/3764457497/sizes/l/	
  
h"p://www.flickr.com/photos/andykirk/412600169/sizes/l/	
  
h"p://www.flickr.com/photos/jeff-­‐anderson/4385042770/sizes/l/	
  
h"p://www.flickr.com/photos/sgis/6532363/sizes/o/	
  
h"p://www.flickr.com/photos/whatbe"erNme/405735418/sizes/l/	
  
h"p://www.flickr.com/photos/rachubarama/2709346242/sizes/l/	
  
h"p://www.flickr.com/photos/femto-­‐photography/4604878864/sizes/o/	
  
h"p://www.flickr.com/photos/pixx0ne/5689978130/sizes/l/	
  
h"p://www.flickr.com/photos/ruth_w/8432567657/sizes/l/	
  
h"p://www.flickr.com/photos/wesley_lelieveld/8571911541/sizes/l/	
  
h"p://www.flickr.com/photos/lifeasart/242208550/sizes/l/	
  
h"p://www.flickr.com/photos/mrsenil/2219108948/sizes/l/	
  
h"p://www.flickr.com/photos/crisNc/2773883011/sizes/l/	
  
h"p://www.flickr.com/photos/ma"blaze/4491948497/sizes/l/	
  
h"p://www.flickr.com/photos/kenNsh/43788618/sizes/o/	
  
h"p://www.flickr.com/photos/dtanist/10809534755/sizes/l/	
  
h"p://www.flickr.com/photos/jarodcarruthers/10372829184/sizes/l/	
  

More Related Content

PDF
Putting the Magic in Data Science
PDF
PyData 2015 Keynote: "A Systems View of Machine Learning"
PDF
Less is More: Behind the Data at Risk I/O
PPTX
BIG DATA | How to explain it & how to use it for your career?
PPT
Big data v4.0
PPTX
Big Data: The Magic to Attain New Heights
PPTX
My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015
Putting the Magic in Data Science
PyData 2015 Keynote: "A Systems View of Machine Learning"
Less is More: Behind the Data at Risk I/O
BIG DATA | How to explain it & how to use it for your career?
Big data v4.0
Big Data: The Magic to Attain New Heights
My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015

Viewers also liked (6)

PDF
Introduction to Blueflood at Berlin Buzzwords 2013
PDF
Blueflood and Beyond: The Future of Metrics - Berlin Buzzwords 2014
PDF
How To (Not) Open Source - Javazone, Oslo 2014
PDF
Blueflood: Open Source Metrics Processing at CassandraEU 2013
ODT
Hoe schrijf ik een persbericht
PDF
パーソナライズニュースを支えるML業務のまわしかた@Yahoo! JAPAN
Introduction to Blueflood at Berlin Buzzwords 2013
Blueflood and Beyond: The Future of Metrics - Berlin Buzzwords 2014
How To (Not) Open Source - Javazone, Oslo 2014
Blueflood: Open Source Metrics Processing at CassandraEU 2013
Hoe schrijf ik een persbericht
パーソナライズニュースを支えるML業務のまわしかた@Yahoo! JAPAN
Ad

Similar to Measure All the Things! - Austin Data Day 2014 (20)

PDF
Metrics driven development 10.09.2014
PDF
Monitoring and observability
PDF
Making operations visible - Nick Gallbreath
PDF
Making operations visible - devopsdays tokyo 2013
PDF
Monitoring as Software Validation
PDF
Mozilla Foundation Metrics - presentation to engineers
PPTX
StasD & Graphite - Measure anything, Measure Everything
KEY
Trending with Purpose
PPTX
Engineers guide to data analysis
PDF
OSMC 2016 | The Engineer's guide to Data Analysis by Avishai Ish-Shalom
PDF
OSMC 2016 - The Engineer's guide to Data Analysis by Avishai Ish-Shalom
PDF
Proactive performance monitoring with adaptive thresholds
PPTX
Observability - The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
PPTX
Real-Time Metrics and Distributed Monitoring - Jeff Pierce, Change.org - Dev...
PPTX
Bigdata Hadoop introduction
PDF
I Know What You Did Last Summer
PDF
Winning the metrics battle
PDF
The 5 Biggest Enterprise Architecture challenges solved with real-time metric...
PDF
Observability cookbook JUGtoberfest Poznan 2024-10-16
Metrics driven development 10.09.2014
Monitoring and observability
Making operations visible - Nick Gallbreath
Making operations visible - devopsdays tokyo 2013
Monitoring as Software Validation
Mozilla Foundation Metrics - presentation to engineers
StasD & Graphite - Measure anything, Measure Everything
Trending with Purpose
Engineers guide to data analysis
OSMC 2016 | The Engineer's guide to Data Analysis by Avishai Ish-Shalom
OSMC 2016 - The Engineer's guide to Data Analysis by Avishai Ish-Shalom
Proactive performance monitoring with adaptive thresholds
Observability - The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
Real-Time Metrics and Distributed Monitoring - Jeff Pierce, Change.org - Dev...
Bigdata Hadoop introduction
I Know What You Did Last Summer
Winning the metrics battle
The 5 Biggest Enterprise Architecture challenges solved with real-time metric...
Observability cookbook JUGtoberfest Poznan 2024-10-16
Ad

More from gdusbabek (10)

PDF
Rackspace Cloud Monitoring - Strata NYC
PPTX
Austin cassandra meetup
PPTX
How Rackspace Cloud Monitoring uses Cassandra
PPTX
Breaking the Relational Headlock: A Survey of NoSQL Datastores
PPTX
Building Rackspace Cloud Monitoring
PPTX
Cassandra Codebase 2011
PPTX
Data Modeling with Cassandra Column Families
PPTX
Getting to Know the Cassandra Codebase
PPTX
Introduction to Cassandra (June 2010)
PPTX
Cassandra Presentation for San Antonio JUG
Rackspace Cloud Monitoring - Strata NYC
Austin cassandra meetup
How Rackspace Cloud Monitoring uses Cassandra
Breaking the Relational Headlock: A Survey of NoSQL Datastores
Building Rackspace Cloud Monitoring
Cassandra Codebase 2011
Data Modeling with Cassandra Column Families
Getting to Know the Cassandra Codebase
Introduction to Cassandra (June 2010)
Cassandra Presentation for San Antonio JUG

Recently uploaded (20)

PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Electronic commerce courselecture one. Pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
KodekX | Application Modernization Development
PPTX
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Empathic Computing: Creating Shared Understanding
PDF
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
Transforming Manufacturing operations through Intelligent Integrations
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Spectral efficient network and resource selection model in 5G networks
Review of recent advances in non-invasive hemoglobin estimation
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Electronic commerce courselecture one. Pdf
Chapter 3 Spatial Domain Image Processing.pdf
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
KodekX | Application Modernization Development
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Empathic Computing: Creating Shared Understanding
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Modernizing your data center with Dell and AMD
Transforming Manufacturing operations through Intelligent Integrations
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Reach Out and Touch Someone: Haptics and Empathic Computing
Spectral efficient network and resource selection model in 5G networks

Measure All the Things! - Austin Data Day 2014