NoSQL Evolution: CAP Theorem Insights

NoSQL databases take a bottom-up approach in the Unix tradition: simple APIs and mechanisms that can be composed. Compared with relational databases, they gain performance and availability by relaxing consistency when the network partitions. The CAP theorem formalized this tradeoff: a shared-data system cannot guarantee consistency, availability, and partition tolerance all at once. NoSQL systems favor availability during partitions and use techniques such as eventual consistency and compensating transactions to restore consistency afterward.

NoSQL: Past, Present, Future

Eric Brewer
Professor, UC Berkeley
VP Infrastructure, Google

QCon SF
November 8, 2012

Charles Bachman, 1973 Turing Award

Integrated Data Store (IDS)
• A (very) early “No SQL” database
• A “navigational” database

Tight integration between code and data
• Database = linked groups of records (“CODASYL”)
• Pointers were physical names; today we hash
• Programmer acts as “navigator” through the links
• Similar to a DOM engine, the WWW, graph DBs
• Used for its high performance, but…
• … hard to program and maintain
• Hard to evolve the schema (embedded in code)

Wikipedia: “IDMS”

Why Relational? (1970s)

Need a high-level model (sets)
• Separate the data from the code
• SQL is the (only) API
• Data outlasts any particular implementation, because the model doesn’t change
Goal: implement the top-down model well
• Led to transactions as a tool
• Declarative language leaves room for optimization
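The last point can be illustrated with a small sketch. It is not from the talk: it assumes Python's standard `sqlite3` module, and the table, index, and helper names are made up for illustration. The same SQL text is run before and after a purely physical change (adding an index); the result stays the same while the planner picks a different access path.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, name TEXT)")
conn.executemany(
    "INSERT INTO users (email, name) VALUES (?, ?)",
    [(f"user{i}@example.com", f"User {i}") for i in range(1000)],
)

QUERY = "SELECT name FROM users WHERE email = ?"
PARAMS = ("user42@example.com",)


def query_plan(connection, sql, params):
    """Return the planner's description of how it intends to run `sql`."""
    rows = connection.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The human-readable detail is the last column of each plan row.
    return " | ".join(row[-1] for row in rows)


plan_before = query_plan(conn, QUERY, PARAMS)
result_before = conn.execute(QUERY, PARAMS).fetchall()

# Physical change only: the query text and the data model stay the same.
conn.execute("CREATE INDEX idx_users_email ON users (email)")

plan_after = query_plan(conn, QUERY, PARAMS)
result_after = conn.execute(QUERY, PARAMS).fetchall()

print("before:", plan_before)
print("after: ", plan_after)
```

The exact plan wording varies between SQLite versions, so the sketch only relies on the index name showing up in the second plan.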

Also 1970s: Unix

“The most important job of UNIX is to provide a file system”
– original 1974 Unix paper

Bottom-up world view
• Few, simple, efficient mechanisms
• Layers and composition
• “Navigational”
• Evolution comes from APIs, encapsulation

NoSQL is in this Unix tradition
• Examples: dbm (1979 kv), gdbm, Berkeley DB, JDBM
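To give a feel for how small a dbm-style API is, here is a minimal sketch using Python's standard `dbm.dumb` module (a pure-Python member of the dbm family, chosen here only because it is always available). The key names and the file path are illustrative assumptions.

```python
import dbm.dumb
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "inventory")

    # "c" opens the store for reading and writing, creating it if needed.
    with dbm.dumb.open(path, "c") as db:
        db[b"sku:1001"] = b"widget"   # put
        db[b"sku:1002"] = b"gadget"

    # Reopen read-only: the whole interface is get/put on byte strings.
    with dbm.dumb.open(path, "r") as db:
        fetched = db[b"sku:1001"]          # get
        missing = db.get(b"sku:9999")      # None when the key is absent
        key_count = len(db.keys())

print(fetched, missing, key_count)
```

Everything else (indices, transactions, queries that return sets) has to be built on top, which is the bottom-up trade the slide describes.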

Two Valid World Views

Relational View
• Top down
• Clean model, ACID transactions
• Two kinds of developers: DB authors, SQL programmers
• Values: clean semantics, set operations, easy long-term evolution
• Venues: SIGMOD, VLDB

Systems View
• Bottom up
• Build on top, evolve modules
• One kind of programmer: integrated use
• Values: good APIs, flexibility, range of possible programs
• Venues: SOSP, OSDI

NoSQL in Context

Large reusable storage component

Systems values:
• Layered, ideally modular APIs
• Enable a range of systems and semantics
Some things to build on top over time:
• Multi-component transactions
• Secondary indices
• Evolution story
• Returning sets of data, not just values

How did I get here…
• Modern cluster-based server (1995)
  – Scalable, highly available, commodity clusters
  – Inktomi search engine (1996), proxy cache (1998)
• But didn't use a DBMS
  – Informix was 10x slower for the search engine
  – Instead, custom servers on top of file systems
• Led to “ACID vs. BASE” spectrum (1997)
  – Basically Available, Soft State, Eventual Consistency
  – … but BASE was not well received… (ACID was sacred)

Genesis of the CAP Theorem
• I felt the design choices we made were “right”:
  – Sufficient (and faster)
  – Necessary (consistency hinders performance/availability)
• Started to notice other systems that made similar decisions: Coda, Bayou
• Developed CAP while teaching in 1998
  – Appears in 1999
  – PODC keynote in 2000, led to Gilbert/Lynch proof
• … but nothing changed (for a while)

CAP Theorem
• Choose at most two for any shared-data system:
  – Consistency (linearizable)
  – Availability (system always accepts updates)
  – Partition Tolerance
• Partitions are inevitable for the wide area
  – => consistency vs. availability
• I think this was the right phrasing for 2000
  – But probably not for 2010
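The consistency-versus-availability choice can be made concrete with a toy model. This is only a sketch under simplifying assumptions: two in-memory replicas, synchronous replication when the link is up, and a `policy` label ("CP" or "AP") that exists only in this example.

```python
class Unavailable(Exception):
    """Raised when a replica refuses a request to protect consistency."""


class Replica:
    def __init__(self, name, policy):
        self.name = name
        self.policy = policy        # "CP" or "AP" (labels used only in this sketch)
        self.value = None
        self.peer = None
        self.partitioned = False

    def write(self, value):
        if self.partitioned:
            if self.policy == "CP":
                # Keep consistency: refuse the update, giving up availability.
                raise Unavailable(f"{self.name}: cannot reach peer")
            # Keep availability: accept locally, so the replicas may diverge.
            self.value = value
            return
        # No partition: update both copies before returning.
        self.value = value
        self.peer.value = value


def make_pair(policy):
    a, b = Replica("a", policy), Replica("b", policy)
    a.peer, b.peer = b, a
    return a, b


def set_partition(a, b, flag):
    a.partitioned = b.partitioned = flag


# CP pair: writes fail during the partition, copies stay equal.
cp_a, cp_b = make_pair("CP")
cp_a.write(1)
set_partition(cp_a, cp_b, True)
try:
    cp_a.write(2)
    cp_write_accepted = True
except Unavailable:
    cp_write_accepted = False

# AP pair: both sides keep accepting writes and end up with different values.
ap_a, ap_b = make_pair("AP")
ap_a.write(1)
set_partition(ap_a, ap_b, True)
ap_a.write(2)
ap_b.write(3)

print(cp_write_accepted, cp_a.value, cp_b.value, ap_a.value, ap_b.value)
```

With no partition both pairs behave identically, which matches the slide's point that the tradeoff only bites while a partition lasts.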

Things CAP does NOT say…

1. Give up on consistency (in the wide area)
   • Inconsistency should be the exception
   • Many projects give up more than needed
2. Give up on transactions (ACID)
   • Need to adjust “C” and “I” expectations (only)
3. Don’t use SQL
   • SQL is appearing in “NoSQL” systems
   • Declarative languages fit well with CAP

CAP & ACID

No partitions => Full ACID
With partitions:
Atomic:
• Partitions should occur between operations (!)
• Each side should use atomic ops
Consistent:
• Temporarily violate this (e.g. no duplicates?)
Isolation:
• Temporarily lose this by definition
Durable:
• Should never forfeit this (and we need it later)
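The “no duplicates?” example can be sketched as follows. This is an illustration under assumed names (`Site`, `register`, `find_conflicts` are not from the talk): each side enforces a uniqueness rule locally, both accept the same new name during the partition, and the durable per-side logs are what later reveal the violation.

```python
class Site:
    """One side of a partition, enforcing 'no duplicate usernames' locally."""

    def __init__(self, name, usernames=()):
        self.name = name
        self.usernames = set(usernames)
        self.log = []   # durable record of what this side accepted

    def register(self, username):
        if username in self.usernames:      # local integrity check only
            return False
        self.usernames.add(username)
        self.log.append(("register", username, self.name))
        return True


def find_conflicts(log_a, log_b):
    """Return usernames that both sides registered during the partition."""
    names_a = {user for _, user, _ in log_a}
    names_b = {user for _, user, _ in log_b}
    return sorted(names_a & names_b)


shared = {"alice"}                     # state S before the partition
east, west = Site("east", shared), Site("west", shared)

ok_east = east.register("bob")         # accepted: unique on this side
ok_west = west.register("bob")         # also accepted: the other side cannot see it
dup_east = east.register("alice")      # rejected: the local check still works

conflicts = find_conflicts(east.log, west.log)
print(ok_east, ok_west, dup_east, conflicts)
```

Durability matters here because without the logs there would be nothing to compare once the partition heals.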

Single-site transactions
• Atomic transaction, but only within one site

Claim 1: partitions are temporary

•  Provide degraded service for a while
•  Then RECOVER
Claim 2: can detect “partition mode”
•  Timeout => effectively partitioned
•  Commit locally? (A) => partition started
•  Fail? (C)
•  Retry just means postpone the decision a bit
Claim 3: impacts lazy vs. eager consistency
•  Lazy => can’t recover consistency during partition
•  Can only choose A in some sense
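Claim 2 can be sketched as a small decision procedure. The names below (`Node`, `PeerTimeout`, the `prefer` flag) are assumptions made for this example, and the network is replaced by a plain callable so the sketch stays deterministic.

```python
class PeerTimeout(Exception):
    """Stands in for a network timeout when contacting the other side."""


class Node:
    def __init__(self, send_to_peer, prefer):
        self.send_to_peer = send_to_peer   # callable; may raise PeerTimeout
        self.prefer = prefer               # "A" (availability) or "C" (consistency)
        self.partition_mode = False
        self.committed = []
        self.pending_log = []              # ops the peer has not seen yet

    def update(self, op):
        try:
            self.send_to_peer(op)
        except PeerTimeout:
            # Timeout => treat the peer as effectively partitioned.
            if self.prefer == "C":
                return "failed"            # choose consistency: reject the op
            # Choose availability: commit locally; partition mode has started.
            self.partition_mode = True
            self.committed.append(op)
            self.pending_log.append(op)
            return "committed locally"
        self.committed.append(op)
        return "committed"


def healthy_link(op):
    return None


def dead_link(op):
    raise PeerTimeout()


node_a = Node(dead_link, prefer="A")
node_c = Node(dead_link, prefer="C")
node_ok = Node(healthy_link, prefer="A")

result_a = node_a.update("x=1")
result_c = node_c.update("x=1")
result_ok = node_ok.update("x=1")
print(result_a, result_c, result_ok)
```

A retry loop around `send_to_peer` would only delay reaching the `except` branch, which is the slide's point that retrying postpones the decision rather than avoiding it.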

Life of a Partition

[Diagram: timeline of operations on a single state S]

• Serializable operations on state S
• Available (no partitions)

Life of a Partition

[Diagram: the partition starts; state S diverges into S1 and S2, one per side (partition mode)]

• Both sides available, locally linearizable … but (maybe) globally inconsistent
• No ACID “I”: concurrent ops on both sides
• No ACID “C” either (only local integrity checks)

Life of a Partition

[Diagram unchanged: partition mode, with states S1 and S2]

• Commit locally?
• Externalize output? (A says yes)
• Execute side effects? (launch missile?)

Life of a Partition

[Diagram: the partition ends; S1 and S2 must be reconciled into a new state S', marked “?”]

Need “Partition Recovery”
• Goal: restore consistency (ACID)
• Similar to traditional recovery
• Move to some self-consistent state
• Roll forward the “log” from each side

Partition Recovery

[Diagram unchanged: S1 and S2 must be reconciled into S']

1) Merge State (S’)
   • Easy: last writer wins
   • General: S’ = f(S1 log, S2 log) // the paths matter
2) Detect bad things that you did
   • Side effects? Incorrect response?
3) Compensate for bad actions
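The difference between the two merge options in step 1 can be shown with a small sketch. The log format `(timestamp, op, amount)` and both function names are assumptions for illustration; the replay function is one possible `f`, not the only one.

```python
def last_writer_wins(s1, s2):
    """Each argument is (value, timestamp); keep the most recently written value."""
    return max(s1, s2, key=lambda state: state[1])[0]


def replay_logs(base, log1, log2):
    """One possible S' = f(S1 log, S2 log): replay both logs over the
    pre-partition state in timestamp order."""
    value = base
    for _, op, amount in sorted(log1 + log2):
        if op == "add":
            value += amount
        elif op == "set":
            value = amount
        else:
            raise ValueError(f"unknown op: {op}")
    return value


base = 10                                  # state S before the partition

log1 = [(1, "add", 5)]                     # side 1 ends at S1 = 15
log2_add = [(2, "add", 3)]                 # side 2 ends at S2 = 13 by adding
log2_set = [(2, "set", 13)]                # side 2 ends at S2 = 13 by overwriting

lww = last_writer_wins((15, 1), (13, 2))   # looks only at final states
merged_add = replay_logs(base, log1, log2_add)
merged_set = replay_logs(base, log1, log2_set)
print(lww, merged_add, merged_set)
```

Both variants of side 2 reach the same final state (13), yet the replayed result differs, which is why the merge needs the logs and not just S1 and S2. Last writer wins silently drops side 1's increment.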

Partition Recovery

[Diagram unchanged: S1 and S2 must be reconciled into S']

Amazon shopping cart:
1) Merge by union of items
2) Only bad action is deleted item reappears
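A toy model of the union merge, with carts as plain Python sets (an assumption for this sketch; the item names are made up), shows both the merge and the one anomaly the slide mentions.

```python
def merge_carts(cart_a, cart_b):
    """Merge two diverged carts by taking the union of their items."""
    return cart_a | cart_b


before = {"book", "lamp"}          # cart contents when the partition starts

side_a = before - {"lamp"}         # on one side the customer deletes the lamp
side_b = before | {"pen"}          # on the other side the customer adds a pen

merged = merge_carts(side_a, side_b)

# Items that were deleted on side A but came back after the merge.
reappeared = (before - side_a) & merged
print(sorted(merged), sorted(reappeared))
```

No added item is ever lost with this merge; the only wrong outcome is that the deleted lamp returns, which the customer can fix by deleting it again.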

ATM “Stand In” Time
• ATMs have “partition mode”
  – … chooses A over C
  – Commutative atomic ops: incr, decr
  – When partition heals, the end balance is correct
• Partition recovery:
  – Detect: intermediate wrong decisions
  – Side effects (like “issue cash”) might be wrong
  – Exceptions are not commutative (below zero?)
  – Compensate via overdraft penalty
• Bound “wrongness” during partition: (less A)
  – Limit deficit to (say) $200
• When you remove $200, “decr” becomes unavailable

Define your “Partition Strategy”

1) Define detection (start Partition Mode)
2) Partition Mode operation:
   Determine which operations can proceed
   • Can depend on args/access level/state
   • Simple example: no updates, read only
   • ATM: withdrawal allowed only up to $200 total
3) Partition recovery
   • Detect problems via joint logs
   • Execute compensations
   • Every allowed op should have a compensation
   • Calculate merged state (last)
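One way to write such a strategy down is a policy table. This is a hypothetical sketch covering only step 2 and the rule that every allowed op should have a compensation; `OpPolicy`, the operation names, and the helper functions are all invented for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class OpPolicy:
    # Predicate on the op's arguments; None means "never allowed in partition mode".
    may_proceed: Optional[Callable[[dict], bool]] = None
    # Description of how to undo or make up for the op during recovery.
    compensation: Optional[str] = None


POLICIES = {
    "read": OpPolicy(may_proceed=lambda args: True,
                     compensation="none needed"),
    "withdraw": OpPolicy(may_proceed=lambda args: args["amount"] <= 200,
                         compensation="overdraft penalty if balance went negative"),
    "close_account": OpPolicy(),   # too risky: unavailable while partitioned
}


def can_proceed(policies, name, args):
    """Step 2: decide whether an operation may run in partition mode."""
    policy = policies[name]
    return policy.may_proceed is not None and policy.may_proceed(args)


def missing_compensations(policies):
    """Ops that can run during a partition but have no compensation defined."""
    return sorted(
        name for name, policy in policies.items()
        if policy.may_proceed is not None and policy.compensation is None
    )


small = can_proceed(POLICIES, "withdraw", {"amount": 150})
large = can_proceed(POLICIES, "withdraw", {"amount": 500})
closing = can_proceed(POLICIES, "close_account", {})

# Adding an allowed op without a compensation is flagged by the check.
incomplete = dict(POLICIES, transfer=OpPolicy(may_proceed=lambda args: True))
gaps_ok = missing_compensations(POLICIES)
gaps_bad = missing_compensations(incomplete)
print(small, large, closing, gaps_ok, gaps_bad)
```

Running `missing_compensations` at design time is a cheap way to enforce the rule before any partition actually happens.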

Compensation Happens
• Claim: Real world = weak consistency + delayed exceptions + compensation
  – Charge you twice => credit your account
  – Overbook an airplane => compensate passengers that miss out
• This concept is missing from wide-area data systems
  – Except for some workflow
• Compensating transactions can be human response
  – “We just realized we sent you two of the same item”
  – Should be logged just like any other xact
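The “charge you twice => credit your account” case can be sketched as a scan over a transaction log. The log format and the function name are assumptions for this example; the point is that the compensation is itself appended to the log as an ordinary entry.

```python
def compensate_duplicate_charges(log):
    """Return credit entries for every charge whose order was already charged."""
    seen_orders = set()
    credits = []
    for entry in log:
        if entry["type"] != "charge":
            continue
        if entry["order_id"] in seen_orders:
            credits.append({
                "type": "credit",
                "order_id": entry["order_id"],
                "amount": entry["amount"],
                "reason": "duplicate charge",
            })
        else:
            seen_orders.add(entry["order_id"])
    return credits


log = [
    {"type": "charge", "order_id": "A1", "amount": 30},
    {"type": "charge", "order_id": "A1", "amount": 30},   # charged twice
    {"type": "charge", "order_id": "B7", "amount": 12},
]

credits = compensate_duplicate_charges(log)
log.extend(credits)    # the compensation is logged like any other transaction

net_charged = sum(
    entry["amount"] if entry["type"] == "charge" else -entry["amount"]
    for entry in log
)
print(len(credits), net_charged)
```

Nothing is erased: the wrong charge stays in the history, and the credit that follows it brings the net amount back to what it should have been.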

CAP 2010

[Figure: availability (0% to 100%) plotted against consistency (from eventual consistency to single-copy consistency). Labels in the plot: NoSQL, BASE, BigTable, Dynamo, Sherpa, ACID databases, transactions. Annotation: “CAP only disallows this area!”]

Summary
• Net effect of CAP:
  – Freedom to explore a wide diverse space
  – Merging of systems and DB approaches
• While there are no partitions:
  – Can have both A and C, and full ACID xact
• Choosing A => focus on partition recovery
  – Need a before, during, and after strategy
  – Delayed Exceptions seem promising
  – Applying the ideas of compensation is open
