0% found this document useful (0 votes)

21 views23 pages

Nosql KK

Uploaded by

dzwowt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views23 pages

Nosql KK

Uploaded by

dzwowt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

NoSQL Databases

An Overview
Dr. Kalpakis, Introduction to Data Science, Fall 2017

1
The need

2
Scaling Relational Databases
• Vertically (or up)
• Can be achieved by hardware upgrades (e.g., faster CPU, more memory, or
larger disks)
• Limited by the amount of CPU, RAM and disk that can be configured on a
single machine

• Horizontally (or out)

• Can be achieved by adding more machines
• Requires database sharding and probably replication
• Limited by the Read-to-Write ratio and communication overhead
• ACID requirements constrain scalability
3
Data Sharding
• Data is typically sharded (or striped) to allow for parallel accesses
• Amdahl’s Law gives the speedup due to sharding
• Real speedup is less due to communication overhead and workload
imbalance
Input data: A large file

Machine 1 Machine 2 Machine 3

Chunk1 of input data Chunk3 of input data Chunk5 of input data

Chunk2 of input data Chunk4 of input data Chunk5 of input data

E.g., parallel access to chunks 1, 3 and 5

4
Data Replication
• Replicating data across servers helps
• Avoid performance bottlenecks
• Avoid single point of failures
• Enhance scalability and availability

Main Server

Replicated Servers

5
Relational Databases & ACID
properties
• Execution of DB code blocks (aka transactions) ensure
• Atomicity: either all instructions or none of them are excuted
• Consistency: at the end, it leaves database in consistent state
• Isolation: oblivious to other concurrent manipulations of database
• Durability: upon completion, modifications to DB are permanent
• Consistency in distributed relational databases is often done using 2-
phase commit protocol (2PC)
• When sharding and replicating relational databases, ensuring
consistency is costly since real-life distributed systems are unreliable
• even worse, when network partitions
• AID are relatively easier to support in distributed systems
6
2-Phase Commit protocol (2PC)
Phase I: Voting
1. VOTE_REQUEST
2. VOTE_COMMIT
DB Server 1
Participant 1

DB Server 2
Coordinator Participant 2

Phase II: Commit

3. GLOBAL_COMMIT DB Server 3
Participant 3
4. LOCAL_COMMIT

7
The CAP Theorem
• “Of three properties of a shared data system: data consistency, system availability
and tolerance to network partitions, only two can be achieved at any given moment .”
• Conjectured by Eric Brewer (2000) and proven by Nancy Lynch and Seth Gilbert (2002)
• “CAP prohibits only a tiny part of the design space: perfect availability and consistency in
the presence of partitions, which are rare.” (Eric Brewer, 2012)
• Consistency:
• All nodes should see the same data at the same time (strict consistency)
• Availability:
• Node failures do not prevent survivors from continuing to operate
• Partition-tolerance:
• The system continues to operate despite network partitions

• Necessary to decide between C and A for very large systems since almost certainly will
partition
8
Various Consistency types
• Strong Consistency
• any subsequent access after an update will return the same updated value.
• Eventual Consistency
• if no new updates are made, eventually all accesses will return the last updated value
• Read-your-writes
• Upon updating an item, a process never sees an older value
• Monotonic read consistency
• If a process has seen a particular value of an item, no process sees an older value
afterwards
• Monotonic write consistency
• serializes the writes by the same process

9
BASE antidote to ACID
• Basically Available: indicates that the system does guarantee
availability
• Soft state indicates that the state of the system may change
over time, even without input.
• Eventual consistency indicates that the system will become
consistent over time, when input ceases during that time.
• Most NoSQL databases relax ACID and adopt BASE

10
CAP and databases

11
Taxonomy of NoSQL (Not-only
SQL) databases
• Key-Value Stores
• Lookup a single value for a key
• Amazon’s DynamoDB
• Document Stores
• Access data by key or by search of “document” data.
• MongoDB
• CouchDB
• Column Stores
• Column-wise storage of tabular data
• Google’s BigTable
• Facebook’s Cassandra
• Graph Stores
• Native graph storage, efficient graph algorithms
• Neo4j
• Google’s Pregel
12
13
Key-Value Stores
DynamoDB Data Model

Mandatory Optional
Key-value access pattern Models 1:N relationships
Determines data distribution Enables rich queries

14
Column Stores

15
Document Stores

in JSON/BSON

16
MongoDB Architecture

17
Queries

18
Graph Stores
Graph Stores – neo4j

19
Prons/Cons of NoSQL
• Advantages :
• High elastic scalability
• Lower cost
• Schema flexibility, semi-structured data
• Disadvantages
• No standardization
• Less mature
• Limited query capabilities
• Programming with eventual consistent is counter-intuitive

20
21
NewSQL
• A DBMS that delivers the scalability and flexibility promised by NoSQL while
retaining the support for SQL queries and/or ACID, or to improve performance for
appropriate workloads.
Matt Aslett – “How Will The Database Incumbents Respond To NoSQL And NewSQL?”
https://www.451research.com/report-short?entityId=66963

Properties Traditional SQL NoSQL NewSQL

ACID Y N Y
• NewSQL databases have In-memory DB N Y Y
• SQL as the primary interface. Big Data N Y Y
RDBMS Y N Y
• ACID support for transactions
• Non-locking concurrency control.
• High per-node performance.
Parallel,
•Michael shared-nothing
Stonebraker- architecture.
“New SQL: An Alternative
http://cacm.acm.org/blogs/blog-cacm/109710
to NoSQL and Old SQL for New OLTP Apps”

22
23

Module 2.3
No ratings yet
Module 2.3
25 pages
Module 1
No ratings yet
Module 1
69 pages
CIS - 468 - 04 - NOSQL Databases and Big Data Storage Systems
No ratings yet
CIS - 468 - 04 - NOSQL Databases and Big Data Storage Systems
102 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
29 pages
Introduction to NoSQL Databases
No ratings yet
Introduction to NoSQL Databases
43 pages
Understanding NoSQL Databases and CAP Theorem
No ratings yet
Understanding NoSQL Databases and CAP Theorem
23 pages
NoSQL Database
No ratings yet
NoSQL Database
64 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
28 pages
Lec 24
No ratings yet
Lec 24
16 pages
4.NoSQL 1
No ratings yet
4.NoSQL 1
69 pages
2 - NoSQL
No ratings yet
2 - NoSQL
32 pages
Overview of NoSQL Databases and Concepts
No ratings yet
Overview of NoSQL Databases and Concepts
26 pages
No SQL
No ratings yet
No SQL
109 pages
NGD Unit 1-4
No ratings yet
NGD Unit 1-4
43 pages
NoSQL Databases for CSE Students
No ratings yet
NoSQL Databases for CSE Students
39 pages
NoSQL D
No ratings yet
NoSQL D
26 pages
Overview of NoSQL Database Systems
No ratings yet
Overview of NoSQL Database Systems
9 pages
Intro No SQL
No ratings yet
Intro No SQL
44 pages
NoSQL for Data Engineers
No ratings yet
NoSQL for Data Engineers
144 pages
2.1 Nosql
No ratings yet
2.1 Nosql
25 pages
HBase & NoSQL Database Insights
No ratings yet
HBase & NoSQL Database Insights
4 pages
Understanding CAP Theorem and NoSQL Databases
No ratings yet
Understanding CAP Theorem and NoSQL Databases
7 pages
No SQL
No ratings yet
No SQL
39 pages
Bda Module 3
No ratings yet
Bda Module 3
20 pages
Riak CS Latency in NoSQL Systems
No ratings yet
Riak CS Latency in NoSQL Systems
49 pages
No SQL
No ratings yet
No SQL
49 pages
Big Data Analysis
No ratings yet
Big Data Analysis
9 pages
Untitled Document
No ratings yet
Untitled Document
30 pages
BigData NoSQL
No ratings yet
BigData NoSQL
30 pages
Lecture 8 Chapter 5 Part 4 Big Data Storage Concepts
No ratings yet
Lecture 8 Chapter 5 Part 4 Big Data Storage Concepts
9 pages
Module 2
No ratings yet
Module 2
104 pages
NoSQL vs. Cloud Data Storage Systems
No ratings yet
NoSQL vs. Cloud Data Storage Systems
17 pages
Chap2 NoSQL
No ratings yet
Chap2 NoSQL
13 pages
CAP Theorem & NoSQL Database Overview
No ratings yet
CAP Theorem & NoSQL Database Overview
23 pages
BDS Session 10
No ratings yet
BDS Session 10
70 pages
Intro To NoSQL DBs
No ratings yet
Intro To NoSQL DBs
44 pages
Module 3
No ratings yet
Module 3
37 pages
Module 2
No ratings yet
Module 2
100 pages
NoSQL
No ratings yet
NoSQL
18 pages
BDS Session 5 - NoSQL DB
No ratings yet
BDS Session 5 - NoSQL DB
51 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
40 pages
Understanding NoSQL Databases and Their Types
No ratings yet
Understanding NoSQL Databases and Their Types
35 pages
NoSQL Databases: Features and Limitations
No ratings yet
NoSQL Databases: Features and Limitations
13 pages
Unit VI - 1
No ratings yet
Unit VI - 1
31 pages
NoSQL Databases
No ratings yet
NoSQL Databases
52 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
30 pages
BIG - DATA - Unit 4
No ratings yet
BIG - DATA - Unit 4
99 pages
Understanding NoSQL Databases Explained
No ratings yet
Understanding NoSQL Databases Explained
25 pages
8.4 NoSQL Database
No ratings yet
8.4 NoSQL Database
36 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
No ratings yet
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
42 pages
Module 3 NOSQL
No ratings yet
Module 3 NOSQL
69 pages
04 NoSQL
No ratings yet
04 NoSQL
126 pages
NoSQL Databases
No ratings yet
NoSQL Databases
20 pages
Database Management Systems: UNIT-5: Nosql Databases
No ratings yet
Database Management Systems: UNIT-5: Nosql Databases
39 pages
NoSQL Sharding and Replication Guide
No ratings yet
NoSQL Sharding and Replication Guide
28 pages
Module 5 - NoSQL Databases
No ratings yet
Module 5 - NoSQL Databases
33 pages
NoSQL Trends for IT Professionals
No ratings yet
NoSQL Trends for IT Professionals
26 pages
03 Database
No ratings yet
03 Database
14 pages
03 MongoDB Shell
No ratings yet
03 MongoDB Shell
6 pages
AI Innovations in Healthcare by Walorska
No ratings yet
AI Innovations in Healthcare by Walorska
31 pages
Aiinhealthcare 200426161429
No ratings yet
Aiinhealthcare 200426161429
21 pages
Hailort 4.18.0 User Guide
0% (1)
Hailort 4.18.0 User Guide
294 pages
Journal Article Mining Insights
No ratings yet
Journal Article Mining Insights
10 pages
Church Management System1
100% (1)
Church Management System1
29 pages
Al 0795 CSC 3 Mock Hihs 2025
No ratings yet
Al 0795 CSC 3 Mock Hihs 2025
6 pages
Spreadsheet
No ratings yet
Spreadsheet
12 pages
SLT Setup for SAP HANA Users
No ratings yet
SLT Setup for SAP HANA Users
13 pages
Introduction To Exploratory Data Analysis EDA
No ratings yet
Introduction To Exploratory Data Analysis EDA
10 pages
Attachment 1
No ratings yet
Attachment 1
7 pages
Chapter 1 - Fundamental of DB - Part 1
No ratings yet
Chapter 1 - Fundamental of DB - Part 1
40 pages
Apply Patching On Oracle 19c Database Release Update 19
100% (1)
Apply Patching On Oracle 19c Database Release Update 19
14 pages
Modesdeco 2
No ratings yet
Modesdeco 2
3 pages
Data Analysis Fundamentals: A Beginner's Course
No ratings yet
Data Analysis Fundamentals: A Beginner's Course
4 pages
Data Analytics Glossary Guide
No ratings yet
Data Analytics Glossary Guide
4 pages
Druva White Paper
No ratings yet
Druva White Paper
2 pages
IBM C2090-424 Exam Questions and Answers
No ratings yet
IBM C2090-424 Exam Questions and Answers
28 pages
Unit-1: Ajay Kumar Assistant Professor Computer Scinece & Engineering
No ratings yet
Unit-1: Ajay Kumar Assistant Professor Computer Scinece & Engineering
52 pages
Spring and Springboot
No ratings yet
Spring and Springboot
12 pages
Oracle Analytics Cloud 2018 Associate 1Z0-936 - Quiz
No ratings yet
Oracle Analytics Cloud 2018 Associate 1Z0-936 - Quiz
23 pages
Juspay Interview Questions
No ratings yet
Juspay Interview Questions
14 pages
Information System Audit For Hospital Management Information System
No ratings yet
Information System Audit For Hospital Management Information System
4 pages
Database Systems Lab Manual - Updated Sep 2023
No ratings yet
Database Systems Lab Manual - Updated Sep 2023
103 pages
Exam SPLK-1002: IT Certification Guaranteed, The Easy Way!
100% (1)
Exam SPLK-1002: IT Certification Guaranteed, The Easy Way!
31 pages
Travel and Tourism Management System
No ratings yet
Travel and Tourism Management System
41 pages
BDA Notes
No ratings yet
BDA Notes
40 pages
Overview of Oracle Database Editions
No ratings yet
Overview of Oracle Database Editions
4 pages
SQL Database Detach, Restore, Create Guide
No ratings yet
SQL Database Detach, Restore, Create Guide
40 pages
SQL DML Lab for Computing Students
No ratings yet
SQL DML Lab for Computing Students
11 pages
Assignemnt 5
No ratings yet
Assignemnt 5
7 pages
Building Cloud-Based Applications With Python
No ratings yet
Building Cloud-Based Applications With Python
13 pages
Minor Project Aman
No ratings yet
Minor Project Aman
33 pages
ProjectReport 160104061
No ratings yet
ProjectReport 160104061
7 pages

Nosql KK

Uploaded by

Nosql KK

Uploaded by

NoSQL Databases

• Horizontally (or out)

Machine 1 Machine 2 Machine 3

Chunk2 of input data Chunk4 of input data Chunk5 of input data

E.g., parallel access to chunks 1, 3 and 5

Phase II: Commit

Properties Traditional SQL NoSQL NewSQL

You might also like