4
Most read
5
Most read
7
Most read
NOSQL DATABASES TYPES AND USES
VIEW POINT
Suvradeep Rudra
April’2014
Agenda
• The four categories of NoSQL databases
• When to Use NoSQL
• When NOT to use NoSQL
• Use cases NoSQL (Each Category)
Executive Summary
• A NoSQL database provides a mechanism for
storage and retrieval of data that is modeled in
means other than the tabular relations used in
relational databases. Motivations for this
approach include simplicity of design, horizontal
scaling and finer control over availability. The
data structure (e.g., tree, graph, key-value)
differs from the RDBMS, and therefore some
operations are faster in NoSQL and some in
RDBMS.
4 categories of NoSQL DB
• Key-values Stores
• Column Family Stores
• Document Databases
• Graph Databases
Key-values Stores
 Key valued stores are those types of NoSQL database that are scheme free, and also your
values stored as key i.e in one column you will be having a key “Name” and the value
would be “Zack” and in the second column it’s not necessary mean that you must have
the value of Name again you could store different kind of data in the same column in
different row, and also you could have more column in one row than previous or vice
versa, this is the most common kinds of NoSQL database that are currently in the market
and other kinds of NoSQL database are built upon the principle of this kinds of NoSQL
database and added some features on that.
 The Key-Value database is a very simple structure based on Amazon’s Dynamo DB. Data
is indexed and queried based on it’s key. Key-value stores provide consistent hashing so
they can scale incrementally as your data scales. They communicate node structure
through a gossip-based membership protocol to keep all the nodes synchronized. If you
are looking to scale very large sets of low complexity data, key-value stores are the best
option.
 Examples: Tokyo Cabinet/Tyrant, Redis, Voldemort, Oracle BDB, Amazon
SimpleDB, Riak
 Strengths: Fast lookups
 Weaknesses: Stored data has no schema
Column Family Stores
 These were created to store and process very large amounts of data distributed over many
machines. There are still keys but they point to multiple columns. The columns are
arranged by column family.
 These data stores are based on Google’s BigTable implementation. They may look
similar to relational databases on the surface but under the hood a lot has changed. A
column family database can have different columns on each row so is not relational and
doesn’t have what qualifies in an RDBMS as a table. The only key concepts in a column
family database are columns, column families and super columns. All you really need to
start with is a column family. Column families define how the data is structured on disk.
 A column by itself is just a key-value pair that exists in a column family. A super column
is like a catalogue or a collection of other columns except for other super columns.
 Column family databases are still extremely scalable but less-so than key-value stores.
However, they work better with more complex data sets.
 Examples: Cassandra, HBase
Document Databases
 These were inspired by Lotus Notes and are similar to key-value stores. The model is
basically versioned documents that are collections of other key-value collections. The
semi-structured documents are stored in formats like JSON.
 A document database is not a new idea. It was used to power one of the more
prominent communication platforms of the 90’s and still in service today, Lotus Notes
now called Lotus Domino. APIs for document DBs use Restful web services and JSON
for message structure making them easy to move data in and out.
 A document database has a fairly simple data model based on collections of key-value
pairs. A typical record in a document database would look like this:
• { “Subject”: “I like Plankton”
• “Author”: “Rusty”
• “PostedDate”: “5/23/2006″
• “Tags”: ["plankton", "baseball", "decisions"]
• “Body”: “I decided today that I don’t like baseball. I like plankton.” }
 Examples: CouchDB, MongoDb
 Strengths: Tolerant of incomplete data
 Weaknesses: Query performance, no standard query syntax
Graph Databases
 Instead of tables of rows and columns and the rigid structure of SQL, a flexible graph
model is used which, again, can scale across multiple machines. NoSQL databases do not
provide a high-level declarative query language like SQL to avoid overtime in
processing. Rather, querying these databases is data-model specific. Many of the NoSQL
platforms allow for RESTful interfaces to the data, while other offer query APIs.
 Graph databases take document databases to the extreme by introducing the concept of
type relationships between documents or nodes. The most common example is the
relationship between people on a social network such as Facebook.
 A graph database is a big dense network structure. While it could take an RDBMS hours
to sift through a huge linked list of people, a graph database uses sophisticated shortest
path algorithms to make data queries more efficient. Although slower than its other
NoSQL counterparts, a graph database can have the most complex structure of them all
and still traverse billions of nodes and relationships with light speed.
 Examples: Neo4J, InfoGrid, Infinite Graph
 Strengths: Graph algorithms e.g. shortest path,n degree relationships, etc.
 Weaknesses: Traverse the entire graph to achieve a definitive answer. Not easy to cluster
When is NoSQL a poor choice?
After spending so long extolling the benefits of the various NoSQL solutions, I would like to
point out at least one scenario where I haven’t seen a good NosQL solution for the RDBMS:
Reporting. One of the great things about RDBMS is that given the information that it already
have, it is very easy to massage the data into a lot of interesting forms. That is especially
important when you are trying to do things like give the user the ability to analyze the data
on their own, such as by providing the user with a report tool that allows them to query,
aggregate and manipulate the data to their heart’s content. While it is certainly possible to
produce reports on top of a NoSQL store, you wouldn’t be able to come close to the level of
flexibility that a RDMBS will offer. That is one of the major benefits of the RDBMS, its
flexibility. The NoSQL solutions will tend to outperform the RDBMS solution (as long as you
stay in the appropriate niche for each NoSQL solution) and they certainly have better
scalability story than the RDBMS, but for user driven reports, the RDBMS is still my tool of
choice
Suvradeep Rudra is a Sr. Data Architect and has more than 10
years of experience in Data Management. He held a number
of roles at Caritor Inc. (now NTT DATA), Oracle, Deloitte
Consulting. Experienced in building overall data strategy,
tapping value from data assets and capabilities and driving
value to the business. He has worked in various projects,
establishing and building data management solutions for
customers in the industries such as High Tech, Health
Insurance, Oil and Gas, Payments services and Banking. His
experience ranges from Data strategy, Product Strategy,
MDM, Business Intelligence and Analytics, Data Architecture
(Data Warehouse), Data Governance.
Suvradeep writes and speaks about Monetizing Company’s
Data and Technology trends.
He holds Masters in Computer Applications from University
of Madras, Chennai, India.
He can be reached via LinkedIn profile

More Related Content

PPTX
Non relational databases-no sql
PPTX
Sql vs NoSQL
PPSX
A Seminar on NoSQL Databases.
PPTX
NoSQL databases - An introduction
PPTX
PPTX
No SQL- The Future Of Data Storage
PPTX
NoSQL databases
Non relational databases-no sql
Sql vs NoSQL
A Seminar on NoSQL Databases.
NoSQL databases - An introduction
No SQL- The Future Of Data Storage
NoSQL databases

What's hot (20)

PPTX
introduction to NOSQL Database
PPTX
Introduction to NoSQL
PPT
9. Document Oriented Databases
PDF
NoSQL databases
ZIP
NoSQL databases
PDF
The CAP Theorem
PPTX
Key-Value NoSQL Database
PPTX
Introduction to NoSQL Databases
PDF
NOSQL- Presentation on NoSQL
PPSX
Parallel Database
PDF
Nosql data models
PPTX
Introduction to Apache Hadoop Eco-System
PPT
7. Key-Value Databases: In Depth
PPT
Map reduce in BIG DATA
PPTX
Introduction to HiveQL
PPTX
Big Data Open Source Technologies
PPTX
Introduction to NOSQL databases
PPT
Hive(ppt)
PPTX
What is NoSQL and CAP Theorem
PPTX
PPT on Hadoop
introduction to NOSQL Database
Introduction to NoSQL
9. Document Oriented Databases
NoSQL databases
NoSQL databases
The CAP Theorem
Key-Value NoSQL Database
Introduction to NoSQL Databases
NOSQL- Presentation on NoSQL
Parallel Database
Nosql data models
Introduction to Apache Hadoop Eco-System
7. Key-Value Databases: In Depth
Map reduce in BIG DATA
Introduction to HiveQL
Big Data Open Source Technologies
Introduction to NOSQL databases
Hive(ppt)
What is NoSQL and CAP Theorem
PPT on Hadoop
Ad

Viewers also liked (20)

KEY
NoSQL Databases: Why, what and when
PPT
NoSQL databases pros and cons
PPTX
An Introduction To NoSQL & MongoDB
PDF
A Beginners Guide to noSQL
PDF
NoSQL, no SQL injections?
PPT
Common Entrepreneur Mistakes
PPTX
Kaplowitch equity dos and donts
PPT
Building A Stable, Fundable Startup
PPT
Employee equity incentives
PPT
Startups: Attracting and Retaining Talent (updated 3/6/13)
PDF
Guaranteed vs Incentive Pay - What's the Right Balance?
PDF
10 Movies Every Entrepreneur Should Watch
PPT
Succession Planning using Equity Incentive Plan and ESOPs
PDF
Negotiating as Consultant
PDF
Converting Employees to Owners: Employee Share Purchase Plans
PDF
Startup Equity - Startup summer camp, 2014
PPTX
ESOPs LEGAL & PROCEDURAL ASPECTS
ODP
How to Divide the Pie? Dynamic Equity Share by Mike Moyer
PPT
Raising Your Seed Round Financing: Should You Use Convertible Notes or Prefe...
NoSQL Databases: Why, what and when
NoSQL databases pros and cons
An Introduction To NoSQL & MongoDB
A Beginners Guide to noSQL
NoSQL, no SQL injections?
Common Entrepreneur Mistakes
Kaplowitch equity dos and donts
Building A Stable, Fundable Startup
Employee equity incentives
Startups: Attracting and Retaining Talent (updated 3/6/13)
Guaranteed vs Incentive Pay - What's the Right Balance?
10 Movies Every Entrepreneur Should Watch
Succession Planning using Equity Incentive Plan and ESOPs
Negotiating as Consultant
Converting Employees to Owners: Employee Share Purchase Plans
Startup Equity - Startup summer camp, 2014
ESOPs LEGAL & PROCEDURAL ASPECTS
How to Divide the Pie? Dynamic Equity Share by Mike Moyer
Raising Your Seed Round Financing: Should You Use Convertible Notes or Prefe...
Ad

Similar to NOSQL Databases types and Uses (20)

PPTX
Selecting best NoSQL
PPTX
unit2-ppt1.pptx
DOCX
NoSQL_Databases
PPTX
WEB_DATABASE_chapter_4.pptx
PPTX
NoSQL powerpoint presentation difference with rdbms
PPTX
No SQL DATABASE Description about 4 no sql database.pptx
PPTX
Choosing your NoSQL storage
DOCX
Report 1.0.docx
PDF
Brief introduction to NoSQL by fas mosleh
PPTX
An Intro to NoSQL Databases
PPTX
Relational databases vs Non-relational databases
PDF
PPT
No sql or Not only SQL
PPTX
2.Introduction to NOSQL (Core concepts).pptx
PDF
All About Database v1.1
PDF
The Rise of Nosql Databases
PPTX
NoSql - mayank singh
PDF
Introduction to NOSQL quadrants
PPTX
Module 2.2 Introduction to NoSQL Databases.pptx
PDF
NOsql Presentation.pdf
Selecting best NoSQL
unit2-ppt1.pptx
NoSQL_Databases
WEB_DATABASE_chapter_4.pptx
NoSQL powerpoint presentation difference with rdbms
No SQL DATABASE Description about 4 no sql database.pptx
Choosing your NoSQL storage
Report 1.0.docx
Brief introduction to NoSQL by fas mosleh
An Intro to NoSQL Databases
Relational databases vs Non-relational databases
No sql or Not only SQL
2.Introduction to NOSQL (Core concepts).pptx
All About Database v1.1
The Rise of Nosql Databases
NoSql - mayank singh
Introduction to NOSQL quadrants
Module 2.2 Introduction to NoSQL Databases.pptx
NOsql Presentation.pdf

More from Suvradeep Rudra (10)

DOCX
Cloud Strategies for Financial Firms : Migrating one step at a time
DOCX
Design patterns 101
PPTX
Data architecture around risk management
PPTX
Business intelligence vs business analytics
PPT
Big data analytics in politics voted yes
PPTX
Overview ppdm data_architecture_in_oil and gas_ industry
PPTX
Data Warehousing and BI - Recruitment POV
PPTX
Rise of Column Oriented Database
PDF
Where HADOOP fits in and challenges
PPTX
Dodd frank yahoo_10_14_2011_show
Cloud Strategies for Financial Firms : Migrating one step at a time
Design patterns 101
Data architecture around risk management
Business intelligence vs business analytics
Big data analytics in politics voted yes
Overview ppdm data_architecture_in_oil and gas_ industry
Data Warehousing and BI - Recruitment POV
Rise of Column Oriented Database
Where HADOOP fits in and challenges
Dodd frank yahoo_10_14_2011_show

Recently uploaded (20)

PDF
EGCB_Solar_Project_Presentation_and Finalcial Analysis.pdf
PDF
Uncertainty-aware contextual multi-armed bandits for recommendations in e-com...
PDF
The Digital Engine Room: Unlocking APAC’s Economic and Digital Potential thro...
PDF
Rooftops detection with YOLOv8 from aerial imagery and a brief review on roof...
PPTX
From XAI to XEE through Influence and Provenance.Controlling model fairness o...
PDF
FASHION-DRIVEN TEXTILES AS A CRYSTAL OF A NEW STREAM FOR STAKEHOLDER CAPITALI...
PDF
Advancements in abstractive text summarization: a deep learning approach
PPTX
Introduction-to-Artificial-Intelligence (1).pptx
PPT
Overviiew on Intellectual property right
PDF
1_Keynote_Breaking Barriers_한계를 넘어서_Charith Mendis.pdf
PDF
Revolutionizing recommendations a survey: a comprehensive exploration of mode...
PDF
Examining Bias in AI Generated News Content.pdf
PPTX
Slides World Game (s) Great Redesign Eco Economic Epochs.pptx
PDF
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
PDF
Peak of Data & AI Encore: Scalable Design & Infrastructure
PDF
ment.tech-How to Develop an AI Agent Healthcare App like Sully AI (1).pdf
PPTX
Presentation - Principles of Instructional Design.pptx
PDF
Altius execution marketplace concept.pdf
PDF
TrustArc Webinar - Data Minimization in Practice_ Reducing Risk, Enhancing Co...
PPTX
Rise of the Digital Control Grid Zeee Media and Hope and Tivon FTWProject.com
EGCB_Solar_Project_Presentation_and Finalcial Analysis.pdf
Uncertainty-aware contextual multi-armed bandits for recommendations in e-com...
The Digital Engine Room: Unlocking APAC’s Economic and Digital Potential thro...
Rooftops detection with YOLOv8 from aerial imagery and a brief review on roof...
From XAI to XEE through Influence and Provenance.Controlling model fairness o...
FASHION-DRIVEN TEXTILES AS A CRYSTAL OF A NEW STREAM FOR STAKEHOLDER CAPITALI...
Advancements in abstractive text summarization: a deep learning approach
Introduction-to-Artificial-Intelligence (1).pptx
Overviiew on Intellectual property right
1_Keynote_Breaking Barriers_한계를 넘어서_Charith Mendis.pdf
Revolutionizing recommendations a survey: a comprehensive exploration of mode...
Examining Bias in AI Generated News Content.pdf
Slides World Game (s) Great Redesign Eco Economic Epochs.pptx
CCUS-as-the-Missing-Link-to-Net-Zero_AksCurious.pdf
Peak of Data & AI Encore: Scalable Design & Infrastructure
ment.tech-How to Develop an AI Agent Healthcare App like Sully AI (1).pdf
Presentation - Principles of Instructional Design.pptx
Altius execution marketplace concept.pdf
TrustArc Webinar - Data Minimization in Practice_ Reducing Risk, Enhancing Co...
Rise of the Digital Control Grid Zeee Media and Hope and Tivon FTWProject.com

NOSQL Databases types and Uses

  • 1. NOSQL DATABASES TYPES AND USES VIEW POINT Suvradeep Rudra April’2014
  • 2. Agenda • The four categories of NoSQL databases • When to Use NoSQL • When NOT to use NoSQL • Use cases NoSQL (Each Category)
  • 3. Executive Summary • A NoSQL database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. Motivations for this approach include simplicity of design, horizontal scaling and finer control over availability. The data structure (e.g., tree, graph, key-value) differs from the RDBMS, and therefore some operations are faster in NoSQL and some in RDBMS.
  • 4. 4 categories of NoSQL DB • Key-values Stores • Column Family Stores • Document Databases • Graph Databases
  • 5. Key-values Stores  Key valued stores are those types of NoSQL database that are scheme free, and also your values stored as key i.e in one column you will be having a key “Name” and the value would be “Zack” and in the second column it’s not necessary mean that you must have the value of Name again you could store different kind of data in the same column in different row, and also you could have more column in one row than previous or vice versa, this is the most common kinds of NoSQL database that are currently in the market and other kinds of NoSQL database are built upon the principle of this kinds of NoSQL database and added some features on that.  The Key-Value database is a very simple structure based on Amazon’s Dynamo DB. Data is indexed and queried based on it’s key. Key-value stores provide consistent hashing so they can scale incrementally as your data scales. They communicate node structure through a gossip-based membership protocol to keep all the nodes synchronized. If you are looking to scale very large sets of low complexity data, key-value stores are the best option.  Examples: Tokyo Cabinet/Tyrant, Redis, Voldemort, Oracle BDB, Amazon SimpleDB, Riak  Strengths: Fast lookups  Weaknesses: Stored data has no schema
  • 6. Column Family Stores  These were created to store and process very large amounts of data distributed over many machines. There are still keys but they point to multiple columns. The columns are arranged by column family.  These data stores are based on Google’s BigTable implementation. They may look similar to relational databases on the surface but under the hood a lot has changed. A column family database can have different columns on each row so is not relational and doesn’t have what qualifies in an RDBMS as a table. The only key concepts in a column family database are columns, column families and super columns. All you really need to start with is a column family. Column families define how the data is structured on disk.  A column by itself is just a key-value pair that exists in a column family. A super column is like a catalogue or a collection of other columns except for other super columns.  Column family databases are still extremely scalable but less-so than key-value stores. However, they work better with more complex data sets.  Examples: Cassandra, HBase
  • 7. Document Databases  These were inspired by Lotus Notes and are similar to key-value stores. The model is basically versioned documents that are collections of other key-value collections. The semi-structured documents are stored in formats like JSON.  A document database is not a new idea. It was used to power one of the more prominent communication platforms of the 90’s and still in service today, Lotus Notes now called Lotus Domino. APIs for document DBs use Restful web services and JSON for message structure making them easy to move data in and out.  A document database has a fairly simple data model based on collections of key-value pairs. A typical record in a document database would look like this: • { “Subject”: “I like Plankton” • “Author”: “Rusty” • “PostedDate”: “5/23/2006″ • “Tags”: ["plankton", "baseball", "decisions"] • “Body”: “I decided today that I don’t like baseball. I like plankton.” }  Examples: CouchDB, MongoDb  Strengths: Tolerant of incomplete data  Weaknesses: Query performance, no standard query syntax
  • 8. Graph Databases  Instead of tables of rows and columns and the rigid structure of SQL, a flexible graph model is used which, again, can scale across multiple machines. NoSQL databases do not provide a high-level declarative query language like SQL to avoid overtime in processing. Rather, querying these databases is data-model specific. Many of the NoSQL platforms allow for RESTful interfaces to the data, while other offer query APIs.  Graph databases take document databases to the extreme by introducing the concept of type relationships between documents or nodes. The most common example is the relationship between people on a social network such as Facebook.  A graph database is a big dense network structure. While it could take an RDBMS hours to sift through a huge linked list of people, a graph database uses sophisticated shortest path algorithms to make data queries more efficient. Although slower than its other NoSQL counterparts, a graph database can have the most complex structure of them all and still traverse billions of nodes and relationships with light speed.  Examples: Neo4J, InfoGrid, Infinite Graph  Strengths: Graph algorithms e.g. shortest path,n degree relationships, etc.  Weaknesses: Traverse the entire graph to achieve a definitive answer. Not easy to cluster
  • 9. When is NoSQL a poor choice? After spending so long extolling the benefits of the various NoSQL solutions, I would like to point out at least one scenario where I haven’t seen a good NosQL solution for the RDBMS: Reporting. One of the great things about RDBMS is that given the information that it already have, it is very easy to massage the data into a lot of interesting forms. That is especially important when you are trying to do things like give the user the ability to analyze the data on their own, such as by providing the user with a report tool that allows them to query, aggregate and manipulate the data to their heart’s content. While it is certainly possible to produce reports on top of a NoSQL store, you wouldn’t be able to come close to the level of flexibility that a RDMBS will offer. That is one of the major benefits of the RDBMS, its flexibility. The NoSQL solutions will tend to outperform the RDBMS solution (as long as you stay in the appropriate niche for each NoSQL solution) and they certainly have better scalability story than the RDBMS, but for user driven reports, the RDBMS is still my tool of choice
  • 10. Suvradeep Rudra is a Sr. Data Architect and has more than 10 years of experience in Data Management. He held a number of roles at Caritor Inc. (now NTT DATA), Oracle, Deloitte Consulting. Experienced in building overall data strategy, tapping value from data assets and capabilities and driving value to the business. He has worked in various projects, establishing and building data management solutions for customers in the industries such as High Tech, Health Insurance, Oil and Gas, Payments services and Banking. His experience ranges from Data strategy, Product Strategy, MDM, Business Intelligence and Analytics, Data Architecture (Data Warehouse), Data Governance. Suvradeep writes and speaks about Monetizing Company’s Data and Technology trends. He holds Masters in Computer Applications from University of Madras, Chennai, India. He can be reached via LinkedIn profile