0% found this document useful (0 votes)

1K views22 pages

Distributed Database

A distributed database is a collection of interconnected databases located in different places that communicate through a computer network. It offers advantages like easier expandability and lower costs but also has disadvantages like increased complexity, expenses, and security issues. Distributed databases can be homogeneous, with identical systems across sites, or heterogeneous, with different systems. Data can be distributed through replication or fragmentation. Transaction processing must satisfy ACID properties to ensure consistency. Deadlocks can occur when multiple processes wait circularly for resources held by each other.

Uploaded by

Rosemelyne Wartde

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views22 pages

Distributed Database

Uploaded by

Rosemelyne Wartde

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

PRESENTATION

ON
DISTRIBUTED DATABASE
Submitted BY
Rosemelyne Wartde
MTech IT 1st Semester
Roll No- 20MTechIT02
What is a Distributed database?

 A collection of multiple interconnected databases,

which are spread physically across various locations that
communicate via a computer network.
Advantages of distributed database:

 it is easier to expand

 Can have data arranged according to different levels of

transparency

 It is cheaper to create a network of system containing a

part of the database

 Even if some of the data nodes go offline, the rest of the

database can continue its normal function.
Disadvantages of Distributed Database Systems

 It is quite complex

 It is more expensive

 It is difficult to provide security

 There can also be data redundancy in the

database.
Types of Distributed Databases

Distributed Database Environment

Homogeneous Heterogeneous

Non-
Autonomous Federated Multidatabase
Autonomous
Homogeneous Distributed Databases

 All the sites use identical DBMS and operating systems

Properties:

 The sites use very similar software.

 The sites use identical DBMS or DBMS from the same vendor.

 Each site is aware of all other sites and cooperates with other
sites to process user requests.

 The database is accessed through a single interface as if it is a

single database.
Types of Homogeneous Distributed Database

 Autonomous − Each database is independent that functions on

its own. They are integrated by a controlling application and
use message passing to share data updates.

 Non-autonomous − Data is distributed across the homogeneous

nodes and a central or master DBMS co-ordinates data updates
across the sites.
Heterogeneous Distributed Databases

 different sites have different operating systems, DBMS products and data
models.

Properties:

 Different sites use dissimilar schemas and software.

 The system may be composed of a variety of DBMSs like relational, network,

hierarchical or object oriented.

 Query processing is complex due to dissimilar schemas.

 Transaction processing is complex due to dissimilar software.

 A site may not be aware of other sites and so there is limited co-operation in
processing user requests.
Types of Heterogeneous Distributed Databases

 Federated − The heterogeneous database systems are

independent in nature and integrated together so that they
function as a single database system.

 Un-federated − The database systems employ a central

coordinating module through which the databases are accessed.
Distributed Data Storage
 Consider a relation r that is to be stored in the database. There are two
approaches to storing this relation in the distributed database:

 1. Data Replication- Data replication is the process of storing separate

copies of the database at two or more sites.

 2. Fragmentation- Fragmentation is the task of dividing a table into a set

of smaller tables.

 The subsets of the table are called fragments.

 types: horizontal, vertical, and hybrid

Horizontal Fragmentation:
 Divides a relation horizontally into the group of rows to create
subset of tables.

 Example:

 Account (Acc_No, Balance, Branch_Name). In this e.g if values are

inserted in table Branch_Name as Pune, Baroda, Delhi.

 The query can be written as:

 SELECT * FROM ACCOUNT

WHERE Branch_Name=“Baroda”
Vertical Fragmentation
 Divides a relation vertically into groups of column to create subsets
of tables.
Acc_No Balance Branch-
Name
A_101 50000 Pune
A-102 40000 Baroda

 Fragmentation1:

SELECT * FROM Acc_No

Fragmentation2:

SELECT * FROM Balance

Hybrid Fragmentation:
 a combination of horizontal and vertical fragmentation techniques
are used.

 Consider the following table which consist of employee information

Emp_ID Emp_Name Emp_Address Emp_Age Em_Salary

101 Raj Pune 37 15000

102 Maya Baroda 40 12000

 Fragmentation1:

SELECT * FROM Emp_Name where Emp_Age<40

 Fragmentation2:

SELECT * FROM Emp_Name where Emp_Address=‘Pune’ AND

Salary<14000
Distributed query processing
 It is the procedure of answering queries in a distributed
environment where data is managed at multiple site.
 Transformation a high level query into a query execution plan as
well as the execution of this plan
 The goal is to produce a plan which is equivalent to the original
query and efficient I,e to minimize resource consumption like
total cost or response time.
Transaction
 A transaction is a program including a collection of database
operations, executed as a logical unit of data processing.

 The operations performed like insert, delete, update or retrieve

data.

 Each high level operation can be divided into a number of low

level tasks or operations. For example, a data update operation
can be divided into three tasks −

 read_item()

 modify_item()

 write_item()
Transaction Operations
 The low level operations performed in a transaction are −

 begin_transaction

 read_item or write_item

 end_transaction

 commit − A signal to specify that the transaction has been successfully completed
in its entirety and will not be undone.

 rollback − A signal to specify that the transaction has been unsuccessful and so all
temporary changes in the database are undone. A committed transaction cannot
be rolled back.
Transaction States

 Active − The initial state where the transaction enters is the active state. The transaction
remains in this state while it is executing read, write or other operations.
 Partially Committed − The transaction enters this state after the last statement of the
transaction has been executed.
 Committed − The transaction enters this state after successful completion of the
transaction and system checks have issued commit signal.
 Failed − The transaction goes from partially committed state or active state to failed
state when it is discovered that normal execution can no longer proceed or system checks
fail.
 Aborted − This is the state after the transaction has been rolled back after failure and
the database has been restored to its state that was before the transaction began.
Desirable Properties of Transactions

ACID Properties:

 Atomicity − This property states that a transaction is an atomic unit of processing, that
is, either it is performed in its entirety or not performed at all. No partial update should
exist.

 Consistency − A transaction should take the database from one consistent state to
another consistent state. It should not adversely affect any data item in the database.

 Isolation − A transaction should be executed as if it is the only one in the system. There
should not be any interference from the other concurrent transactions that are
simultaneously running.

 Durability − If a committed transaction brings about a change, that change should be

durable in the database and not lost in case of any failure.
Deadlock
 What are Deadlocks?

 A deadlock occurs when two or more processes need some

resource to complete their execution that is held by the
other process.
Coffman Condition
 A deadlock will only occur if the four conditions hold true:

 1. Mutual Exclusion-There should be a resource that can only be held

by one process at a time.

 2. Hold and Wait-A process can hold multiple resources and still
request more resources from other processes which are holding them.
 3. No Preemption-A resource cannot be preempted from a process by force. A
process can only release a resource voluntarily.

 4. Circular Wait-A process is waiting for the resource held by the second
process, which is waiting for the resource held by the third process and so on,
till the last process is waiting for a resource held by the first process.
THANK YOU

Data - Structures (1-5) 2&16 Marks
No ratings yet
Data - Structures (1-5) 2&16 Marks
21 pages
DotNet Notes
No ratings yet
DotNet Notes
6 pages
Presentation On Tcp/Ip Reference Model
100% (2)
Presentation On Tcp/Ip Reference Model
12 pages
CHAPTER 03: Big Data Technology Landscape
No ratings yet
CHAPTER 03: Big Data Technology Landscape
81 pages
Synchronization in Java
No ratings yet
Synchronization in Java
23 pages
Synchronization in Java
No ratings yet
Synchronization in Java
13 pages
2marks and Questions CS8392
No ratings yet
2marks and Questions CS8392
21 pages
Notifications and Alarms
No ratings yet
Notifications and Alarms
15 pages
Multiprocessor and Multicomputers
No ratings yet
Multiprocessor and Multicomputers
5 pages
Attributes of Output Primitives
89% (9)
Attributes of Output Primitives
25 pages
DBDM Unit Four
No ratings yet
DBDM Unit Four
33 pages
7.assignment2 DAA Answers Dsatm PDF
No ratings yet
7.assignment2 DAA Answers Dsatm PDF
19 pages
Comprehensive FLAT Question Bank
100% (1)
Comprehensive FLAT Question Bank
13 pages
DataWarehouseMining Complete Notes
No ratings yet
DataWarehouseMining Complete Notes
55 pages
OS 2 Marks
100% (11)
OS 2 Marks
15 pages
Computer Programming PDF
No ratings yet
Computer Programming PDF
260 pages
Principles of Compiler Design
No ratings yet
Principles of Compiler Design
36 pages
1097 - File - JAVA - Notes (Unit 1)
No ratings yet
1097 - File - JAVA - Notes (Unit 1)
16 pages
Unit 4 Cloud Dr. Preeti Patil
100% (1)
Unit 4 Cloud Dr. Preeti Patil
81 pages
Structure of DBMS PDF
50% (4)
Structure of DBMS PDF
2 pages
MC4104 - Unit 1
No ratings yet
MC4104 - Unit 1
13 pages
BCS402 - Module 5 Edited
No ratings yet
BCS402 - Module 5 Edited
16 pages
PPL Unit 3
No ratings yet
PPL Unit 3
14 pages
Unit 1: Database Management System (DBMS) Historical Perspective
100% (1)
Unit 1: Database Management System (DBMS) Historical Perspective
30 pages
Qa - CD Unit-3
No ratings yet
Qa - CD Unit-3
8 pages
BScCSIT Transaction DBMS
No ratings yet
BScCSIT Transaction DBMS
30 pages
Exception Handling in Java PDF
No ratings yet
Exception Handling in Java PDF
1 page
UNIT 4 NOTES Oops
No ratings yet
UNIT 4 NOTES Oops
15 pages
Exception Handling and Multithreading
No ratings yet
Exception Handling and Multithreading
60 pages
OS Unit - 4 Notes
No ratings yet
OS Unit - 4 Notes
35 pages
Multiprocessor Configuration
100% (1)
Multiprocessor Configuration
7 pages
Transaction Management Unit III
No ratings yet
Transaction Management Unit III
28 pages
NLP Unit-1
No ratings yet
NLP Unit-1
12 pages
Operating Digital Notes (R22 Regulation)
No ratings yet
Operating Digital Notes (R22 Regulation)
156 pages
Ad3381 Set2
No ratings yet
Ad3381 Set2
4 pages
Classical IPC Problems
No ratings yet
Classical IPC Problems
15 pages
Unit Iii
No ratings yet
Unit Iii
15 pages
Chapter Eight: Building The E-Business Backbone: Enterprise Resource Planning
No ratings yet
Chapter Eight: Building The E-Business Backbone: Enterprise Resource Planning
22 pages
2marks For Pondicherry University
No ratings yet
2marks For Pondicherry University
45 pages
Unit 5
No ratings yet
Unit 5
22 pages
OS Viva Question
No ratings yet
OS Viva Question
6 pages
Macro Pass 1
No ratings yet
Macro Pass 1
11 pages
Vtu 7TH Sem Cse/ise Data Warehousing & Data Mining Notes 10cs755/10is74
94% (18)
Vtu 7TH Sem Cse/ise Data Warehousing & Data Mining Notes 10cs755/10is74
70 pages
Database Management Systems
No ratings yet
Database Management Systems
2 pages
CP7102-Advanced Datastructure and Algorithm Question Bank
No ratings yet
CP7102-Advanced Datastructure and Algorithm Question Bank
4 pages
Message Oriented Middleware (MOM)
No ratings yet
Message Oriented Middleware (MOM)
19 pages
Aos Unit-1 Notes
No ratings yet
Aos Unit-1 Notes
29 pages
AD3391 Database Design and Management Nov Dec 2022 Question Paper Download
No ratings yet
AD3391 Database Design and Management Nov Dec 2022 Question Paper Download
3 pages
ATCD Unit Wise Important Questions
No ratings yet
ATCD Unit Wise Important Questions
5 pages
Packages: Putting Classes Together
No ratings yet
Packages: Putting Classes Together
16 pages
Sonali DBMS Notes
100% (13)
Sonali DBMS Notes
61 pages
MAD Lab Manual
No ratings yet
MAD Lab Manual
43 pages
Distributed-Computing Notes
No ratings yet
Distributed-Computing Notes
108 pages
CS3451 OS UNIT 1 NOTES EduEngg
No ratings yet
CS3451 OS UNIT 1 NOTES EduEngg
34 pages
DDB Slides
No ratings yet
DDB Slides
30 pages
Distributed Database System And: Transaction-Processing
No ratings yet
Distributed Database System And: Transaction-Processing
21 pages
Adt Unitnotes 1to3
No ratings yet
Adt Unitnotes 1to3
107 pages
ADT Unit 1 To 5
No ratings yet
ADT Unit 1 To 5
160 pages
DB Unit-2
No ratings yet
DB Unit-2
27 pages
Distributed Databases and Client-Server Architectures
No ratings yet
Distributed Databases and Client-Server Architectures
60 pages
Assignment: Software Defined Networking (SDN)
No ratings yet
Assignment: Software Defined Networking (SDN)
12 pages
Counting Inversion
No ratings yet
Counting Inversion
8 pages
Backtracking Is A Systematic Way of Trying Out Different Sequences of Decisions Until We Find One That
No ratings yet
Backtracking Is A Systematic Way of Trying Out Different Sequences of Decisions Until We Find One That
19 pages
Closest Pair of Points
No ratings yet
Closest Pair of Points
12 pages
Asymptotic Analysis of Algorithms (Growth of Function)
No ratings yet
Asymptotic Analysis of Algorithms (Growth of Function)
8 pages
DBSCAN Presentation
No ratings yet
DBSCAN Presentation
10 pages
New Comers Technical Guide
No ratings yet
New Comers Technical Guide
15 pages
Mysql Replication & Cluster
100% (9)
Mysql Replication & Cluster
40 pages
Dbms Assignment 9
No ratings yet
Dbms Assignment 9
6 pages
Sandip Adhvaryu
No ratings yet
Sandip Adhvaryu
12 pages
38 - 013tv. Com X.T.R.E.A.M
No ratings yet
38 - 013tv. Com X.T.R.E.A.M
4 pages
Physical and Logical Structure of Active Directory
No ratings yet
Physical and Logical Structure of Active Directory
4 pages
Pawan Resume
No ratings yet
Pawan Resume
1 page
Module 5.1 - Association Rule Mining, Apriori Algorithm, Data Mining, Support, Confidence, Examples
100% (1)
Module 5.1 - Association Rule Mining, Apriori Algorithm, Data Mining, Support, Confidence, Examples
108 pages
Cec S 323 Classic Models Practices QL
No ratings yet
Cec S 323 Classic Models Practices QL
5 pages
DB Views in Django
No ratings yet
DB Views in Django
9 pages
Introduction To Spark
No ratings yet
Introduction To Spark
84 pages
PDF Article Metadata Harvester: Jurnal Komputer Dan Informatika
No ratings yet
PDF Article Metadata Harvester: Jurnal Komputer Dan Informatika
6 pages
DS in RME Using RM-Adi W
No ratings yet
DS in RME Using RM-Adi W
32 pages
CVS User Guide CVS User Guide: Page 1 of 27
No ratings yet
CVS User Guide CVS User Guide: Page 1 of 27
27 pages
Azure Capacity Planning - Google Search - 11
No ratings yet
Azure Capacity Planning - Google Search - 11
2 pages
SQLi
No ratings yet
SQLi
32 pages
Aecs Blockchain
No ratings yet
Aecs Blockchain
12 pages
Model QP For Mba
No ratings yet
Model QP For Mba
2 pages
Circular Queue
No ratings yet
Circular Queue
5 pages
Databases - (17 - 19)
No ratings yet
Databases - (17 - 19)
11 pages
Form 5 Computer Science
No ratings yet
Form 5 Computer Science
3 pages
Business Free Talk Lesson 6
No ratings yet
Business Free Talk Lesson 6
9 pages
Mahesh
No ratings yet
Mahesh
10 pages
Unit-7 Part-1 Working With Databases
No ratings yet
Unit-7 Part-1 Working With Databases
43 pages
Javacore 20190809 174230 7064 0002
No ratings yet
Javacore 20190809 174230 7064 0002
27 pages
Documentum Content Server 5.2.5 SP2 Installation
0% (1)
Documentum Content Server 5.2.5 SP2 Installation
27 pages
Deleted Files
No ratings yet
Deleted Files
82 pages
Idb Lab 2
No ratings yet
Idb Lab 2
8 pages
Hostel Management System
No ratings yet
Hostel Management System
6 pages
SQLServer Guide
No ratings yet
SQLServer Guide
179 pages

Distributed Database

Uploaded by

Distributed Database

Uploaded by

PRESENTATION

 A collection of multiple interconnected databases,

 Can have data arranged according to different levels of

 It is cheaper to create a network of system containing a

 Even if some of the data nodes go offline, the rest of the

 It is difficult to provide security

 There can also be data redundancy in the

Distributed Database Environment

 All the sites use identical DBMS and operating systems

 The sites use very similar software.

 The database is accessed through a single interface as if it is a

 Autonomous − Each database is independent that functions on

 Non-autonomous − Data is distributed across the homogeneous

 Different sites use dissimilar schemas and software.

 The system may be composed of a variety of DBMSs like relational, network,

 Query processing is complex due to dissimilar schemas.

 Transaction processing is complex due to dissimilar software.

 Federated − The heterogeneous database systems are

 Un-federated − The database systems employ a central

 1. Data Replication- Data replication is the process of storing separate

 2. Fragmentation- Fragmentation is the task of dividing a table into a set

 The subsets of the table are called fragments.

 types: horizontal, vertical, and hybrid

 Account (Acc_No, Balance, Branch_Name). In this e.g if values are

 The query can be written as:

 SELECT * FROM ACCOUNT

SELECT * FROM Acc_No

SELECT * FROM Balance

 Consider the following table which consist of employee information

Emp_ID Emp_Name Emp_Address Emp_Age Em_Salary

101 Raj Pune 37 15000

SELECT * FROM Emp_Name where Emp_Age<40

SELECT * FROM Emp_Name where Emp_Address=‘Pune’ AND

 The operations performed like insert, delete, update or retrieve

 Each high level operation can be divided into a number of low

 Durability − If a committed transaction brings about a change, that change should be

 A deadlock occurs when two or more processes need some

 1. Mutual Exclusion-There should be a resource that can only be held

You might also like