0% found this document useful (0 votes)

8 views

14-PhysicalAccess

Uploaded by

chamarilk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

14-PhysicalAccess

Uploaded by

chamarilk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 41

Database Management Systems

Physical Access to Data

DB
MG
1
DBMS Architecture

SQL INSTRUCTION

OPTIMIZER

CONCURRENCY CONTROL
MANAGEMENT OF ACCESS
METHODS

BUFFER MANAGER RELIABILITY MANAGEMENT

Index Files
System DATABASE
Catalog
Data Files

2
DB
MG
Physical Access Structures

Data may be stored on disk in different formats

to provide efficient query execution
Different formats are appropriate for different
query needs
Physical access structures describe how data is
stored on disk

3
DB
MG
Access Method Manager

Transforms an access plan generated by the

optimizer into a sequence of physical access
requests to (database) disk pages
It exploits access methods
An access method is a software module
It is specialized for a single physical data structure
It provides primitives for
reading data
writing data

4
DB
MG
Access method

Selects the appropriate blocks of a file to be

loaded in memory
Requests them to the Buffer Manager
Knows the organization of data into a page
can find specific tuples and values inside a page

5
DB
MG
Organization of a disk page

Different for different access methods

Divided in
Space available for data
Space reserved for access method control
information
Space reserved for file system control information

6
DB
MG
Remarks

Tuples may have varying size

Varchar types
Presence of Null values
A single tuple may span several pages
When its size is larger than a single page
e.g., for BLOB or CLOB data types

7
DB
MG
Database Management Systems

Physical Access Structures

DB
MG
8
Physical Access Structures

Physical access structures describe how data is

stored on disk to provide efficient query
execution
SQL select, update, …
In relational systems
Physical data storage
Sequential structures
Hash structures
Indexing to increase access efficiency
Tree structures (B-Tree, B+-Tree)
Unclustered hash index
Bitmap index 9
DB
MG
Sequential Structures

Tuples are stored in a given sequential order

Different types of structures implement different
ordering criteria
Available sequential structures
Heap file (entry sequenced)
Ordered sequential structure

10
DB
MG
Heap file

Tuples are sequenced in insertion order

insert is typically an append at the end of the file
All the space in a block is completely exploited
before starting a new block
Delete or update may cause wasted space
Tuple deletion may leave unused space
Updated tuple may not fit if new values have larger size
Sequential reading/writing is very efficient
Frequently used in relational DBMS
jointly with unclustered (secondary) indices to support
search and sort operations

11
DB
MG
Ordered sequential structures

The order in which tuples are written depends on

the value of a given key, called sort key
A sort key may contain one or more attributes
the sort key may be the primary key
Appropriate for
Sort and group by operations on the sort key
Search operations on the sort key
Join operations on the sort key
when sorting is used for join

12
DB
MG
Ordered sequential structures

Problem
preserving the sort order when inserting new
tuples
it may also hold for update
Solution
Leaving a percentage of free space in each block
during table creation
On insertion, dynamic (re)sorting in main memory of
tuples into a block
Alternative solution
Overflow file containing tuples which do not fit into
the correct block
13
DB
MG
Ordered sequential structures

Typically used with B+-Tree clustered (primary)

indices
the index key is the sort key
Used by the DBMS to store intermediate
operation results

14
DB
MG
Tree structures

Provide “direct” access to data based on the

value of a key field
The key includes one or more attributes
It does not constrain the physical position of
tuples
The most widespread in relational DBMS

15
DB
MG
General characteristics

One root node

16
DB
MG
Tree structure

17
DB
MG
General characteristics

One root node

Many intermediate nodes
Nodes have a large fan-out
Each node has many children

18
DB
MG
Tree structure

19
DB
MG
General characteristics

One root node

Many intermediate nodes
Nodes have a large fan-out
Each node has many children
Leaf nodes provide access to data
Clustered
Unclustered

20
DB
MG
Tree structure

DATA

21
DB
MG
B-Tree and B+-Tree

Two different tree structures for indexing

B-Tree
Data pages are reached only through key values by
visiting the tree
B+-Tree
Provides a link structure allowing sequential access
in the sort order of key values

22
DB
MG
B-Tree structure

DATA

23
DB
MG
B+-Tree structure

DATA

24
DB
MG
B-Tree and B+-Tree

Two different tree structures for indexing

B-Tree
Data pages are reached only through key values by
visiting the tree
B+-Tree
Provides a link structure allowing sequential access
on the sort order of key values
B stands for balanced
Leaves are all at the same distance from the root
Access time is constant, regardless of the searched
value
25
DB
MG
Clustered

The tuple is contained into the leaf node

Constrains the physical position of tuples in a
given leaf node
The position may be modified by splitting the node,
when it is full
Also called key sequenced
Typically used for primary key indexing

26
DB
MG
Clustered B+-Tree index

Data Data Data Data

27
DB
MG
Unclustered

The leaf contains physical pointers to actual data

The position of tuples in a file is totally
unconstrained
Also called indirect
Used for secondary indices

28
DB
MG
Unclustered B+-Tree index

Data

29
DB
MG
Example: Unclustered B+-Tree index
STUDENT (StudentId, Name, Grade)

12 78 Grade > 78
Grade < 12
12 <= Grade <= 78

19 56
12<=Grade < 19 56< Grade <=78

19 <= Grade <= 56

33 44
19 <= Grade < 33 44< Grade <= 56
33 <= Grade <= 44
LEAF
19 22 30 30 33 34 34 34 40 50

(T1) (T2 ) (T3) (T4) (T5) (T6 ) (T10) (T7) (T8) (T9)

T1 T6 T10 T2 T3 T5 T4 T7 T8 T9
19 34 34 22 30 33 30 34 40 50

DB
30
MG DATA FILE FOR STUDENT TABLE
Example: Clustered B+-Tree index
STUDENT (StudentId, Name, Grade)

12 78 Grade > 78
Grade < 12
12 <= Grade <= 78

19 56
12<=Grade < 19 56< Grade <=78

19 <= Grade <= 56

33 44
19 <= Grade < 33 44< Grade <= 56
33 <= Grade <= 44
LEAF

T1 T2 T3 T4 T5 T6 T10 T7 T8 T9
19 22 30 30 33 34 34 34 40 50

DATA FILE FOR STUDENT TABLE

DB
31
MG
Advantages and disadvantages

Advantages
Very efficient for range queries
Appropriate for sequential scan in the order of the
key field
Always for clustered, not guaranteed otherwise
Disadvantages
Insertions may require a split of a leaf
possibly, also of intermediate nodes
computationally intensive
Deletions may require merging uncrowded
nodes and re-balancing
32
DB
MG
Hash structure

It guarantees direct and efficient access to data

based on the value of a key field
The hash key may include one or more attributes
Suppose the hash structure has B blocks
The hash function is applied to the key field value
of a record
It returns a value between 0 and B-1 which defines
the position of the record
Blocks should never be completely filled
To allow new data insertion

33
DB
MG
Example: hash index
STUDENT (StudentId, Name, Grade)

BLOCK 0

TUPLE T1 H(StudentId =50)=1

StudentId = 50 T1 50
BLOCK 1 T4 75

TUPLE T4 H(StudentId =75)=1

StudentId = 75
BLOCK 2

DATA FILE FOR STUDENT TABLE

34
DB
MG
Hash index

Advantages
Very efficient for queries with equality predicate on
the key
No sorting of disk blocks is required
Disadvantages
Inefficient for range queries
Collisions may occur

35
DB
MG
Unclustered hash index

It guarantees direct and efficient access to data

based on the value of a key field
Similar to hash index
Blocks contain pointers to data
Actual data is stored in a separate structure
Position of tuples is not constrained to a block
Different from hash index

36
DB
MG
Example: Unclustered hash index
STUDENT (StudentId, Name, Grade)

BLOCK 0

TUPLE T1 T1 30
GRADE = 30 H(GRADE=30)=1
30 → T1
BLOCK 1 40 → T2

TUPLE T2 T2 40
GRADE = 40 H(GRADE=40)=1
BLOCK 2
DATA FILE FOR
STUDENT TABLE
INDEX BLOCKS

37
DB
MG
Bitmap index

It guarantees direct and efficient access to data

based on the value of a key field
It is based on a bit matrix
The bit matrix references data rows by means of
RIDs (Row IDentifiers)
Actual data is stored in a separate structure
Position of tuples is not constrained

38
DB
MG
Bitmap index

The bit matrix has

One column for each different value of the indexed
attribute
One row for each tuple
Position (i, j) of the matrix is
1 if tuple i takes value j RID Val1 Val2 … Valn
0 otherwise 1 0 0 … 1
2 0 0 … 0
3 0 0 … 1
4 1 0 … 0
5 0 1 … 0

39
DB
MG
Example: Bitmap index
EMPLOYEE (EmployeeId, Name, Job)

Domain of Job attribute = {Engineer, Consultant, Manager, Programmer, Secretary, Accountant}

RID Eng. Cons. Man. Prog. Secr. Acc.

1 0 0 1 0 0 0
2 0 0 0 1 0 0
3 0 0 0 0 1 0
4 0 0 0 1 0 0
5 1 0 0 0 0 0
Prog.
0 T2
1
0
1
0
T4

DATA FILE
FOR EMPLOYEE
TABLE
40
DB
MG
Bitmap index

Advantages
Very efficient for boolean expressions of predicates
Reduced to bit operations on bitmaps
Appropriate for attributes with limited domain
cardinality
Disadvantages
Not used for continuous attributes
Required space grows significantly with domain
cardinality

41
DB
MG

Grade 06 ICT 1st Term Test Paper 2023 English Medium Royal College
67% (3)
Grade 06 ICT 1st Term Test Paper 2023 English Medium Royal College
6 pages
Using BAPI in LSMW
No ratings yet
Using BAPI in LSMW
16 pages
DBMS - R18 UNIT 5 Notes
86% (7)
DBMS - R18 UNIT 5 Notes
23 pages
Unit 3 Storage Strategies Indices B-Trees Hashing
No ratings yet
Unit 3 Storage Strategies Indices B-Trees Hashing
12 pages
Storage System - RAID Levels
No ratings yet
Storage System - RAID Levels
53 pages
Indexing - DBMS
No ratings yet
Indexing - DBMS
20 pages
Unit 6 notes DBMS final
No ratings yet
Unit 6 notes DBMS final
14 pages
Indexing
No ratings yet
Indexing
6 pages
Unit 4 Chapter 1 Storage and Querying
No ratings yet
Unit 4 Chapter 1 Storage and Querying
37 pages
Unit5 File Organization
No ratings yet
Unit5 File Organization
112 pages
DBMS - Indexing: Dense Index
No ratings yet
DBMS - Indexing: Dense Index
5 pages
Primary Indexing
No ratings yet
Primary Indexing
7 pages
Unit_6
No ratings yet
Unit_6
38 pages
Indexing
No ratings yet
Indexing
10 pages
UNIT-IV - File Organization
No ratings yet
UNIT-IV - File Organization
10 pages
File Organization
No ratings yet
File Organization
11 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
Memoryhierarchy Indexing
No ratings yet
Memoryhierarchy Indexing
9 pages
Exam Notes COA
No ratings yet
Exam Notes COA
36 pages
02 Blocking - Addional
No ratings yet
02 Blocking - Addional
74 pages
Unit 3 - DBMS (Indexing, Hashing, B+-Tree)
No ratings yet
Unit 3 - DBMS (Indexing, Hashing, B+-Tree)
7 pages
Database Index PDF
No ratings yet
Database Index PDF
6 pages
CSE 544: Lecture 11 Storing Data, Indexes: Monday, 5/1/2006
No ratings yet
CSE 544: Lecture 11 Storing Data, Indexes: Monday, 5/1/2006
52 pages
Assignment (DS)
No ratings yet
Assignment (DS)
8 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
Dbms Indexing
No ratings yet
Dbms Indexing
3 pages
CIT 401 Lecture Note
No ratings yet
CIT 401 Lecture Note
46 pages
DBMS A1
No ratings yet
DBMS A1
10 pages
Unit Iv Indexing and Hashing: Basic Concepts
No ratings yet
Unit Iv Indexing and Hashing: Basic Concepts
35 pages
Unit 4 Index Structures For Files: Structure
No ratings yet
Unit 4 Index Structures For Files: Structure
16 pages
IT3020 L06 Indexing
No ratings yet
IT3020 L06 Indexing
41 pages
Unit-4 Hand Written
No ratings yet
Unit-4 Hand Written
35 pages
Indexing
No ratings yet
Indexing
6 pages
d-s-s-1
No ratings yet
d-s-s-1
6 pages
Lecture 17
No ratings yet
Lecture 17
24 pages
Indexing
No ratings yet
Indexing
6 pages
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
DBMS-U5 Notes
No ratings yet
DBMS-U5 Notes
16 pages
Chapter 11: Indexing and Storage: Modified From: Database System Concepts, 6 Ed
No ratings yet
Chapter 11: Indexing and Storage: Modified From: Database System Concepts, 6 Ed
53 pages
CH 3 Index
No ratings yet
CH 3 Index
40 pages
02 - Indices
No ratings yet
02 - Indices
208 pages
DBMS Indexing B - Tree To B Tree (197222, 197125, 197155)
No ratings yet
DBMS Indexing B - Tree To B Tree (197222, 197125, 197155)
41 pages
Dmbs New Slides Unit 2
No ratings yet
Dmbs New Slides Unit 2
28 pages
Unit-6 Storage Strategies
No ratings yet
Unit-6 Storage Strategies
43 pages
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
No ratings yet
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
44 pages
dbms 3 sem
No ratings yet
dbms 3 sem
31 pages
Java Merged
No ratings yet
Java Merged
291 pages
PPT-203105251-3
No ratings yet
PPT-203105251-3
35 pages
Unit Iv Implementation Techniques
No ratings yet
Unit Iv Implementation Techniques
91 pages
Dbms Notes
No ratings yet
Dbms Notes
21 pages
Co3 Session 21
No ratings yet
Co3 Session 21
53 pages
Unit 5
No ratings yet
Unit 5
20 pages
Unit 5
No ratings yet
Unit 5
185 pages
Indexing_Hashing_Files
No ratings yet
Indexing_Hashing_Files
68 pages
Dbms r18 Unit 5 Notes
No ratings yet
Dbms r18 Unit 5 Notes
24 pages
Indexing
No ratings yet
Indexing
62 pages
File Organization
No ratings yet
File Organization
41 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
file organization
No ratings yet
file organization
9 pages
DBMS MASTER: Become Pro in Database Management System
From Everand
DBMS MASTER: Become Pro in Database Management System
Ummed Singh
No ratings yet
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Databases: System Concepts, Designs, Management, and Implementation
From Everand
Databases: System Concepts, Designs, Management, and Implementation
Jonathan Rigdon
No ratings yet
19-DistributedDatabases
No ratings yet
19-DistributedDatabases
76 pages
MongoDB-for-Data-Science-seminar
No ratings yet
MongoDB-for-Data-Science-seminar
135 pages
20-ElasticSearch
No ratings yet
20-ElasticSearch
62 pages
15-QueryOptimization
No ratings yet
15-QueryOptimization
78 pages
DSTBD_oracle_hints-IT
No ratings yet
DSTBD_oracle_hints-IT
11 pages
Tutorial_DataMiningENG
No ratings yet
Tutorial_DataMiningENG
8 pages
18-Recovery
No ratings yet
18-Recovery
53 pages
Data_2
No ratings yet
Data_2
1 page
V6I5-0268
No ratings yet
V6I5-0268
7 pages
optimization
No ratings yet
optimization
4 pages
Answers_Assignment_B21_43
No ratings yet
Answers_Assignment_B21_43
7 pages
K-means clustering using RapidMiner
No ratings yet
K-means clustering using RapidMiner
10 pages
correct-validation-wp-final-v
No ratings yet
correct-validation-wp-final-v
26 pages
RapidMiner-Data-Science-Foundations-Course-Description
No ratings yet
RapidMiner-Data-Science-Foundations-Course-Description
2 pages
DSTBD_10-DMClassification-ENG
No ratings yet
DSTBD_10-DMClassification-ENG
160 pages
tutorial_rm5_prom6extension
No ratings yet
tutorial_rm5_prom6extension
20 pages
Bodhipooja Print
No ratings yet
Bodhipooja Print
21 pages
jovanovicetal.2014RapidMinerBook
No ratings yet
jovanovicetal.2014RapidMinerBook
17 pages
DSTBD_9-DMassrules
No ratings yet
DSTBD_9-DMassrules
98 pages
Writing
No ratings yet
Writing
4 pages
WMM Final Updated
No ratings yet
WMM Final Updated
11 pages
IQ
No ratings yet
IQ
7 pages
Nuwanethi Obata Senehebara Amathumak
No ratings yet
Nuwanethi Obata Senehebara Amathumak
40 pages
Integrating UEBA With Zero Trust
No ratings yet
Integrating UEBA With Zero Trust
23 pages
ChatGPT for Data Analytics Full Course
No ratings yet
ChatGPT for Data Analytics Full Course
3 pages
Persistent Staging Area: Purpose
No ratings yet
Persistent Staging Area: Purpose
2 pages
Lecture-7-Dynamic Programming Global-Sequence Alignment
No ratings yet
Lecture-7-Dynamic Programming Global-Sequence Alignment
31 pages
American Library Association Support For Libraries - AL
100% (1)
American Library Association Support For Libraries - AL
2 pages
BIM & CAD Solutions For The AEC Industry BIM & CAD Solutions For The AEC Industry
No ratings yet
BIM & CAD Solutions For The AEC Industry BIM & CAD Solutions For The AEC Industry
10 pages
Data storage and querying
No ratings yet
Data storage and querying
2 pages
MongoDB Sharding PDF
No ratings yet
MongoDB Sharding PDF
3 pages
CTI Course 2023 2024 Chapter 9
No ratings yet
CTI Course 2023 2024 Chapter 9
22 pages
Test I
No ratings yet
Test I
3 pages
7c of Ecommerce Britannia and ITC
No ratings yet
7c of Ecommerce Britannia and ITC
6 pages
Togaf Ea Methodology
No ratings yet
Togaf Ea Methodology
27 pages
Oracle.1Z0-083.v2024-11-04.q291
No ratings yet
Oracle.1Z0-083.v2024-11-04.q291
108 pages
CAD Phase-5
No ratings yet
CAD Phase-5
21 pages
Data Science
No ratings yet
Data Science
2 pages
Ateneo de Manila University
No ratings yet
Ateneo de Manila University
4 pages
Data Warehousing and Data Mining UNIT - 04: A Lazy Learner Simply Stores The Training Data and
No ratings yet
Data Warehousing and Data Mining UNIT - 04: A Lazy Learner Simply Stores The Training Data and
3 pages
Salesforce Certified AI Associate
No ratings yet
Salesforce Certified AI Associate
7 pages
ADBMS Practical - List - 2019 PDF
No ratings yet
ADBMS Practical - List - 2019 PDF
28 pages
Mgt1051 Business-Analytics-For-Engineers TH 1.1 47 Mgt1051
No ratings yet
Mgt1051 Business-Analytics-For-Engineers TH 1.1 47 Mgt1051
2 pages
OFBiz Framework
100% (1)
OFBiz Framework
1 page
SAP Data Architecture - NEW
100% (2)
SAP Data Architecture - NEW
74 pages
B8 Abstract Final
No ratings yet
B8 Abstract Final
4 pages
NBIMS-US V3 4.1 Introduction To IE Standards
No ratings yet
NBIMS-US V3 4.1 Introduction To IE Standards
2 pages
CH 2 Information Systems For Decision Making
100% (1)
CH 2 Information Systems For Decision Making
8 pages
IMS-CSET-201-Lab Assignment 2.4
No ratings yet
IMS-CSET-201-Lab Assignment 2.4
6 pages
Net - How To Connect Access Database in C# - Stack Overflow
No ratings yet
Net - How To Connect Access Database in C# - Stack Overflow
4 pages
Power BI - Data Modeling
100% (1)
Power BI - Data Modeling
17 pages
NSE5_FSM-6.3 Fortinet Exam Practice Questions
No ratings yet
NSE5_FSM-6.3 Fortinet Exam Practice Questions
5 pages

14-PhysicalAccess

Uploaded by

14-PhysicalAccess

Uploaded by

Database Management Systems

Physical Access to Data

BUFFER MANAGER RELIABILITY MANAGEMENT

Data may be stored on disk in different formats

Transforms an access plan generated by the

Selects the appropriate blocks of a file to be

Different for different access methods

Tuples may have varying size

Physical Access Structures

Physical access structures describe how data is

Tuples are stored in a given sequential order

Tuples are sequenced in insertion order

The order in which tuples are written depends on

Typically used with B+-Tree clustered (primary)

Provide “direct” access to data based on the

One root node

One root node

One root node

Two different tree structures for indexing

Two different tree structures for indexing

The tuple is contained into the leaf node

Data Data Data Data

The leaf contains physical pointers to actual data

19 <= Grade <= 56

19 <= Grade <= 56

DATA FILE FOR STUDENT TABLE

It guarantees direct and efficient access to data

TUPLE T1 H(StudentId =50)=1

TUPLE T4 H(StudentId =75)=1

DATA FILE FOR STUDENT TABLE

It guarantees direct and efficient access to data

It guarantees direct and efficient access to data

The bit matrix has

Domain of Job attribute = {Engineer, Consultant, Manager, Programmer, Secretary, Accountant}

RID Eng. Cons. Man. Prog. Secr. Acc.

You might also like