CO3 Notes Indexing

Indexing allows for quick retrieval of records from a database. There are different types of indexes like primary indexes, secondary indexes, clustered indexes, and multi-level indexes. Primary indexes contain a key field and pointer to each data block. Secondary indexes provide additional ways to access data through non-key fields. Clustered indexes group similar records together. Multi-level indexes use B-tree or B+-tree structures to allow efficient insertion and deletion while maintaining a balanced tree.

Uploaded by

Nani Yagneshwar

CO3

Indexing

An index for a file in a database system works in much the same way as the index in this
textbook. Database-system indices play the same role as book indices in libraries. For example,
to retrieve a student record given an ID, the database system would look up an index to find on
which disk block the corresponding record resides, and then fetch the disk block, to get the
appropriate student record.
Indexing is a data structure technique that lets you quickly retrieve records from a
database file. In its simplest form, an index is a small table with only two columns. The first
column holds a copy of the primary or candidate key of a table. The second column holds
pointers to the disk blocks where each key value is stored.

For a file with a given record structure consisting of several fields (or attributes), an index
access structure is usually defined on a single field of a file, called an indexing field (or
indexing attribute). The index typically stores each value of the index field along with a list
of pointers to all disk blocks that contain records with that field value. The values in the index
are ordered so that we can do a binary search on the index. Because the index file is typically
much smaller than the data file, a binary search on the index needs fewer block accesses than a
binary search on the ordered data file.
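A minimal sketch of a binary search over an ordered index, using Python's standard bisect module. The index contents, the block numbers and the lookup function are made-up illustrations, not taken from the notes.

```python
from bisect import bisect_left

# Hypothetical ordered index: (indexing-field value, block address) pairs.
index = [(101, 0), (105, 0), (110, 1), (120, 1), (133, 2)]
keys = [k for k, _ in index]  # kept sorted so binary search applies

def lookup(key):
    """Return the block address stored for key, or None if key is not indexed."""
    pos = bisect_left(keys, key)          # binary search on the index
    if pos < len(keys) and keys[pos] == key:
        return index[pos][1]
    return None

print(lookup(110))
print(lookup(111))
```

Only the small index is searched; the data file is touched once, to fetch the block the index points to.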
PRIMARY INDEXES
A primary index is an ordered file whose records are of fixed length with two fields, and it
acts like an access structure to efficiently search for and access the data records in a data file.
The first field is of the same data type as the ordering key field—called the primary key—of
the data file, and the second field is a pointer to a disk block (a block address). There is one
index entry (or index record) in the index file for each block in the data file. Each index entry
has the value of the primary key field for the first record in a block and a pointer to that block
as its two field values.
The primary index can be classified into two types: Dense index and Sparse index.
Dense index:
A dense index contains an index record for every search-key value in the data file, so any
record can be located from the index alone, which makes searching faster.
The number of records in the index table is therefore the same as the number of records in the
main table.
It needs more space to store the index records themselves. Each index record holds the search
key and a pointer to the actual record on the disk.

No. of records in the index table (IT) = No. of records in the data file on disk (HD)

Sparse Index: The total number of entries in the index is the same as the number of disk blocks
in the ordered data file. The first record in each block of the data file is called the anchor record
of the block, or simply the block anchor.
Each index entry has the value of the primary key field for the first record in a block and a
pointer to that block as its two field values. A binary search on the index file requires fewer
block accesses than a binary search on the data file.
A sparse index has index records for only some of the search-key values in the file, which
addresses the space cost of dense indexing. Each index entry covers a range of key values that
share the same data block address; to retrieve a record, the block address is fetched from the
index and that block is then searched for the record. A sparse index needs less space and less
maintenance overhead for insertions and deletions.

No. of records in the index table (IT) = No. of blocks in the data file on disk (HD)
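The sparse lookup can be sketched as follows: find the last block anchor that is less than or equal to the search key, then scan only that block. The blocks, the names in them and the find function are hypothetical examples.

```python
from bisect import bisect_right

# Hypothetical ordered data file: each inner list is one disk block of
# (primary key, record) pairs.
blocks = [
    [(1, "Ann"), (3, "Bob"), (4, "Cy")],
    [(7, "Dee"), (9, "Eli"), (12, "Fay")],
    [(15, "Gus"), (18, "Hal")],
]
# Sparse index: one entry per block, holding the block anchor's key.
anchors = [block[0][0] for block in blocks]

def find(key):
    """Return the record for key, or None if it is not in the file."""
    b = bisect_right(anchors, key) - 1    # last anchor <= key
    if b < 0:
        return None                       # key is smaller than every anchor
    for k, record in blocks[b]:           # scan inside the single chosen block
        if k == key:
            return record
    return None

print(find(9))
print(find(5))
```

The index here has three entries for eight records, which is the space saving the sparse scheme trades for the extra scan inside the block.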

CLUSTERED INDEXES
Clustering index is defined on an ordered data file. The data file is ordered on a non-key field.
The index is created on non-primary key columns which may not be unique for each record. In
such cases, in order to identify the records faster, we will group two or more columns together
to get the unique values and create index out of them. This method is known as the clustering
index.
Basically, records with similar characteristics are grouped together and indexes are created for
these groups. Cluster indexing reduces the cost of searching because multiple records related to
the same thing are stored in one place, and it also helps when several tables are frequently
joined.
Suppose a company contains several employees in each department. Suppose we use a
clustering index, where all employees which belong to the same Dept_ID are considered within
a single cluster, and index pointers point to the cluster as a whole. Here Dept_Id is a non-unique
key.
The previous scheme is a little confusing because one disk block may be shared by records that
belong to different clusters. Using a separate disk block for each cluster is considered a
better technique.
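A small sketch of the Dept_ID example, assuming the file is already ordered on Dept_ID. List positions stand in for disk addresses, and the records and the employees_in helper are invented for illustration.

```python
# Hypothetical employee file as (Emp_ID, Dept_ID), ordered on the non-key Dept_ID.
records = [(11, 1), (14, 1), (12, 2), (17, 2), (19, 2), (13, 3)]

# Clustering index: one entry per distinct Dept_ID, pointing at the
# position where that cluster starts.
cluster_index = {}
for position, (_, dept_id) in enumerate(records):
    cluster_index.setdefault(dept_id, position)

def employees_in(dept_id):
    """Return the Emp_IDs of every record in the given cluster."""
    start = cluster_index.get(dept_id)
    if start is None:
        return []
    result = []
    for emp_id, d in records[start:]:
        if d != dept_id:
            break                         # left the cluster; stop reading
        result.append(emp_id)
    return result

print(employees_in(2))
```

Because the file is ordered on Dept_ID, one index entry per department is enough to reach all of that department's records.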

SECONDARY INDEXES:
A secondary index provides a secondary means of accessing a data file for which some primary
access already exists. The data file records could be ordered, unordered, or hashed. The
secondary index may be created on a field that is a candidate key and has a unique value in
every record, or on a non-key field with duplicate values.
The index is again an ordered file with two fields. The first field is of the same data type as
some non-ordering field of the data file that is an indexing field. The second field is either a
block pointer or a record pointer. Many secondary indexes (and hence, indexing fields) can be
created for the same file—each represents an additional means of accessing that file based on
some specific field.
Unordered file with Key:

• The file already has a primary index on Eid (primary key).
• Now suppose a search is to be done using Pno.
• The data file is not ordered on Pno, and we cannot reorder it.
• So the index table keeps Pno as its key, always in sorted order.
• Because Pno is stored in order in the index, binary search can be applied for
faster searching.
• Every record needs its own index entry, so this is a type of dense indexing.
Unordered file with Non-Key:

• Now the search is to be done by Ename (non-key).
• The index file contains Ename as its key and is ordered.
• It maintains an intermediate index layer made up of blocks of record pointers.
• A pointer in the index table (IT) points to a particular block, and the record pointers in
that block point to the records in the data file (HD).
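The non-key case can be sketched like this. A dictionary of lists plays the role of the intermediate layer of record pointers, and list positions stand in for record addresses. The data and the function name are illustrative assumptions.

```python
from bisect import bisect_left

# Hypothetical data file of (Eid, Ename), not ordered on Ename.
data_file = [(3, "Ravi"), (1, "Asha"), (4, "Ravi"), (2, "Kiran"), (5, "Asha")]

# Intermediate layer: for each Ename, a block of record pointers.
pointer_blocks = {}
for record_pointer, (_, ename) in enumerate(data_file):
    pointer_blocks.setdefault(ename, []).append(record_pointer)

# Index table: distinct Ename values kept in sorted order.
index_keys = sorted(pointer_blocks)

def find_by_name(ename):
    """Return every record whose Ename matches."""
    pos = bisect_left(index_keys, ename)  # binary search on the index
    if pos == len(index_keys) or index_keys[pos] != ename:
        return []
    return [data_file[p] for p in pointer_blocks[ename]]

print(find_by_name("Ravi"))
```

The index stays one entry per distinct Ename even though the name repeats in the file; the duplicates are absorbed by the pointer blocks.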
MULTI-LEVEL INDEXES

The idea behind a multilevel index is to reduce the part of the index that we continue to search.
Because a single-level index is an ordered file, we can create a primary index to the index itself.
In this case, the original index file is called the first-level index and the index to the index is
called the second-level index. We can repeat the process, creating a third, fourth, ..., top level
until all entries of the top level fit in one disk block.
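To see how quickly the levels shrink, the sketch below counts the blocks at each level. It assumes every index block holds the same number of entries (called fan_out here), and the figures of 30,000 first-level entries and 68 entries per block are made-up inputs, not values from the notes.

```python
from math import ceil

def index_levels(first_level_entries, fan_out):
    """Return the number of blocks at each index level, first level first."""
    if fan_out < 2:
        raise ValueError("fan_out must be at least 2")
    blocks_per_level = []
    entries = first_level_entries
    while True:
        blocks = ceil(entries / fan_out)  # blocks needed for this level
        blocks_per_level.append(blocks)
        if blocks == 1:                   # top level fits in one block
            break
        entries = blocks                  # next level has one entry per block
    return blocks_per_level

levels_needed = index_levels(30000, 68)
print(levels_needed)                      # [442, 7, 1]
print(len(levels_needed) + 1)             # block accesses: one per level plus the data block
```

With these assumed numbers a search reads one block per index level and then the data block, four accesses in total.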

Dynamic Multilevel Indexes Using B-Trees and B+ Trees

As we have seen, a multilevel index reduces the number of blocks accessed when searching for
a record, given its indexing field value. We are still faced with the problems of dealing with
index insertions and deletions.

To retain the benefits of using multilevel indexing while reducing index insertion and deletion
problems, designers adopted a multilevel index called a dynamic multilevel index that leaves
some space in each of its blocks for inserting new entries and uses appropriate
insertion/deletion algorithms for creating and deleting new index blocks when the data file
grows and shrinks. It is implemented by using data structures called B-trees and B+-trees.

Trees: A tree is formed of nodes. Each node in the tree, except for a special node called the
root, has one parent node and zero or more child nodes. The root node has no parent. A node
that does not have any child nodes is called a leaf node; a nonleaf node is called an internal
node.
The level of a node is always one more than the level of its parent, with the level of the root
node being zero. Figure illustrates a tree data structure. In this figure the root node is A, and its
child nodes are B, C, and D. Nodes E, J, C, G, H, and K are leaf nodes. Since the leaf nodes
are at different levels of the tree, this tree is called unbalanced.

• Most multi-level indexes use B-tree or B+-tree data structures.

• These data structures are variations of search trees that allow efficient insertion and
deletion of new search values.
• Both B-Tree and B+-Tree are balanced.
• Elements are in sorted order.

B-TREE

The B-tree has additional constraints that ensure that the tree is always balanced and that the
space wasted by deletion, if any, never becomes excessive. The algorithms for insertion and
deletion, though, become more complex in order to maintain these constraints. Nonetheless,
most insertions and deletions are simple processes; they become complicated only when an
insertion targets a node that is already full or a deletion leaves a node less than half full.
A B-tree of order q (written p elsewhere in these notes) has the following properties:

• Order is the maximum number of children a node can have.
• The root node can have at most q children and, unless it is a leaf, at least 2 children.
• Every internal node except the root can have at most q children and at least ⌈q/2⌉
children.
• Every node contains at most q − 1 keys.
• All leaf nodes must be at the same level.

A B-tree starts with a single root node (which is also a leaf node) at level 0 (zero). Once the
root node is full with p − 1 search key values and we attempt to insert another entry in the tree,
the root node splits into two nodes at level 1. Only the middle value is kept in the root node,
and the rest of the values are split evenly between the other two nodes. When a nonroot node
is full and a new entry is inserted into it, that node is split into two nodes at the same level, and
the middle entry is moved to the parent node along with two pointers to the new split nodes. If
the parent node is full, it is also split. Splitting can propagate all the way to the root node,
creating a new level if the root is split.

Example:

1) Insert the values in order 8, 5, 1, 7, 3, 12, 9, 6 in a B-tree of order p = 3

2) The elements to be inserted are 8, 9, 10, 11, 15, 20, 17 in a B-tree of order p = 3
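As a rough sketch of the splitting procedure described above, the code below inserts the keys of example 1 into a B-tree of order 3. The Node class and the insert and levels helpers are illustrative names, not part of the notes. The sketch assumes distinct keys and splits a node after it overflows, promoting the middle key to the parent.

```python
from bisect import bisect_left, insort

class Node:
    def __init__(self, keys=None, children=None):
        self.keys = keys or []
        self.children = children or []    # empty list means a leaf

def _insert(node, key, order):
    """Insert key below node; return (middle_key, right_node) if node split."""
    if not node.children:
        insort(node.keys, key)            # leaf: place key in sorted position
    else:
        i = bisect_left(node.keys, key)   # choose the subtree to descend into
        split = _insert(node.children[i], key, order)
        if split:
            mid, right = split            # child split: absorb its middle key
            node.keys.insert(i, mid)
            node.children.insert(i + 1, right)
    if len(node.keys) <= order - 1:
        return None                       # node still within its key limit
    m = len(node.keys) // 2               # overflow: split around the middle key
    mid = node.keys[m]
    right = Node(node.keys[m + 1:], node.children[m + 1:])
    node.keys, node.children = node.keys[:m], node.children[:m + 1]
    return mid, right

def insert(root, key, order):
    """Insert key and return the (possibly new) root."""
    split = _insert(root, key, order)
    if split:
        mid, right = split
        return Node([mid], [root, right]) # root split: tree grows one level
    return root

def levels(root):
    """Return the keys of every node, grouped level by level."""
    out, frontier = [], [root]
    while frontier:
        out.append([n.keys for n in frontier])
        frontier = [c for n in frontier for c in n.children]
    return out

root = Node()
for k in [8, 5, 1, 7, 3, 12, 9, 6]:
    root = insert(root, k, 3)
print(levels(root))                       # [[[5, 8]], [[1, 3], [6, 7], [9, 12]]]
```

Under these assumptions, example 1 ends with the root [5, 8] above the leaves [1, 3], [6, 7] and [9, 12]. Other textbooks split before inserting or pick the middle key differently when a node holds an even number of keys, so intermediate shapes can vary.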

B+ TREE

Most implementations of a dynamic multilevel index use a variation of the B-tree data structure
called a B+-tree. In a B+-tree, data pointers are stored only at the leaf nodes of the tree; hence,
the structure of leaf nodes differs from the structure of internal nodes. The leaf nodes have an
entry for every value of the search field, along with a data pointer to the record (or to the block
that contains this record) if the search field is a key field.

• A B+ tree is an extension of the B-tree that allows efficient insertion, deletion and search
operations.
• In a B+ tree, records (data pointers) are stored only at the leaf nodes, while internal nodes
store only key values.
• The leaf nodes of a B+ tree are linked together in the form of a singly linked list to
make search queries more efficient.
The pointers in internal nodes are tree pointers to blocks that are tree nodes, whereas the
pointers in leaf nodes are data pointers to the data file records or blocks—except for the Pnext
pointer, which is a tree pointer to the next leaf node. By starting at the leftmost leaf node, it is
possible to traverse leaf nodes as a linked list, using the Pnext pointers. This provides ordered
access to the data records on the indexing field. A Pprevious pointer can also be included.
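The ordered traversal through Pnext can be sketched with a hand-built leaf chain. The Leaf class, the way keys are grouped into leaves and the "r6"-style record pointers are assumptions for illustration only; this is not the result of running a B+-tree insertion.

```python
class Leaf:
    def __init__(self, entries):
        self.entries = entries            # sorted (key, data pointer) pairs
        self.p_next = None                # Pnext: pointer to the next leaf

def link(leaves):
    """Chain the leaves left to right and return the leftmost one."""
    for left, right in zip(leaves, leaves[1:]):
        left.p_next = right
    return leaves[0]

def range_scan(first_leaf, low, high):
    """Collect entries with low <= key <= high by walking the leaf chain."""
    result = []
    leaf = first_leaf
    while leaf is not None:
        for key, pointer in leaf.entries:
            if key > high:
                return result             # keys are ordered, so we can stop
            if key >= low:
                result.append((key, pointer))
        leaf = leaf.p_next                # follow Pnext to the next leaf
    return result

first = link([
    Leaf([(6, "r6"), (16, "r16")]),
    Leaf([(26, "r26"), (36, "r36")]),
    Leaf([(46, "r46")]),
])
print(range_scan(first, 16, 36))
```

A real B+-tree would descend from the root to the leaf holding the low key instead of starting at the leftmost leaf; the walk along Pnext afterwards is the same.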

Example:

1) Insert the following key values 6, 16, 26, 36, 46 on a B+ tree with order = 3
2) The elements to be inserted are 5, 15, 25, 35, 45 on a B+ tree with order = 3
