0% found this document useful (0 votes)

9 views

Unit 4 Notes

The document discusses different types of indexing in databases. It defines indexing and describes how indexes are used to optimize database queries. The key types of indexing discussed are single-level, multi-level and static hashing indexes. Different attributes of indexing like ordering, data structure and partitioning are also explained.

Uploaded by

2902snehashinde

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Unit 4 Notes

Uploaded by

2902snehashinde

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

What is Indexing in DBMS?

Indexing is used to quickly retrieve particular data from the database. Formally we can define
Indexing as a technique that uses data structures to optimize the searching time of a database
query in DBMS. Indexing reduces the number of disks required to access a particular data by
internally creating an index table.

Indexing is achieved by creating Index-table or Index.

Index usually consists of two columns which are a key-value pair. The two columns of the index
table(i.e., the key-value pair) contain copies of selected columns of the tabular data of the
database.

Here, Search Key contains the copy of the Primary Key or the Candidate Key of the database
table. Generally, we store the selected Primary or Candidate keys in a sorted manner so that we
can reduce the overall query time or search time(from linear to binary).

Data Reference contains a set of pointers that holds the address of the disk block. The pointed
disk block contains the actual data referred to by the Search Key. Data Reference is also
called Block Pointer because it uses block-based addressing.

Indexing Attributes
Let's discuss the various indexing attributes:
Standard (B-tree) and Bitmap
B-tree-indexing is one of the most popular and commonly used indexing techniques. B-tree in
DBMS is a type of tree data structure that contains 2 things namely: Index Key and its
corresponding disk address. Index Key refers to a certain disk address and that disk further
contains rows or tuples of data.
On the other hand, Bitmap indexing uses strings to store the address of the tuples or rows. A
bitmap is a mapping from one system to the other such as integers to bits.

Bitmap has an advantage over B-tress as bitmap performs faster retrieval of certain data
(Bitmap is made according to a certain data, hence retrieves faster). Bitmaps are also more
compact than B-trees.

There is a drawback with bit mapping, bit mapping requires more overhead during tuple
operations on the table. Hence, bit maps are mainly used in data warehouse environments.

Example - We want to store this three-column table in the database.

Note: Oracle Database uses Bitmap and B-trees.
Ascending and Descending
As we have discussed above, columns of the index are stored in some sorted manner. Generally,
we store these Search Keys in ascending order. These sorted keys allow us to search data the
data fastly. We can change the sort order from ascending to descending or something different
according to the most frequent queries on the database.

Syntax

Lets see the syntax to store indexing in descending order-

CREATE INDEX index_name ON table-name (column-name_1, column-name_2 DESC);
By default Sorting Order:

● Character Data: Sorted by ASCII values of the characters.

● Numeric Data: Smallest to largest numbers.
● Date: Earliest date to the latest date.

Column and Functional:

Generally, we prepare the index table with certain column values of the actual database but
sometimes we can also use predefined SQL functions like UPPER() or LOWER() or MAX(), etc. to
prepare the Search Keys.

Example - We can convert all values in a column to uppercase and stored these results in the
index.

Syntax:
CREATE INDEX index-name
ON members(UPPER(target-column));
Note: The index table formed used columns values are also termed as Column Index or Column
Index-table.
Single-Column and Concatenated
We can create a single-column index table or multi-column index table. Concatenated indexes
are made according to certain WHERE clauses(WHERE clause related to the most frequent SQL
Queries), hence making the searching or data retrieval faster.

Example - Let us take an example of a multi-column index table:

We can use the primary key to create multiple index tables such as indexing based on year
(grouping years) or indexing based on model-name etc. This multi-table indexing will help in
getting specific query results faster.

Note: The multi-column is also termed a Concatenated Index.

Non-Partitioned and Partitioned
As we know index points to a certain table or block of data but sometimes the data itself is
partitioned in a certain manner, so we need to partition the index table as well. Generally, we
use the same table partition schema for the partition of the index table which is known as
the Local Partition Index. We use the same schema so that the data retrieval speed is
maintained. However, we can also create our non-partitioned index. This is known as Global
Index of the partitioned table.

Example - Suppose we have a table namely a student table. If the student table is partitioned
according to the roll number(primary key) then the index table of the student table should be
partitioned according to roll number as well. This type of partition will help in the grouping of
similar data and faster query results.

Types of Indexes
According to the attributes defined above, we divide indexing into three types:

Single Level Indexing

It is somewhat like the index (or the table of contents) found in a book. Index of a book contains
topic names along with the page number similarly the index table of the database contains keys
and their corresponding block address.

Single Level Indexing is further divided into three categories:

1. Primary Indexing: The indexing or the index table created using Primary keys is known as
Primary Indexing. It is defined on ordered data. As the index is comprised of primary keys, they
are unique, not null, and possess one to one relationship with the data blocks.

Example:
Characteristics of Primary Indexing:

● Search Keys are unique.

● Search Keys are in sorted order.
● Search Keys cannot be null as it points to a block of data.
● Fast and Efficient Searching.

2. Secondary Indexing: It is a two-level indexing technique used to reduce the mapping

size of the primary index. The secondary index points to a certain location where the
data is to be found but the actual data is not sorted like in the primary indexing.
Secondary Indexing is also known as non-clustered Indexing.

Example:
Characteristics of Secondary Indexing:

● Search Keys are Candidate Keys.

● Search Keys are sorted but actual data may or may not be sorted.
● Requires more time than primary indexing.
● Search Keys cannot be null.
● Faster than clustered indexing but slower than primary indexing.

3. Cluster Indexing: Clustered Indexing is used when there are multiple related records
found at one place. It is defined on ordered data. The important thing to note here is
that the index table of clustered indexing is created using non-key values which may or
may not be unique. To achieve faster retrieval, we group columns having similar
characteristics. The indexes are created using these groups and this process is known
as Clustering Index.

Example:
Characteristics of Clustered Indexing:

● Search Keys are non-key values.

● Search Keys are sorted.
● Search Keys cannot be null.
● Search Keys may or may not be unique.
● Requires extra work to create indexing.

Ordered Indexing:
Ordered indexing is the traditional way of storing that gives fast retrieval. The indices are stored
in a sorted manner hence it is also known as ordered indices.

Ordered Indexing is further divided into two categories:

1. Dense Indexing: In dense indexing, the index table contains records for every search key value of
the database. This makes searching faster but requires a lot more space. It is like primary
indexing but contains a record for every search key.
Example:

2. Sparse Indexing: Sparse indexing consumes lesser space than dense indexing, but it is a
bit slower as well. We do not include a search key for every record despite that we store
a Search key that points to a block. The pointed block further contains a group of data.
Sometimes we have to perform double searching this makes sparse indexing a bit
slower.

Example:
Multi-Level Indexing
Since the index table is stored in the main memory, single-level indexing for a huge amount of
data requires a lot of memory space. Hence, multilevel indexing was introduced in which we
divide the main data block into smaller blocks. This makes the outer block of the index table
small enough to be stored in the main memory.

Example:
We use the B+ Tree data structure for multilevel indexing. The leaf nodes of the B+ tree contain
the actual data pointers. The leaf nodes are themselves in the form of a linked list. This linked
list representation helps in both sequential and random access.

B+ Tree Index Files

A B+ Tree Index is a multilevel index.

A B+ Tree is a rooted tree satisfying the following properties :

1. All paths from the root to leaf are equally long.

2. If a node isn’t a root or a leaf, it has between [n / 2] and ‘n’ children.
3. A leaf node has between [(n-1) / 2] and ‘n-1’ values.
Example- 1: Construct a B+ Tree for the following search key values,
{10, 20, 30, 40 }
where n = 3 ( n is number of pointers)

Example- 2: Construct a B+ Tree for the following search key values, Where n = 4.
{10, 30, 40, 50, 60, 70, 90 }

Now, Let’s Insert and Delete some elements into this tree.
Insert 25,75
When we insert an element, we add it on the next right node of the value lower than the
inserting element.

Delete 70

Here, when you delete any element. The element that has been deleted will be replaced with
the element on the right.

Static Hashing

Let K denote all the search-key values.

Let B represent the set of all bucket addresses.
A bucket is a unit of storage that contains some records.

Here, h is a ‘hash function’ from K to B.

‘Hash function’ is used to avoid ‘index structure’.

Bucket Overflow :
This will occur only in two ways.
1. Insufficient buckets.
2. Skew in distribution of records. Some buckets are given more records than others, so a bucket
can overflow even though the other buckets still have space. This situation is called ‘bucket
skew’.
Overflow Chaining :
The overflows of a given bucket are chained together in a linked list. This is called ‘Closed
Hashing’.
In ‘Open Hashing’, the set of buckets are fixed, and there are no overflow chains. Here, if a
bucket is full, the system inserts records in some other bucket in the initial set of buckets.

A hash index arranges the search keys, with their associated pointers, into a hash file
structure. In this, one applies a hash function on a search key to helping identify a bucket, and
store the key and its associated pointers in the bucket.

Example-10: Hash file organization of DEPT file using DName as key, where there are eight
departments.
Note: In case of hash functions, the hash function is of two types :
1. The distribution is uniform: The hash function assigns each bucket the same number of
search-key values from the set of all possible search-key values.

1. The distribution is random : In the average case, each bucket will have nearly the same number
of values assigned to it, regardless of the actual distribution of search-key values.

IBM Data Analyst Capstone Project
No ratings yet
IBM Data Analyst Capstone Project
25 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
Indexing
No ratings yet
Indexing
6 pages
Indexes
No ratings yet
Indexes
4 pages
Dbms Mod3
No ratings yet
Dbms Mod3
54 pages
Primary Indexing
No ratings yet
Primary Indexing
7 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
Unit 6 notes DBMS final
No ratings yet
Unit 6 notes DBMS final
14 pages
Lesson 4 - Indexing
No ratings yet
Lesson 4 - Indexing
6 pages
Module 12 - Managing Indexes
No ratings yet
Module 12 - Managing Indexes
19 pages
What Is Indexing?: Indexing Is A Data Structure Technique Which Allows You To Quickly Retrieve
100% (1)
What Is Indexing?: Indexing Is A Data Structure Technique Which Allows You To Quickly Retrieve
7 pages
M12 Indexing in DBMS
No ratings yet
M12 Indexing in DBMS
18 pages
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
Index and Hashing 2017 Combined
No ratings yet
Index and Hashing 2017 Combined
60 pages
CMP 312
No ratings yet
CMP 312
2 pages
Indexing
No ratings yet
Indexing
6 pages
sqlIndexes2
No ratings yet
sqlIndexes2
10 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
5 pages
Co3 Session 21
No ratings yet
Co3 Session 21
53 pages
CIT 401 Lecture Note
No ratings yet
CIT 401 Lecture Note
46 pages
Unit 3 Storage Strategies Indices B-Trees Hashing
No ratings yet
Unit 3 Storage Strategies Indices B-Trees Hashing
12 pages
Indexing - DBMS
No ratings yet
Indexing - DBMS
20 pages
Indexing_Hashing_Files
No ratings yet
Indexing_Hashing_Files
68 pages
Introduction to Indexing in Database Management Systems Print
No ratings yet
Introduction to Indexing in Database Management Systems Print
12 pages
Indexing in Database
No ratings yet
Indexing in Database
33 pages
Index Architecture: Febriliyan Samopa
No ratings yet
Index Architecture: Febriliyan Samopa
110 pages
Indexing Lecture Nov 2023 Detailed
No ratings yet
Indexing Lecture Nov 2023 Detailed
37 pages
Indexes
No ratings yet
Indexes
70 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
4 pages
S - UNIT VII Indexing in Database
No ratings yet
S - UNIT VII Indexing in Database
9 pages
Unit -5 - part 2
No ratings yet
Unit -5 - part 2
33 pages
PPT-203105251-3
No ratings yet
PPT-203105251-3
35 pages
Indexing in Relational Databases
No ratings yet
Indexing in Relational Databases
2 pages
Hashing & Indexing Structures_ Single Level & Multi Level Indices
No ratings yet
Hashing & Indexing Structures_ Single Level & Multi Level Indices
1 page
File Organization and Indexing
No ratings yet
File Organization and Indexing
13 pages
Indexing
No ratings yet
Indexing
10 pages
11.2 Indexing
No ratings yet
11.2 Indexing
26 pages
Indexing
No ratings yet
Indexing
6 pages
Indexes in Database
100% (1)
Indexes in Database
38 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
6 pages
Database Indexing
No ratings yet
Database Indexing
4 pages
DBMS Unit-5
No ratings yet
DBMS Unit-5
33 pages
Indexing
No ratings yet
Indexing
11 pages
Index: Presented By-VISHAKHA CHANDRA (10030141082)
No ratings yet
Index: Presented By-VISHAKHA CHANDRA (10030141082)
29 pages
Lecture-13 Indexing and Its Types: Subject: DBMS Subject Code: BCA-S301T Faculty: Saurabh Jha
No ratings yet
Lecture-13 Indexing and Its Types: Subject: DBMS Subject Code: BCA-S301T Faculty: Saurabh Jha
16 pages
Unit Iv Indexing and Hashing: Basic Concepts
No ratings yet
Unit Iv Indexing and Hashing: Basic Concepts
35 pages
Unit-6 Storage Strategies
No ratings yet
Unit-6 Storage Strategies
43 pages
CO3 Notes Indexing
No ratings yet
CO3 Notes Indexing
11 pages
Indexing
No ratings yet
Indexing
6 pages
INDEXING
No ratings yet
INDEXING
10 pages
Dmbs New Slides Unit 2
No ratings yet
Dmbs New Slides Unit 2
28 pages
How Does Database Indexing Work
No ratings yet
How Does Database Indexing Work
4 pages
Taking Advantage of Indexes: How It Works
No ratings yet
Taking Advantage of Indexes: How It Works
7 pages
Chapter_3_File_Organization_Indexed_methods
No ratings yet
Chapter_3_File_Organization_Indexed_methods
31 pages
Unit5 File Organization
No ratings yet
Unit5 File Organization
112 pages
1 Indexing Techniques
No ratings yet
1 Indexing Techniques
30 pages
Indexing
No ratings yet
Indexing
8 pages
Dbms r18 Unit 5 Notes
No ratings yet
Dbms r18 Unit 5 Notes
24 pages
Chapter 11: Indexing and Hashing
No ratings yet
Chapter 11: Indexing and Hashing
47 pages
Data Structures and Algorithm
From Everand
Data Structures and Algorithm
Knowledge Flow
No ratings yet
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet
MCSL-223 Section 2 Data Mining Lab
No ratings yet
MCSL-223 Section 2 Data Mining Lab
55 pages
Data Warehouse and Mining Techmax - Compressed
No ratings yet
Data Warehouse and Mining Techmax - Compressed
429 pages
Final Yr Project PhishingAttack Ppt
No ratings yet
Final Yr Project PhishingAttack Ppt
12 pages
User Interface - The Features of A Computer System Which Allows The User To Interact With It
0% (1)
User Interface - The Features of A Computer System Which Allows The User To Interact With It
1 page
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
36 pages
5 Algoritma Klastering
No ratings yet
5 Algoritma Klastering
85 pages
(Ebooks PDF) Download Information Assurance For The Enterprise A Roadmap To Information Security 1st Edition Corey Schou Full Chapters
100% (12)
(Ebooks PDF) Download Information Assurance For The Enterprise A Roadmap To Information Security 1st Edition Corey Schou Full Chapters
84 pages
Printer Server X Med Print
No ratings yet
Printer Server X Med Print
16 pages
Introduction To GTP
No ratings yet
Introduction To GTP
2 pages
Dbms Lab Controlled
No ratings yet
Dbms Lab Controlled
100 pages
Althusser The Detour of Theory PDF
No ratings yet
Althusser The Detour of Theory PDF
112 pages
Business Objects Step by Step Tutorial
No ratings yet
Business Objects Step by Step Tutorial
27 pages
Student Clustering Based On Academic Using K-Means Algorithma
No ratings yet
Student Clustering Based On Academic Using K-Means Algorithma
34 pages
Document Managment System
No ratings yet
Document Managment System
12 pages
Multi Version Two-Phase Locking Protocol
100% (9)
Multi Version Two-Phase Locking Protocol
2 pages
Zen Munawar
No ratings yet
Zen Munawar
9 pages
Big Data Analytics Important Questions
No ratings yet
Big Data Analytics Important Questions
1 page
Unit 8 Lesson 8B: Name: - Class
No ratings yet
Unit 8 Lesson 8B: Name: - Class
1 page
Smart City Project Modules
0% (1)
Smart City Project Modules
3 pages
Best Practices For Deploying Full-Text Indexing
No ratings yet
Best Practices For Deploying Full-Text Indexing
30 pages
Unit 1 Mis Bba Notes
No ratings yet
Unit 1 Mis Bba Notes
18 pages
Tutorial análise Qiime2[3115]
No ratings yet
Tutorial análise Qiime2[3115]
3 pages
Ais CH4
No ratings yet
Ais CH4
3 pages
SQL Homework Sample
100% (1)
SQL Homework Sample
8 pages
Synopsis
No ratings yet
Synopsis
3 pages
OpenText Enterprise Library Services 10.1.0 Configuration and Scenario Guide
100% (1)
OpenText Enterprise Library Services 10.1.0 Configuration and Scenario Guide
127 pages
For Checking Secondary School Level - Form 1.1
No ratings yet
For Checking Secondary School Level - Form 1.1
164 pages
U1-notes
No ratings yet
U1-notes
9 pages
Best Practices For Machine Learning Operations (MLOps)
No ratings yet
Best Practices For Machine Learning Operations (MLOps)
1 page