0% found this document useful (0 votes)

71 views18 pages

Volcano Query Processing Model

The document discusses three main query evaluation processing models used in database management systems: the Iterator model, Vectorised model, and Materialisation model. The Iterator model processes queries in a top-down manner, emitting single tuples and allowing for composability of operators, while the Vectorised model processes batches of tuples, making it ideal for OLAP queries. The Materialisation model processes all input at once, emitting entire result sets and is better suited for OLTP workloads.

Uploaded by

p20232002567

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

71 views18 pages

Volcano Query Processing Model

Uploaded by

p20232002567

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Advanced Database Systems

Spring 2025

Lecture #12:
Query Evaluation: Processing Models

R&G: Chapter 14
2

P ROCESSING M ODEL
Processing model defines how the DBMS executes a query plan
Different trade-offs for different workloads

Three main approaches:

Iterator model
Vectorised (batch) model
Materialisation model
3

I TERATOR M ODEL
Each query plan operator implements three functions:
open() – initialise the operator’s internal state
next() – return either the next result tuple or a null marker if there are no more tuples
close() – clean up all allocated resources
Each operator instance maintains an internal state

Any operator can be input to any other (composability)

Since they all implement the same interface

Also called Volcano or Pipeline Model

Goetz Graefe. Volcano – An Extensible and Parallel Query Evaluation System. IEEE TKDE 1994
4

I TERATOR M ODEL
Top-down plan processing
The whole plan is initially reset by calling open() on the root operator

The open() call is forwarded through the plan by the operators themselves

Control returns to the query processor

The root is requested to produce its next() result record

Operators forward the next() request as needed. As soon as the next result record is
produced, control returns to the query processor again

Used in almost every DBMS

I TERATOR M ODEL
Query processor uses the following routine to evaluate a query plan

Function eval(q )
[Link]()
r = [Link]()
while r != EOF do
/* deliver record r (print, ship to DB client) */
emit(r )
r = [Link]()
/* resource deallocation now */
[Link]()

Output control (e.g., LIMIT) works easily with this model

E XAMPLE : S ELECTION σ p ( ON -THE - FLY )

A streaming operator: small amount of work per tuple

Predicate p stored in internal state

open() close()
[Link]() [Link]()

next()
while (r = [Link]()) != EOF do
if p(r) return r
return EOF
8

E XAMPLE : H EAP S CAN

Leaf of the query plan, often includes a selection predicate

open( )
heap = open heap file for this relation // file handle
cur_page = heap.first_page() // first page
cur_slot = cur_page.first_slot() // first slot on that page

next( )
if cur_page == NULL return EOF
current = tuple at (cur_page, cur_slot) // tuple to be returned
cur_slot = cur_slot.advance() // advance slot for subseq. calls
if cur_slot == NULL // advance to next page, first slot
cur_page = cur_page.advance()
if cur_page != NULL
cur_slot = cur_page.first_slot() close( )
return current [Link]()
9

E XAMPLE : N ESTED L OOPS J OIN

Volcano-style implementation of nested loops join R ⋈p S

open( ) next( )
left_child.open() while r != EOF do
right_child.open() while (s = right_child.next()) != EOF do
r = left_child.next() if p(r,s) return <r,s>
/* reset inner join input */
right_child.close()
close( ) right_child.open()
left_child.close() r = left_child.next()
right_child.close() return EOF
10

E XAMPLE : S ORT (2- PASS )

open( )
// first, all of pass 0, a blocking call
[Link]()
repeatedly call [Link]() and generate the sorted runs on disk, until child gives EOF
// second, set up for pass 1, assumes enough buffers to merge
open each sorted run file and load one page per run into input buffer for pass 1

next( ) // pass 1 merge (assumes enough buffers to merge)

output = min tuple across all buffers
if min tuple was last one in its buffer
fetch next page from that run into buffer
return output // (or EOF if no tuples remain)

close( )
deallocate the runs files
[Link]()
11

I TERATOR M ODEL
for t in [Link](): emit returns SELECT [Link], [Link]
emit(projection(t)) control to caller FROM R, S
WHERE [Link] = [Link]
AND [Link] > 100
for t1 in [Link]():
buildHashTable(t1)
for t2 in [Link]():
if probe(t2): emit(t1⋈ t2)
π [Link], [Link]

for t in [Link](): ⋈ [Link] = [Link]

if evalPred(t): emit(t)
σ value > 100
for t in R: for t in S:
emit(t) emit(t) R S
12

I TERATOR M ODEL
for t in [Link](): SELECT [Link], [Link]
1 emit(projection(t)) FROM R, S
WHERE [Link] = [Link]
AND [Link] > 100
for t1 in [Link]():
2 buildHashTable(t1)
for t2 in [Link]():
if probe(t2): emit(t1⋈ t2)
π [Link], [Link]

for t in [Link](): 4 ⋈ [Link] = [Link]

if evalPred(t): emit(t)
σ value > 100
3 for t in R: for t in S: 5
emit(t) emit(t) R S
13

I TERATOR M ODEL
Allows for tuple pipelining
The DBMS process a tuple through as many operators as possible
before having to retrieve the next tuple
Reduces memory requirements and response time since each chunk
of input is propagated to the output immediately

Some operators will block until children emit all of their tuples
E.g., sorting, hash join, grouping and duplicate elimination over
unsorted input, subqueries
The data is typically buffered (“materialised”) on disk
14

I TERATOR M ODEL
+ Nice & simple interface

+ Allows for easy combination of operators

– Next called for every single tuple & operator

– Virtual call via function pointer

Degrades branch prediction of modern CPUs

– Poor code locality and complex bookkeeping

Each operator keeps state to know where to resume
15

V ECTORISATION M ODEL
Like Iterator Model, each operator implements a next() function

Each operator emits a batch of tuples instead of a single tuple

The operator’s internal loop processes multiple tuples at a time
The size of the batch can vary based on hardware and query properties

Ideal for OLAP queries

Greatly reduces the number of invocations per operator
Operators can use vectorised (SIMD) instructions to process batches of tuples
16

V ECTORISATION M ODEL
out = { }
1 for t in [Link](): SELECT [Link], [Link]
[Link](projection(t))
if |out| > n: emit(out) FROM R, S
WHERE [Link] = [Link]
AND [Link] > 100
2 out = { }
for t1 in [Link]():
buildHashTable(t1)
for t2 in [Link]():
if probe(t2): [Link](t1⋈ t2)
π [Link], [Link]
if |out| > n: emit(out)

out = { }
for t in [Link](): 4
⋈ [Link] = [Link]

if evalPred(t): [Link](t)
if |out| > n: emit(out) σ value > 100
out = { } out = { }
3 5
for t in R:
[Link](t)
for t in S:
[Link](t) R S
if |out| > n: emit(out) if |out| > n: emit(out)
17

M ATERIALISATION M ODEL
Each operator processes its input all at once and then emits its output
The operator “materialises” its output as a single result

Bottom-up plan processing

Data not pulled by operators but pushed towards them
Leads to better code and data locality

Better for OLTP workloads

OLTP queries typically only access a small number of tuples at a time
Not good for OLAP queries with large intermediate results
18

M ATERIALISATION M ODEL
5 out = { } SELECT [Link], [Link]
for t in [Link]():
[Link](projection(t)) FROM R, S
WHERE [Link] = [Link]
AND [Link] > 100
4 out = { }
for t1 in [Link]():
buildHashTable(t1)
for t2 in [Link]():
if probe(t2): [Link](t1⋈ t2)
π [Link], [Link]

out = { }
for t in [Link](): 3
⋈ [Link] = [Link]

if evalPred(t): [Link](t)
σ value > 100
1 out = { } out = { }
2
for t in R:
[Link](t)
for t in S:
[Link](t)
R S
19

P ROCESSING M ODELS : S UMMARY

Iterator / Volcano
Direction: Top-Down
Emits: Single Tuple
Target: General Purpose

Vectorised Materialisation
Direction: Top-Down Direction: Bottom-Up
Emits: Tuple Batch Emits: Entire Tuple Set
Target: OLAP Target: OLTP

Query Execution Models in Databases
No ratings yet
Query Execution Models in Databases
56 pages
Efficiently Compiling Efficient Query Plans For Modern Hardware
No ratings yet
Efficiently Compiling Efficient Query Plans For Modern Hardware
12 pages
Pipelining vs Materialization in DBMS
No ratings yet
Pipelining vs Materialization in DBMS
27 pages
NEXMark: Relational Stream Processing
No ratings yet
NEXMark: Relational Stream Processing
40 pages
Riddhi Pandya's Engineering and Data Skills
No ratings yet
Riddhi Pandya's Engineering and Data Skills
11 pages
Spark SQL Execution Deep Dive
100% (2)
Spark SQL Execution Deep Dive
88 pages
Vectorization vs. Compilation in Queries
No ratings yet
Vectorization vs. Compilation in Queries
8 pages
Python and SQL Practical File Guide
No ratings yet
Python and SQL Practical File Guide
38 pages
Other Operations: A Query Is Essentially Treated As A Algebra C R Prc88'lon
No ratings yet
Other Operations: A Query Is Essentially Treated As A Algebra C R Prc88'lon
5 pages
Spark SQL: Relational Data Processing
No ratings yet
Spark SQL: Relational Data Processing
58 pages
Spark Structured Streaming Overview
No ratings yet
Spark Structured Streaming Overview
99 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
24 pages
Big Data Management and Architecture Guide
No ratings yet
Big Data Management and Architecture Guide
67 pages
ETL and Python Basics for Data Engineers
No ratings yet
ETL and Python Basics for Data Engineers
20 pages
Computer Science Concepts and Definitions
No ratings yet
Computer Science Concepts and Definitions
12 pages
Predicate Pushdown in Spark & Parquet
No ratings yet
Predicate Pushdown in Spark & Parquet
94 pages
Computer Science Project Overview 2024
No ratings yet
Computer Science Project Overview 2024
32 pages
CSSC Exam Practice Paper 1 Solutions
No ratings yet
CSSC Exam Practice Paper 1 Solutions
11 pages
Python Fundamentals Review Guide
No ratings yet
Python Fundamentals Review Guide
7 pages
Practical Record: Python & MySQL AISSCE
No ratings yet
Practical Record: Python & MySQL AISSCE
46 pages
Relational Operations with MapReduce
No ratings yet
Relational Operations with MapReduce
34 pages
Delta Lake: ACID Storage for Cloud Data
No ratings yet
Delta Lake: ACID Storage for Cloud Data
28 pages
Python Practical Record for Class XII
No ratings yet
Python Practical Record for Class XII
12 pages
Big Data Modeling in NoSQL Systems
No ratings yet
Big Data Modeling in NoSQL Systems
12 pages
Candidate Management System Project
No ratings yet
Candidate Management System Project
22 pages
Python Database Programming Guide
100% (1)
Python Database Programming Guide
17 pages
Python Database Programming Guide
100% (1)
Python Database Programming Guide
17 pages
Class XII Computer Science (083) - Comprehensive Exam Notes
No ratings yet
Class XII Computer Science (083) - Comprehensive Exam Notes
14 pages
Query Processing in Database Technologies
No ratings yet
Query Processing in Database Technologies
154 pages
Voter Management System Project Report
No ratings yet
Voter Management System Project Report
39 pages
Python Data Science Complete Reference
No ratings yet
Python Data Science Complete Reference
19 pages
Computer Science XII Answer Key 2025-2026
No ratings yet
Computer Science XII Answer Key 2025-2026
6 pages
Data Science Lab Record for B.Tech Students
No ratings yet
Data Science Lab Record for B.Tech Students
49 pages
DS Elaborative Question Bank Infosys
No ratings yet
DS Elaborative Question Bank Infosys
36 pages
Kolomvatsos K., Anagnostopoulos C., Hadjiefthymiades S., An Efficient Time Optimized Scheme For Progressive Analytics in Big Data", Big Data Research, Vol. 2, 2015, S. 155-165
No ratings yet
Kolomvatsos K., Anagnostopoulos C., Hadjiefthymiades S., An Efficient Time Optimized Scheme For Progressive Analytics in Big Data", Big Data Research, Vol. 2, 2015, S. 155-165
11 pages
Query Processing and Optimization Steps
No ratings yet
Query Processing and Optimization Steps
17 pages
Computer Science Revision For Boards
No ratings yet
Computer Science Revision For Boards
10 pages
Optimizing Continuous Traffic Queries
No ratings yet
Optimizing Continuous Traffic Queries
27 pages
Class 12 Aws Cs Pracfile 2526
No ratings yet
Class 12 Aws Cs Pracfile 2526
53 pages
Query Processing and Optimization Steps
No ratings yet
Query Processing and Optimization Steps
40 pages
In-Memory Computing with Spark Framework
No ratings yet
In-Memory Computing with Spark Framework
36 pages
DryadLINQ: High-Level Distributed Computing
No ratings yet
DryadLINQ: High-Level Distributed Computing
35 pages
DSL
No ratings yet
DSL
5 pages
GATE DA: Python DSA Revision Guide
No ratings yet
GATE DA: Python DSA Revision Guide
37 pages
Advanced Python Data Structures Guide
No ratings yet
Advanced Python Data Structures Guide
90 pages
Hive Vectorized Query Execution Design
No ratings yet
Hive Vectorized Query Execution Design
7 pages
Candidate Record Management Project
No ratings yet
Candidate Record Management Project
29 pages
Python Generators and Decorators Explained
No ratings yet
Python Generators and Decorators Explained
74 pages
Class 12 Computer Science Algorithms
No ratings yet
Class 12 Computer Science Algorithms
10 pages
Class 12 Python Project Report
No ratings yet
Class 12 Python Project Report
24 pages
Class 12 Python & SQL Revision Guide
No ratings yet
Class 12 Python & SQL Revision Guide
9 pages
Numpy Notes
No ratings yet
Numpy Notes
38 pages
Computer Science Marking Scheme 2020
No ratings yet
Computer Science Marking Scheme 2020
7 pages
Python Exception Handling and ADTs Guide
No ratings yet
Python Exception Handling and ADTs Guide
12 pages
Data Engineering Fundamentals and Lifecycle
No ratings yet
Data Engineering Fundamentals and Lifecycle
8 pages
Dbms
No ratings yet
Dbms
29 pages
Python Data Structures and MySQL Operations
No ratings yet
Python Data Structures and MySQL Operations
74 pages
User-Friendly Query Plan Explanations
No ratings yet
User-Friendly Query Plan Explanations
9 pages
Bit Manipulation Techniques in C++
No ratings yet
Bit Manipulation Techniques in C++
6 pages
Lecture 6 - Pointers and Strings 1
No ratings yet
Lecture 6 - Pointers and Strings 1
18 pages
Preprocessing in C++ Programming
No ratings yet
Preprocessing in C++ Programming
13 pages
Understanding Structures in C++ Programming
No ratings yet
Understanding Structures in C++ Programming
40 pages
Unions and Enumerations in C++
No ratings yet
Unions and Enumerations in C++
21 pages
ICT Course Overview and Evaluation
No ratings yet
ICT Course Overview and Evaluation
67 pages
Storage Models & Compression in Databases
No ratings yet
Storage Models & Compression in Databases
28 pages
Understanding SQL Joins and Algorithms
No ratings yet
Understanding SQL Joins and Algorithms
43 pages
Parallel Query Processing in DBMS
No ratings yet
Parallel Query Processing in DBMS
45 pages
Database Locking and Scheduling Techniques
No ratings yet
Database Locking and Scheduling Techniques
28 pages
Query Optimization in Database Systems
No ratings yet
Query Optimization in Database Systems
24 pages
Query Optimization in Databases
No ratings yet
Query Optimization in Databases
43 pages
Tree-Structured Indexing Techniques
No ratings yet
Tree-Structured Indexing Techniques
30 pages
Disk Space Management in DBMS
No ratings yet
Disk Space Management in DBMS
29 pages
18 M AI + Cloud Engineering Roadmap (Adjusted)
No ratings yet
18 M AI + Cloud Engineering Roadmap (Adjusted)
32 pages
SQL Queries for Employee and Movie Management
No ratings yet
SQL Queries for Employee and Movie Management
11 pages
SQL Schema and Procedures Overview
No ratings yet
SQL Schema and Procedures Overview
15 pages
Anomaly Detection in Online Banking
No ratings yet
Anomaly Detection in Online Banking
54 pages
Importance of Archival Materials in Libraries
No ratings yet
Importance of Archival Materials in Libraries
23 pages
Database Transaction Management Overview
No ratings yet
Database Transaction Management Overview
84 pages
Overview of IEC 82045 Standards
No ratings yet
Overview of IEC 82045 Standards
10 pages
DBAI Redo Files
No ratings yet
DBAI Redo Files
30 pages
Introduction to Geographic Information Systems
No ratings yet
Introduction to Geographic Information Systems
73 pages
A Neural Data Structure For Novelty Detection 10.1073@pnas.1814448115
No ratings yet
A Neural Data Structure For Novelty Detection 10.1073@pnas.1814448115
6 pages
Fixing Image Display in Protégé Ontology
No ratings yet
Fixing Image Display in Protégé Ontology
4 pages
AVEVA Engineering Documentation 15.7.3
No ratings yet
AVEVA Engineering Documentation 15.7.3
99 pages
Cloud-Based Blood Bank Management System
No ratings yet
Cloud-Based Blood Bank Management System
13 pages
Types and Processes of Decision Making
No ratings yet
Types and Processes of Decision Making
21 pages
XOR Cache: Enhancing Compression Efficiency
No ratings yet
XOR Cache: Enhancing Compression Efficiency
14 pages
Database System Architecture Overview
No ratings yet
Database System Architecture Overview
28 pages
End-to-End Data Engineering Projects
No ratings yet
End-to-End Data Engineering Projects
3 pages
Surface Utility Mapping in Barnawa, Kaduna
No ratings yet
Surface Utility Mapping in Barnawa, Kaduna
88 pages
MS Access Data Management Exercises
No ratings yet
MS Access Data Management Exercises
3 pages
GTN Database Update Instructions
No ratings yet
GTN Database Update Instructions
13 pages
2021 IT Diploma Assignment Guidelines
No ratings yet
2021 IT Diploma Assignment Guidelines
24 pages
Neo4j-GraphRAG + LLM + AI Cheatsheet
No ratings yet
Neo4j-GraphRAG + LLM + AI Cheatsheet
7 pages
React Admin Panel with ASP.NET Core
No ratings yet
React Admin Panel with ASP.NET Core
8 pages
Database Report Preparation Guide
No ratings yet
Database Report Preparation Guide
4 pages
Xii KV Mumbai Pre-Board Cs Qpms
No ratings yet
Xii KV Mumbai Pre-Board Cs Qpms
24 pages
B.Sc. Statistics Exam Paper 2018
No ratings yet
B.Sc. Statistics Exam Paper 2018
2 pages
Chapter 4 Part 1 (A) : Logical Database Design and The Relational Model
No ratings yet
Chapter 4 Part 1 (A) : Logical Database Design and The Relational Model
48 pages
Test Bank for Using MIS 9th Edition
No ratings yet
Test Bank for Using MIS 9th Edition
13 pages
Essential SQL Server Queries for Developers
No ratings yet
Essential SQL Server Queries for Developers
21 pages
Contoso Ltd. Quarterly Reporting Case Study
No ratings yet
Contoso Ltd. Quarterly Reporting Case Study
3 pages

Volcano Query Processing Model

Uploaded by

Volcano Query Processing Model

Uploaded by

Advanced Database Systems

Three main approaches:

Any operator can be input to any other (composability)

Also called Volcano or Pipeline Model

Control returns to the query processor

The root is requested to produce its next() result record

Used in almost every DBMS

Output control (e.g., LIMIT) works easily with this model

E XAMPLE : S ELECTION σ p ( ON -THE - FLY )

Predicate p stored in internal state

E XAMPLE : H EAP S CAN

E XAMPLE : N ESTED L OOPS J OIN

E XAMPLE : S ORT (2- PASS )

next( ) // pass 1 merge (assumes enough buffers to merge)

for t in [Link](): ⋈ [Link] = [Link]

for t in [Link](): 4 ⋈ [Link] = [Link]

+ Allows for easy combination of operators

– Next called for every single tuple & operator

– Virtual call via function pointer

– Poor code locality and complex bookkeeping

Each operator emits a batch of tuples instead of a single tuple

Ideal for OLAP queries

Bottom-up plan processing

Better for OLTP workloads

P ROCESSING M ODELS : S UMMARY

You might also like