SlideShare a Scribd company logo
QuestDB
Community Meetup #7
1
2022-07-28
QuestDB Team
Today’s agenda
2
Update about replication
1
2 AMA
3 Other news
Roadmap updates
3
Javier Ramirez, Developer Relations at QuestDB
@supercoco9 - @QuestDB
QuestDB
4
Recent developments
● Replication (WIP) <- today’s
focus!
● Partitions detach/attach
workflow (soon)
● CSV imports for unordered
data without size limitation
(soon)
What do we mean by replication
● Higher availability for reads, and higher throughput
○ Write to one QuestDB instance, propagate changes, and read
from many
● Higher availability and higher throughput for both reads and writes
(available only on QuestDB Cloud)
○ Write to many QuestDB instances, propagate changes, and
read from many
5
Current model for writing table data
● Data is received and kept in memory
● Once CommitLag is reached (or maxuncommitted rows, or table idle…), the
TableWriter writes data into the table files
● If Out-of-order rows are present, the operation will be costly, affecting reads
● Depending on how costly the writing operation is, ingestion performance will
also be affected
● Prone to Table Busy errors
6
First Step: Detaching ingestion and table writes
Implementing WAL (Write Ahead Log) (Work In Progress) 🚧
● Data (and any updates) is ingested to an append-only file (multiple WAL per table)
● If we take an empty DB and apply the changes in the WAL, we should get to the current
dataset
● WAL file is structured by columns and divided into segments
● Segments have sequential IDs and are tracked by a TableSequencer, who coordinates
and avoids conflicts
● Periodically, changes contained in the WAL are committed by the TableWritter
7
WAL benefits
● Enables replication
● Decouples ingestion and writing, allowing for future optimizations
● WAL will remove ingestion slow-down in the case of Out-of-order
data
● WAL will improve the TableBusy scenarios
8
Second step: WAL replication and read replicas
(Expected in a few months from now. Depends on WAL)
● Each WAL segment will automatically be written into Amazon S3
● On S3 notification, the TableSequencer will fetch the pending segments in
the right order and will use the TableWriter to apply changes locally
● The replicated tables will achieve eventual consistency
● The unit of replication will be the table
● Initially replication using only S3. If you want to contribute by making the
TableSequencer interact with segments in other Object Storages (cloud or
not), do get in touch
9
AMA
10
Javier, Developer Advocate
& QuestDB Team
Questions from the community
● Why does it take a while to see data after ingesting?
● Where can I find the Grafana plugin for QuestDB?
● What timestamp format do I need to use with ILP?
11
We’re hiring!
● Senior Backend Engineer (Python), QuestDB Cloud
● Cloud Engineer, QuestDB Cloud
Apply at: questdb.io/careers
12
Thank you!
13
@QuestDb QuestDB
Join Slack: slack.questdb.io
Star us: github.com/questdb/questdb
Clustering
14
Client Side
Library
Router Router Router
Raft
Primary node
Secondary
node(s)
Data Ingress
Load Balancer
Data Egress
Client Side
Library
E.g. ILP
E.g. REST API,
Postgres wire
Diagram

More Related Content

PPTX
AWS Redshift Introduction - Big Data Analytics
PPTX
Slashn Talk OLTP in Supply Chain - Handling Super-scale and Change Propagatio...
PPT
Introduction To Maxtable
PPTX
NoSQL Evolution
PDF
Webinar Slides: High Noon at AWS — Amazon RDS vs. Tungsten Clustering with My...
PPTX
Sql server 2019 New Features by Yevhen Nedaskivskyi
PDF
The Future of Fast Databases: Lessons from a Decade of QuestDB
PPTX
ClustrixDB: how distributed databases scale out
AWS Redshift Introduction - Big Data Analytics
Slashn Talk OLTP in Supply Chain - Handling Super-scale and Change Propagatio...
Introduction To Maxtable
NoSQL Evolution
Webinar Slides: High Noon at AWS — Amazon RDS vs. Tungsten Clustering with My...
Sql server 2019 New Features by Yevhen Nedaskivskyi
The Future of Fast Databases: Lessons from a Decade of QuestDB
ClustrixDB: how distributed databases scale out

Similar to QuestDB-Community-Call-20220728 (20)

PPTX
Oracle 23c New Features For DBAs and Developers.pptx
PPTX
Product Update: EDB Postgres Platform 2017
 
PPTX
adap-stability-202310.pptx
PDF
Data management in cloud study of existing systems and future opportunities
PDF
Cluster schedulerの紹介
PDF
20141206 4 q14_dataconference_i_am_your_db
PPT
Fudcon talk.ppt
PDF
Buytaert kris my_sql-pacemaker
PPTX
Megastore by Google
PPTX
MySQL 8.0 Featured for Developers
PDF
Chicago Kafka Meetup
PPT
Clustering van IT-componenten
PPTX
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
PDF
Running MySQL in AWS
PDF
NOSQL Meets Relational - The MySQL Ecosystem Gains More Flexibility
PDF
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #4: MS Azure Database MySQL
PPTX
ROLAP partitioning in MS SQL Server 2016
PPTX
Tech Talk Series, Part 2: Why is sharding not smart to do in MySQL?
PPTX
Cloud architectural patterns and Microsoft Azure tools
DOCX
Db2 Important questions to read
Oracle 23c New Features For DBAs and Developers.pptx
Product Update: EDB Postgres Platform 2017
 
adap-stability-202310.pptx
Data management in cloud study of existing systems and future opportunities
Cluster schedulerの紹介
20141206 4 q14_dataconference_i_am_your_db
Fudcon talk.ppt
Buytaert kris my_sql-pacemaker
Megastore by Google
MySQL 8.0 Featured for Developers
Chicago Kafka Meetup
Clustering van IT-componenten
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Running MySQL in AWS
NOSQL Meets Relational - The MySQL Ecosystem Gains More Flexibility
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #4: MS Azure Database MySQL
ROLAP partitioning in MS SQL Server 2016
Tech Talk Series, Part 2: Why is sharding not smart to do in MySQL?
Cloud architectural patterns and Microsoft Azure tools
Db2 Important questions to read
Ad

More from javier ramirez (20)

PDF
Cómo hemos implementado semántica de "Exactly Once" en nuestra base de datos ...
PDF
How We Added Replication to QuestDB - JonTheBeach
PDF
The Building Blocks of QuestDB, a Time Series Database
PDF
¿Se puede vivir del open source? T3chfest
PDF
QuestDB: The building blocks of a fast open-source time-series database
PDF
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
PDF
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
PDF
Deduplicating and analysing time-series data with Apache Beam and QuestDB
PDF
Your Database Cannot Do this (well)
PDF
Your Timestamps Deserve Better than a Generic Database
PDF
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
PDF
Processing and analysing streaming data with Python. Pycon Italy 2022
PDF
QuestDB: ingesting a million time series per second on a single instance. Big...
PDF
Servicios e infraestructura de AWS y la próxima región en Aragón
PPTX
Primeros pasos en desarrollo serverless
PDF
How AWS is reinventing the cloud
PDF
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
PDF
Getting started with streaming analytics
PDF
Getting started with streaming analytics: Setting up a pipeline
PDF
Getting started with streaming analytics: Deep Dive
Cómo hemos implementado semántica de "Exactly Once" en nuestra base de datos ...
How We Added Replication to QuestDB - JonTheBeach
The Building Blocks of QuestDB, a Time Series Database
¿Se puede vivir del open source? T3chfest
QuestDB: The building blocks of a fast open-source time-series database
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Deduplicating and analysing time-series data with Apache Beam and QuestDB
Your Database Cannot Do this (well)
Your Timestamps Deserve Better than a Generic Database
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Processing and analysing streaming data with Python. Pycon Italy 2022
QuestDB: ingesting a million time series per second on a single instance. Big...
Servicios e infraestructura de AWS y la próxima región en Aragón
Primeros pasos en desarrollo serverless
How AWS is reinventing the cloud
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
Getting started with streaming analytics
Getting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Deep Dive
Ad

Recently uploaded (20)

PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
IB Computer Science - Internal Assessment.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
STUDY DESIGN details- Lt Col Maksud (21).pptx
Fluorescence-microscope_Botany_detailed content
Major-Components-ofNKJNNKNKNKNKronment.pptx
Bharatiya Antariksh Hackathon 2025 Idea Submission PPT.pptx
Quality review (1)_presentation of this 21
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Supervised vs unsupervised machine learning algorithms
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx

QuestDB-Community-Call-20220728

  • 2. Today’s agenda 2 Update about replication 1 2 AMA 3 Other news
  • 3. Roadmap updates 3 Javier Ramirez, Developer Relations at QuestDB @supercoco9 - @QuestDB
  • 4. QuestDB 4 Recent developments ● Replication (WIP) <- today’s focus! ● Partitions detach/attach workflow (soon) ● CSV imports for unordered data without size limitation (soon)
  • 5. What do we mean by replication ● Higher availability for reads, and higher throughput ○ Write to one QuestDB instance, propagate changes, and read from many ● Higher availability and higher throughput for both reads and writes (available only on QuestDB Cloud) ○ Write to many QuestDB instances, propagate changes, and read from many 5
  • 6. Current model for writing table data ● Data is received and kept in memory ● Once CommitLag is reached (or maxuncommitted rows, or table idle…), the TableWriter writes data into the table files ● If Out-of-order rows are present, the operation will be costly, affecting reads ● Depending on how costly the writing operation is, ingestion performance will also be affected ● Prone to Table Busy errors 6
  • 7. First Step: Detaching ingestion and table writes Implementing WAL (Write Ahead Log) (Work In Progress) 🚧 ● Data (and any updates) is ingested to an append-only file (multiple WAL per table) ● If we take an empty DB and apply the changes in the WAL, we should get to the current dataset ● WAL file is structured by columns and divided into segments ● Segments have sequential IDs and are tracked by a TableSequencer, who coordinates and avoids conflicts ● Periodically, changes contained in the WAL are committed by the TableWritter 7
  • 8. WAL benefits ● Enables replication ● Decouples ingestion and writing, allowing for future optimizations ● WAL will remove ingestion slow-down in the case of Out-of-order data ● WAL will improve the TableBusy scenarios 8
  • 9. Second step: WAL replication and read replicas (Expected in a few months from now. Depends on WAL) ● Each WAL segment will automatically be written into Amazon S3 ● On S3 notification, the TableSequencer will fetch the pending segments in the right order and will use the TableWriter to apply changes locally ● The replicated tables will achieve eventual consistency ● The unit of replication will be the table ● Initially replication using only S3. If you want to contribute by making the TableSequencer interact with segments in other Object Storages (cloud or not), do get in touch 9
  • 11. Questions from the community ● Why does it take a while to see data after ingesting? ● Where can I find the Grafana plugin for QuestDB? ● What timestamp format do I need to use with ILP? 11
  • 12. We’re hiring! ● Senior Backend Engineer (Python), QuestDB Cloud ● Cloud Engineer, QuestDB Cloud Apply at: questdb.io/careers 12
  • 13. Thank you! 13 @QuestDb QuestDB Join Slack: slack.questdb.io Star us: github.com/questdb/questdb
  • 14. Clustering 14 Client Side Library Router Router Router Raft Primary node Secondary node(s) Data Ingress Load Balancer Data Egress Client Side Library E.g. ILP E.g. REST API, Postgres wire Diagram