DriftDB

Experimental PostgreSQL-Compatible Time-Travel Database (v0.9.0-alpha) - An ambitious temporal database project with advanced architectural designs for enterprise features. Query your data at any point in history using standard SQL.

⚠️ ALPHA SOFTWARE - NOT FOR PRODUCTION USE: This version contains experimental implementations of enterprise features. The codebase now compiles cleanly with minimal warnings (reduced from 335 to 17). Many advanced features remain as architectural designs requiring implementation.

🎮 Try the Interactive Demo

Experience DriftDB's time-travel capabilities right now!

cd demo
./run-demo.sh
# Opens at http://localhost:8080

Or simply open demo/index.html in your browser - no installation required!

The interactive demo features:

  • Visual time-travel slider to query data at any point in history
  • Real-time SQL editor with example queries
  • Multiple sample datasets (e-commerce, users, inventory)
  • No setup needed - runs entirely in your browser

📖 See demo documentation

🚀 Quick Start

# Start the PostgreSQL-compatible server
./target/release/driftdb-server --data-path ./data

# Connect with any PostgreSQL client
psql -h localhost -p 5433 -d driftdb

# Use standard SQL with time-travel
CREATE TABLE events (id INT PRIMARY KEY, data VARCHAR);
INSERT INTO events (id, data) VALUES (1, 'original');
UPDATE events SET data = 'modified' WHERE id = 1;

-- Query historical state!
SELECT * FROM events AS OF @seq:1;  -- Shows 'original'
SELECT * FROM events;                -- Shows 'modified'

✅ Working Features

Full SQL Support

  • All 5 standard JOIN types: INNER, LEFT, RIGHT, FULL OUTER, CROSS (including self-joins)
  • Subqueries: IN/NOT IN, EXISTS/NOT EXISTS (including correlated!), scalar subqueries
  • Common Table Expressions (CTEs): WITH clause including RECURSIVE CTEs
  • Transactions: BEGIN, COMMIT, ROLLBACK with ACID guarantees and savepoint support
  • Views: CREATE/DROP VIEW with persistence across restarts
  • DDL operations: CREATE TABLE, ALTER TABLE ADD COLUMN, CREATE INDEX, TRUNCATE
  • Aggregation functions: COUNT(*), COUNT(DISTINCT), SUM, AVG, MIN, MAX
  • GROUP BY and HAVING: Full support for grouping with aggregate filtering
  • CASE WHEN expressions: Conditional logic in queries
  • Set operations: UNION, INTERSECT, EXCEPT
  • Multi-row INSERT: INSERT INTO ... VALUES (row1), (row2), ...
  • Foreign key constraints: Referential integrity enforcement
  • Time-travel queries: AS OF for querying historical states

Core Database Engine

  • Event sourcing: Every change is an immutable event with full history
  • Time-travel queries: Query any historical state by sequence number
  • ACID compliance: Full transaction support with isolation levels
  • CRC32 verification: Data integrity on every frame
  • Append-only storage: Never lose data, perfect audit trail
  • JSON documents: Flexible schema with structured data

Tested & Verified

  • ✅ Python psycopg2 driver
  • ✅ Node.js pg driver
  • ✅ JDBC PostgreSQL driver
  • ✅ SQLAlchemy ORM
  • ✅ Any PostgreSQL client (see the sketch below)
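
Because the server speaks the native PostgreSQL wire protocol, the flow from the Quick Start works unchanged from any of the drivers above. A minimal sketch using psycopg2, assuming a local driftdb-server on port 5433 with the default driftdb/driftdb credentials from the Docker setup:

import psycopg2

# Connect exactly as you would to PostgreSQL; only the port differs.
conn = psycopg2.connect(host="127.0.0.1", port=5433,
                        dbname="driftdb", user="driftdb", password="driftdb")
conn.autocommit = True  # let each statement commit on its own
cur = conn.cursor()

cur.execute("CREATE TABLE events (id INT PRIMARY KEY, data VARCHAR)")
cur.execute("INSERT INTO events (id, data) VALUES (1, 'original')")
cur.execute("UPDATE events SET data = 'modified' WHERE id = 1")

cur.execute("SELECT * FROM events")                # current state
print(cur.fetchall())                              # [(1, 'modified')]
cur.execute("SELECT * FROM events AS OF @seq:1")   # historical state
print(cur.fetchall())                              # [(1, 'original')]

conn.close()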

🎯 Perfect For

  • Debugging Production Issues: "What was the state when the bug occurred?"
  • Compliance & Auditing: Complete audit trail built-in, no extra work
  • Data Recovery: Accidentally deleted data? It's still there!
  • Analytics: Track how metrics changed over time
  • Testing: Reset to any point, perfect for test scenarios
  • Development: Branch your database like Git

✨ Core Features

SQL:2011 Temporal Queries (Native Support)

  • FOR SYSTEM_TIME AS OF: Query data at any point in time
  • FOR SYSTEM_TIME BETWEEN: Get all versions in a time range
  • FOR SYSTEM_TIME FROM...TO: Exclusive range queries
  • FOR SYSTEM_TIME ALL: Complete history of changes
  • System-versioned tables: Automatic history tracking

Data Model & Storage

  • Append-only storage: Immutable events preserve complete history
  • Time travel queries: Standard SQL:2011 temporal syntax
  • ACID transactions: Full transaction support with isolation levels
  • Secondary indexes: B-tree indexes for fast lookups
  • Snapshots & compaction: Optimized performance with compression

Planned Enterprise Features (Not Yet Functional)

The following features have been architecturally designed but are not yet operational:

  • Authentication & Authorization: Planned RBAC with user management (code incomplete)
  • Encryption at Rest: Designed AES-256-GCM encryption (not functional)
  • Distributed Consensus: Raft protocol structure (requires debugging)
  • Advanced Transactions: MVCC design for isolation levels (partial implementation)
  • Enterprise Backup: Backup system architecture (compilation errors)
  • Security Monitoring: Monitoring framework (not integrated)

Working Infrastructure

  • Connection pooling: Thread-safe connection pool with RAII guards
  • Health checks: Basic metrics endpoint
  • Rate limiting: Token bucket algorithm for connection limits (sketched below)
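
The rate limiter above is the classic token-bucket scheme: tokens accumulate at a fixed rate up to a burst capacity, and each new connection spends one. DriftDB's implementation is in Rust; the Python below is only an illustrative sketch of the algorithm, not DriftDB code:

import time

class TokenBucket:
    """Admit up to `rate` operations per second, with bursts up to `capacity`."""

    def __init__(self, rate: float, capacity: float):
        self.rate = rate          # tokens added per second
        self.capacity = capacity  # maximum stored tokens (burst size)
        self.tokens = capacity
        self.last = time.monotonic()

    def try_acquire(self, cost: float = 1.0) -> bool:
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True   # under the limit: admit the connection
        return False      # over the limit: reject or apply backpressure

limiter = TokenBucket(rate=100, capacity=20)   # ~100 connections/sec, bursts of 20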

Query Features (Partially Working)

  • B-tree indexes: Secondary indexes for fast lookups (functional)
  • Basic query planner: Simple execution plans (working)
  • Prepared statements: Statement caching (functional)

Planned Query Optimization (Design Phase)

  • Advanced Query Optimizer: Cost-based optimization design (not implemented)
  • Join Strategies: Theoretical star schema optimization (code incomplete)
  • Subquery Optimization: Flattening algorithms designed (not functional)
  • Materialized Views: Architecture planned (not implemented)
  • Parallel Execution: Threading design (not operational)

Quick Start

Docker Installation (Recommended)

# Quick start with Docker
git clone https://github.com/driftdb/driftdb
cd driftdb
./scripts/docker-quickstart.sh

# Connect to DriftDB
psql -h localhost -p 5433 -d driftdb -U driftdb
# Password: driftdb

Manual Installation

# Clone and build from source
git clone https://github.com/driftdb/driftdb
cd driftdb
make build

# Or install with cargo
cargo install driftdb-cli driftdb-server

60-second demo

# Run the full demo (creates sample data and runs queries)
make demo

PostgreSQL-Compatible Server

DriftDB now includes a PostgreSQL wire protocol server, allowing you to connect with any PostgreSQL client:

# Start the server
./target/release/driftdb-server

# Connect with psql
psql -h 127.0.0.1 -p 5433 -d driftdb -U driftdb

# Connect with any PostgreSQL driver
postgresql://driftdb:driftdb@127.0.0.1:5433/driftdb

The server supports:

  • PostgreSQL wire protocol v3
  • SQL queries with automatic temporal tracking
  • Authentication (cleartext and MD5)
  • Integration with existing PostgreSQL tools and ORMs (SQLAlchemy example below)
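
For example, SQLAlchemy (one of the drivers verified above) only needs a standard PostgreSQL URL. A minimal sketch, assuming the default driftdb/driftdb credentials and a users table like the one created in the CLI example below:

from sqlalchemy import create_engine, text

# The stock PostgreSQL dialect works; only the port differs from a typical setup.
engine = create_engine("postgresql+psycopg2://driftdb:driftdb@127.0.0.1:5433/driftdb")

with engine.connect() as conn:
    for row in conn.execute(text("SELECT * FROM users WHERE status = 'active'")):
        print(row)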

Manual CLI usage

-- Initialize and connect to database
driftdb init ./mydata
driftdb sql ./mydata

-- Create a temporal table (SQL:2011)
CREATE TABLE users (
    id INTEGER PRIMARY KEY,
    email VARCHAR(255),
    status VARCHAR(20),
    created_at TIMESTAMP
) WITH SYSTEM VERSIONING;

-- Insert data
INSERT INTO users VALUES (1, '[email protected]', 'active', CURRENT_TIMESTAMP);

-- Standard SQL queries with WHERE clauses
SELECT * FROM users WHERE status = 'active';
SELECT * FROM users WHERE id > 100 AND status != 'deleted';

-- UPDATE with conditions
UPDATE users SET status = 'inactive' WHERE last_login < '2024-01-01';

-- DELETE with conditions (soft delete preserves history)
DELETE FROM users WHERE status = 'inactive' AND created_at < '2023-01-01';

-- Time travel query (SQL:2011)
SELECT * FROM users
FOR SYSTEM_TIME AS OF '2024-01-15T10:00:00Z'
WHERE id = 1;

-- Query all historical versions
SELECT * FROM users
FOR SYSTEM_TIME ALL
WHERE id = 1;

-- Query range of time
SELECT * FROM users
FOR SYSTEM_TIME BETWEEN '2024-01-01' AND '2024-01-31'
WHERE status = 'active';

-- Advanced SQL Features (v0.6.0)
-- Column selection
SELECT name, email FROM users WHERE status = 'active';

-- Aggregation functions
SELECT COUNT(*) FROM users;
SELECT COUNT(email), AVG(age) FROM users WHERE status = 'active';
SELECT MIN(created_at), MAX(created_at) FROM users;

-- GROUP BY and aggregations
SELECT status, COUNT(*) FROM users GROUP BY status;
SELECT department, AVG(salary), MIN(salary), MAX(salary)
FROM employees GROUP BY department;

-- HAVING clause for group filtering
SELECT department, AVG(salary) FROM employees
GROUP BY department HAVING AVG(salary) > 50000;

-- ORDER BY and LIMIT
SELECT * FROM users ORDER BY created_at DESC LIMIT 10;
SELECT name, email FROM users WHERE status = 'active'
ORDER BY name ASC LIMIT 5;

-- Complex queries with all features
SELECT department, COUNT(*) as emp_count, AVG(salary) as avg_salary
FROM employees
WHERE hire_date >= '2023-01-01'
GROUP BY department
HAVING COUNT(*) >= 3
ORDER BY AVG(salary) DESC
LIMIT 5;

SQL:2011 Temporal Syntax

Standard Temporal Queries

-- AS OF: Query at a specific point in time
SELECT * FROM orders
FOR SYSTEM_TIME AS OF '2024-01-15T10:30:00Z'
WHERE customer_id = 123;

-- BETWEEN: All versions in a time range (inclusive)
SELECT * FROM accounts
FOR SYSTEM_TIME BETWEEN '2024-01-01' AND '2024-01-31'
WHERE balance > 10000;

-- FROM...TO: Range query (exclusive end)
SELECT * FROM inventory
FOR SYSTEM_TIME FROM '2024-01-01' TO '2024-02-01'
WHERE product_id = 'ABC-123';

-- ALL: Complete history
SELECT * FROM audit_log
FOR SYSTEM_TIME ALL
WHERE action = 'DELETE';

Creating Temporal Tables

-- Create table with system versioning
CREATE TABLE orders (pk=id, INDEX(status, customer_id))

-- Insert full document
INSERT INTO orders {"id": "order1", "status": "pending", "amount": 100}

-- Partial update
PATCH orders KEY "order1" SET {"status": "paid"}

-- Soft delete (data remains for audit)
SOFT DELETE FROM orders KEY "order1"

Transactions

-- Start a transaction
BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ

-- Multiple operations in transaction
INSERT INTO orders {"id": "order2", "amount": 200}
PATCH orders KEY "order1" SET {"status": "shipped"}

-- Commit or rollback
COMMIT
-- or
ROLLBACK

Time Travel Queries

-- Query historical state by timestamp
SELECT * FROM orders WHERE status="paid" AS OF "2025-01-01T00:00:00Z"

-- Query by sequence number
SELECT * FROM orders WHERE customer_id="cust1" AS OF "@seq:1000"

-- Show complete history of a record
SHOW DRIFT orders KEY "order1"

Schema Migrations

-- Add a new column with default value
ALTER TABLE orders ADD COLUMN priority DEFAULT "normal"

-- Add an index
CREATE INDEX ON orders(created_at)

-- Drop a column (requires downtime)
ALTER TABLE orders DROP COLUMN legacy_field

Maintenance

-- Create snapshot for performance
SNAPSHOT orders

-- Compact storage
COMPACT orders

-- Backup database
BACKUP TO './backups/2024-01-15'

-- Show table statistics
ANALYZE TABLE orders

Architecture

Storage Layout

data/
  tables/<table>/
    schema.yaml           # Table schema definition
    segments/            # Append-only event logs with CRC32
      00000001.seg
      00000002.seg
    snapshots/           # Compressed materialized states
      00000100.snap
    indexes/             # Secondary B-tree indexes
      status.idx
      customer_id.idx
    meta.json           # Table metadata
  wal/                   # Write-ahead log for durability
    wal.log
    wal.log.1            # Rotated WAL files
  migrations/            # Schema migrations
    history.json
    pending/
  backups/               # Backup snapshots

Event Types

  • INSERT: Add new row with full document
  • PATCH: Partial update by primary key
  • SOFT_DELETE: Mark row as deleted (audit trail preserved)

Segment Format

[u32 length][u32 crc32][varint seq][u64 unix_ms][u8 event_type][msgpack payload]
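
To make that layout concrete, here is a rough decoder sketch in Python. It is illustrative only: the field order comes from the spec above, but the little-endian byte order, the exact bytes covered by the CRC, and the varint encoding are assumptions, and read_frame / read_varint are hypothetical helpers rather than part of DriftDB (the msgpack package is a third-party dependency):

import struct
import zlib

import msgpack  # third-party: pip install msgpack

def read_varint(buf, pos):
    # LEB128-style unsigned varint (assumed encoding).
    result, shift = 0, 0
    while True:
        b = buf[pos]
        pos += 1
        result |= (b & 0x7F) << shift
        if not b & 0x80:
            return result, pos
        shift += 7

def read_frame(f):
    header = f.read(8)
    if len(header) < 8:
        return None                        # end of segment
    length, crc = struct.unpack("<II", header)
    body = f.read(length)                  # assumes `length` counts the bytes after the header words
    assert zlib.crc32(body) == crc         # CRC32 integrity check (assumed to cover the body)
    seq, pos = read_varint(body, 0)
    unix_ms, = struct.unpack_from("<Q", body, pos)
    pos += 8
    event_type = body[pos]                 # INSERT / PATCH / SOFT_DELETE tag
    payload = msgpack.unpackb(body[pos + 1:])
    return seq, unix_ms, event_type, payload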

Safety & Reliability

Data Integrity

  • Write-Ahead Logging: All writes go through WAL first for durability
  • CRC32 verification: Every frame is checksummed
  • Atomic operations: fsync on critical boundaries
  • Crash recovery: Automatic WAL replay on startup

Concurrency Control

  • ACID transactions: Serializable isolation available
  • MVCC: Multi-version concurrency control for readers
  • Deadlock detection: Automatic detection and resolution
  • Connection pooling: Fair scheduling with backpressure

Security

  • Encryption at rest: AES-256-GCM for stored data (designed; not yet functional, see Planned Enterprise Features above)
  • Encryption in transit: TLS 1.3 for network communication
  • Key rotation: Automatic key rotation support
  • Rate limiting: DoS protection with per-client limits

🎯 Use Cases

Compliance & Audit

-- "Prove we had user consent when we sent that email"
SELECT consent_status, consent_timestamp
FROM users
FOR SYSTEM_TIME AS OF '2024-01-15T14:30:00Z'
WHERE email = '[email protected]';

Debugging Production Issues

-- "What was the state when the error occurred?"
SELECT * FROM shopping_carts
FOR SYSTEM_TIME AS OF '2024-01-15T09:45:00Z'
WHERE session_id = 'xyz-789';

Analytics & Reporting

-- "Show me how this metric changed over time"
SELECT DATE(SYSTEM_TIME_START) as date, COUNT(*) as daily_users
FROM users
FOR SYSTEM_TIME ALL
WHERE status = 'active'
GROUP BY DATE(SYSTEM_TIME_START);

Data Recovery

-- "Restore accidentally deleted data"
INSERT INTO users
SELECT * FROM users
FOR SYSTEM_TIME AS OF '2024-01-15T08:00:00Z'
WHERE id NOT IN (SELECT id FROM users);

Comparison with Other Databases

Feature            DriftDB          PostgreSQL     MySQL      Oracle         SQL Server
SQL:2011 Temporal  ✅ Native        ⚠️ Extension   ❌         💰 Flashback   ⚠️ Complex
Storage Overhead   ✅ Low (events)  ❌ High        ❌ High    ❌ High        ❌ High
Query Past Data    ✅ Simple SQL    ❌ Complex     ❌         💰 Extra cost  ⚠️ Complex
Audit Trail        ✅ Automatic     ❌ Manual      ❌ Manual  💰             ⚠️ Manual
Open Source        ✅ MIT           ✅             ✅         ❌             ❌

Testing

DriftDB includes a comprehensive test suite with both Rust and Python tests organized into different categories.

Quick Start

# Run all tests (Rust + Python)
make test

# Run quick tests only (no slow/performance tests)
make test-quick

# Run specific test categories
make test-unit        # Unit tests only
make test-integration # Integration tests
make test-sql        # SQL compatibility tests
make test-python     # All Python tests

Test Organization

The test suite is organized into the following categories:

tests/
├── unit/           # Fast, isolated unit tests
├── integration/    # Cross-component integration tests
├── sql/           # SQL standard compatibility tests
├── performance/   # Performance benchmarks
├── stress/        # Load and stress tests
├── legacy/        # Migrated from root directory
└── utils/         # Shared test utilities

Running Specific Tests

# Run a specific test file
python tests/unit/test_basic_operations.py

# Run tests matching a pattern
pytest tests/ -k "constraint"

# Run with verbose output
python tests/run_all_tests.py --verbose

# Generate coverage report
make test-coverage

Writing Tests

Tests should extend the DriftDBTestCase base class which provides:

  • Database connection management
  • Test table creation/cleanup
  • Assertion helpers for query validation
  • Transaction and savepoint support

Example test:

from tests.utils import DriftDBTestCase

class TestNewFeature(DriftDBTestCase):
    def test_feature(self):
        self.create_test_table()
        self.assert_query_succeeds("INSERT INTO test_table ...")
        result = self.execute_query("SELECT * FROM test_table")
        self.assert_result_count(result, 1)

Development

# Run tests
make test

# Run benchmarks
make bench

# Format code
make fmt

# Run linter
make clippy

# Full CI checks
make ci

Performance

Benchmark Results

Measured on MacBook Air M3 (2024) - 8-core (4P+4E), 16GB unified memory, NVMe SSD:

Insert Operations:

Single insert:     4.3 ms
10 inserts:       35.5 ms  (~3.5 ms each)
100 inserts:     300 ms    (~3.0 ms each)

Query Operations:

SELECT 100 rows:    101 µs
SELECT 1000 rows:   644 µs
Full table scan:    719 µs  (1000 rows)

Update/Delete Operations:

Single update:      6.7 ms
Single delete:      6.5 ms

Time Travel Queries:

Historical query:  131 µs  (AS OF @seq:N)

Throughput:

  • Inserts: ~230 inserts/sec (single-threaded)
  • Queries: ~9,800 queries/sec (100-row selects)
  • Time Travel: ~7,600 historical queries/sec

Run benchmarks yourself: cargo bench --bench simple_benchmarks

See benchmarks/HARDWARE.md for hardware specs and benchmarks/baselines/ for detailed results.

Optimization Features

  • Query optimizer: Cost-based planning with statistics (in design; the current planner is basic, see Planned Query Optimization above)
  • Index selection: Automatic index usage for queries
  • Streaming APIs: Memory-bounded operations
  • Connection pooling: Reduced connection overhead
  • Plan caching: Reuse of optimized query plans

Storage Efficiency

  • Zstd compression: For snapshots and backups
  • MessagePack serialization: Compact binary format
  • Incremental snapshots: Only changed data
  • Compaction: Automatic segment consolidation
  • B-tree indexes: O(log n) lookup performance

Scalability

  • Configurable limits: Memory, connections, request rates
  • Backpressure: Automatic load shedding
  • Batch operations: Efficient bulk inserts
  • Parallel processing: Multi-threaded where safe

License

MIT

Production Readiness

⚠️ Alpha Stage - Development/Testing Use

DriftDB is currently in alpha. Recent releases have brought significant improvements, but it still requires additional testing and validation.

Current Status:

  • Core functionality implemented and working well
  • Time travel queries fully functional with AS OF @seq:N
  • PostgreSQL wire protocol fully implemented
  • SQL support for SELECT, INSERT, UPDATE, DELETE with WHERE clauses
  • Replication framework in place (tests fixed)
  • WAL implementation corrected

Safe for:

  • Development and experimentation
  • Learning about database internals
  • Proof of concept projects
  • Testing time-travel database concepts

NOT safe for:

  • Production workloads
  • Data you cannot afford to lose
  • High-availability requirements
  • Security-sensitive applications

Feature Maturity

Component              Status          Development Ready
Core Storage Engine    🟡 Alpha        For Testing
SQL Execution          🟢 Working      Yes
Time Travel Queries    🟢 Working      Yes
PostgreSQL Protocol    🟢 Working      Yes
WAL & Crash Recovery   🟡 Beta         Almost
ACID Transactions      🟡 Beta         Almost
Event Sourcing         🟢 Working      Yes
WHERE Clause Support   🟢 Working      Yes
UPDATE/DELETE          🟢 Working      Yes
Replication Framework  🟡 Beta         Almost
Schema Migrations      🟡 Beta         Almost
Connection Pooling     🔶 Alpha        No
Monitoring & Metrics   🔶 Placeholder  No
Admin Tools            🔶 Alpha        No

Roadmap

v0.1.0 (Alpha - Complete)

  • ✅ Basic storage engine
  • ✅ Simple event sourcing
  • ✅ Basic time-travel queries
  • ✅ CLI interface
  • ✅ Experimental features added

v0.2.0 (SQL:2011 Support - Complete)

  • ✅ SQL:2011 temporal query syntax
  • ✅ FOR SYSTEM_TIME support
  • ✅ Standard SQL parser integration
  • ✅ Temporal table DDL

v0.3.0 (PostgreSQL Compatibility - Complete)

  • ✅ PostgreSQL wire protocol v3
  • ✅ Connect with any PostgreSQL client
  • ✅ Basic SQL execution
  • ✅ Time-travel through PostgreSQL

v0.4.0 (Full SQL Support - Complete)

  • ✅ WHERE clause with multiple operators
  • ✅ UPDATE statement with conditions
  • ✅ DELETE statement with conditions
  • ✅ AND logic for multiple conditions
  • ✅ Soft deletes preserve history

v0.5.0 (Time Travel & Fixes - Complete)

  • ✅ Time travel queries with AS OF @seq:N
  • ✅ Fixed replication integration tests
  • ✅ Corrected WAL implementation
  • ✅ Updated documentation accuracy
  • ✅ PostgreSQL protocol improvements

v0.6.0 (Current - Advanced SQL Features - Complete)

  • ✅ Aggregations (COUNT, SUM, AVG, MIN, MAX)
  • ✅ GROUP BY and HAVING clauses
  • ✅ ORDER BY and LIMIT clauses
  • ✅ Column selection (SELECT column1, column2)
  • ✅ JOIN operations (INNER, LEFT, RIGHT, FULL OUTER, CROSS)
  • ✅ Subqueries (IN, EXISTS, NOT IN, NOT EXISTS, scalar subqueries)
  • ✅ ROLLBACK support with savepoints
  • ✅ Common Table Expressions (CTEs) including RECURSIVE

Next release (Release Candidate)

  • 📋 Production monitoring
  • 📋 Proper replication implementation
  • 📋 Security hardening
  • 📋 Stress testing
  • 📋 Cloud deployment support

v1.0 (Future Production Consideration)

  • 📋 Extensive testing and validation
  • 📋 Full production documentation
  • 📋 Security audits and performance testing
  • 📋 Performance guarantees
  • 📋 High availability
  • 📋 Enterprise features

v2.0 (Future Vision)

  • 📋 Multi-master replication
  • 📋 Sharding support
  • 📋 Full SQL compatibility
  • 📋 Change data capture (CDC)
  • 📋 GraphQL API
