PushdownDB: Accelerating a DBMS using S3 Computation

Yu, Xiangyao; Youill, Matt; Woicik, Matthew; Ghanem, Abdurrahman; Serafini, Marco; Aboulnaga, Ashraf; Stonebraker, Michael

Computer Science > Databases

arXiv:2002.05837 (cs)

[Submitted on 14 Feb 2020]

Title:PushdownDB: Accelerating a DBMS using S3 Computation

Authors:Xiangyao Yu, Matt Youill, Matthew Woicik, Abdurrahman Ghanem, Marco Serafini, Ashraf Aboulnaga, Michael Stonebraker

View PDF

Abstract:This paper studies the effectiveness of pushing parts of DBMS analytics queries into the Simple Storage Service (S3) engine of Amazon Web Services (AWS), using a recently released capability called S3 Select. We show that some DBMS primitives (filter, projection, aggregation) can always be cost-effectively moved into S3. Other more complex operations (join, top-K, group-by) require reimplementation to take advantage of S3 Select and are often candidates for pushdown. We demonstrate these capabilities through experimentation using a new DBMS that we developed, PushdownDB. Experimentation with a collection of queries including TPC-H queries shows that PushdownDB is on average 30% cheaper and 6.7X faster than a baseline that does not use S3 Select.

Subjects:	Databases (cs.DB)
Cite as:	arXiv:2002.05837 [cs.DB]
	(or arXiv:2002.05837v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2002.05837

Submission history

From: Xiangyao Yu [view email]
[v1] Fri, 14 Feb 2020 01:23:54 UTC (440 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.DB

< prev | next >

new | recent | 2020-02

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xiangyao Yu
Marco Serafini
Ashraf Aboulnaga
Michael Stonebraker

export BibTeX citation

Computer Science > Databases

Title:PushdownDB: Accelerating a DBMS using S3 Computation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:PushdownDB: Accelerating a DBMS using S3 Computation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators