Buy New
-
Ships from: Amazon.com Sold by: Amazon.com
Used - Good
-
Ships from: World of Books (previously glenthebookseller) Sold by: World of Books (previously glenthebookseller)
Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.
Read instantly on your browser with Kindle for Web.
Using your mobile phone camera - scan the code below and download the Kindle app.
Follow the authors
OK
Streaming Systems: The What, Where, When, and How of Large-Scale Data Processing
Purchase options and add-ons
Streaming data is a big deal in big data these days. As more and more businesses seek to tame the massive unbounded data sets that pervade our world, streaming systems have finally reached a level of maturity sufficient for mainstream adoption. With this practical guide, data engineers, data scientists, and developers will learn how to work with streaming data in a conceptual and platform-agnostic way.
Expanded from Tyler Akidau’s popular blog posts "Streaming 101" and "Streaming 102", this book takes you from an introductory level to a nuanced understanding of the what, where, when, and how of processing real-time data streams. You’ll also dive deep into watermarks and exactly-once processing with co-authors Slava Chernyak and Reuven Lax.
You’ll explore:
- How streaming and batch data processing patterns compare
- The core principles and concepts behind robust out-of-order data processing
- How watermarks track progress and completeness in infinite datasets
- How exactly-once data processing techniques ensure correctness
- How the concepts of streams and tables form the foundations of both batch and streaming data processing
- The practical motivations behind a powerful persistent state mechanism, driven by a real-world example
- How time-varying relations provide a link between stream processing and the world of SQL and relational algebra
- ISBN-101491983876
- ISBN-13978-1491983874
- Edition1st
- PublisherO'Reilly Media
- Publication dateAugust 28, 2018
- LanguageEnglish
- Dimensions7.25 x 0.75 x 9.5 inches
- Print length349 pages
![]() |
Customers who viewed this item also viewed
Customers also bought or read
- Database Internals: A Deep Dive into How Distributed Data Systems Work#1 Best SellerManagement Information Systems
PaperbackFREE delivery Sun, Jun 28 - Fundamentals of Data Engineering: Plan and Build Robust Data Systems
PaperbackFREE delivery Sun, Jun 28 - Stream Processing with Apache Flink: Fundamentals, Implementation, and Operation of Streaming Applications
PaperbackFREE delivery Sun, Jun 28 - Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale
PaperbackFREE delivery Sun, Jun 28 - The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling
PaperbackFREE delivery Sun, Jun 28 - Designing Distributed Systems: Patterns and Paradigms for Scalable, Reliable Systems Using Kubernetes
PaperbackFREE delivery Sun, Jun 28 - Data Engineering Design Patterns: Recipes for Solving the Most Common Data Engineering Problems
PaperbackFREE delivery Sun, Jun 28 - The Staff Engineer's Path: A Guide for Individual Contributors Navigating Growth and Change
PaperbackDelivery Sun, Jun 28 - Understanding Distributed Systems, Second Edition: What every developer should know about large distributed applications
PaperbackFREE delivery Sun, Jun 28 - Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming
PaperbackFREE delivery Sun, Jun 28 - Operating Systems: Three Easy Pieces#1 Best SellerComputer Operating Systems Theory
PaperbackDelivery Sun, Jun 28 - Data Pipelines Pocket Reference: Moving and Processing Data for Analytics
PaperbackDelivery Sun, Jun 28 - Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake
PaperbackFREE delivery Sun, Jun 28 - Systems Performance (Addison-Wesley Professional Computing Series)#1 Best SellerComputer Performance Optimization
PaperbackFREE delivery Jul 14 - 17 - Machine Learning Design Patterns: Solutions to Common Challenges in Data Preparation, Model Building, and MLOps
PaperbackFREE delivery Sun, Jun 28 - Building Medallion Architectures: Designing with Delta Lake and Spark
PaperbackFREE delivery Sun, Jun 28 - Foundations of Scalable Systems: Designing Distributed Architectures
PaperbackFREE delivery Sun, Jun 28 - Deciphering Data Architectures: Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh
PaperbackFREE delivery Sun, Jun 28
Editorial Reviews
About the Author
Slava Chernyak is a senior software engineer at Google Seattle. Slava spent over five years working on Google’s internal massive-scale streaming data processing systems and has since become involved with designing and building Windmill, Google Cloud Dataflow's next-generation streaming backend, from the ground up. Slava is passionate about making massive-scale stream processing available and useful to a broader audience. When he is not working on streaming systems, Slava is out enjoying the natural beauty of the Pacific Northwest.
Reuven Lax is a senior staff software engineer at Google Seattle, and has spent the past nine years helping to shape Google's data processing and analysis strategy. For much of that time he has focused on Google's low-latency, streaming data processing efforts, first as a long-time member and lead of the MillWheel team, and more recently founding and leading the team responsible for Windmill, the next-generation stream processing engine powering Google Cloud Dataflow. He's very excited to bring Google's data-processing experience to the world at large, and proud to have been a part of publishing both the MillWheel paper in 2013 and the Dataflow Model paper in 2015. When not at work, Reuven enjoys swing dancing, rock climbing, and exploring new parts of the world.
Product details
- Publisher : O'Reilly Media
- Publication date : August 28, 2018
- Edition : 1st
- Language : English
- Print length : 349 pages
- ISBN-10 : 1491983876
- ISBN-13 : 978-1491983874
- Item Weight : 1.26 pounds
- Dimensions : 7.25 x 0.75 x 9.5 inches
- Best Sellers Rank: #169,066 in Books (See Top 100 in Books)
- #52 in Data Processing
- #65 in Software Design & Engineering
- #159 in Software Development (Books)
About the authors

Discover more of the author’s books, see similar authors, read book recommendations and more.

Discover more of the author’s books, see similar authors, read book recommendations and more.























